Machine Learning Engineer - Large Language Models & Generative AI Inference

$147K - $272K Cupertino, CA, US Mid Level LLM Engineer

Interested in this LLM Engineer role at Apple?

Apply Now →

About This Role

AI job market dashboard showing open roles by category

The Intelligence Platform team empowers clients across Apple’s operating systems with a high quality user\-centric search and data platform, and the primary inference platform that enable next generation user experiences for Apple Intelligence. We are in search of an accomplished and driven Machine Learning Engineer who has a robust understanding of Large Language Models, Generative AI and high\-performance systems computing. Your primary role will involve working with our cross\-functional teams that build foundation models, as well as our client teams that want to run inference using these models and continue the development of our inference stack. By contributing to our team, you’ll play an integral part in developing Siri, Photos, Music, and various other services, leaving a significant footprint on the evolution of our AI platforms. Join us in our exciting journey as we push the frontiers of Machine Learning and Generative AI. Your expertise and contributions will be invaluable in shaping the future of our Intelligence Platform.

Description

As a Machine Learning Engineer on the Apple Intelligence Platform Inference Tean, you’ll join a phenomenal team of top\-performing engineers and and will be entrusted with a range of responsibilities. Your tasks will include: Leading the exploration and application of Large Language Models and Generative AI, venturing into new areas within these fields. Translating the latest research into high\-performing systems and a model serving stack that can be practically applied to enhance user experiences. Contribute to setting the team’s strategic direction, cultivating an environment that encourages innovation and professional growth. Collaborating with various teams to develop and implement evolving requirements of our clients on the GenAI inference stack, ensuring performance optimization and alignment with broader business goals.

Preferred Qualifications

An advanced degree (Master’s or Ph.D.) in Computer Science, Artificial Intelligence, Machine Learning, or a related field is required

Ongoing professional development in Machine Learning and Artificial Intelligence domains is expected.

Minimum Qualifications

In\-depth experience in Machine Learning, with a particular emphasis on Large Language Models (LLMs) and Generative AI.

Published research in the field of Machine Learning or AI is highly desirable, indicating an ability to not only understand but also contribute to cutting\-edge research.

Proven ability to comprehend, interpret, and apply cutting\-edge research into tangible applications.

Proven problem\-solving and leadership abilities, with the capacity to steer the team’s research and practical applications in a collaborative and fast\-paced environment.

Pay \& Benefits

At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $147,400 and $272,100, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become an Apple shareholder through participation in Apple's discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple's Employee Stock Purchase Plan. You'll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses \- including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits

Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

Salary Context

This $147K-$272K range is above the 75th percentile for LLM Engineer roles in our dataset (median: $155K across 6 roles with salary data).

View full LLM Engineer salary data →

Role Details

Company Apple
Title Machine Learning Engineer - Large Language Models & Generative AI Inference
Location Cupertino, CA, US
Category LLM Engineer
Experience Mid Level
Salary $147K - $272K
Remote No

About This Role

LLM Engineers specialize in building applications powered by large language models. They design RAG systems, fine-tune models, build agent frameworks, and optimize inference pipelines for cost and latency. This is the role that didn't exist three years ago and now has thousands of open positions.

The scope is broad. You might be building a customer support chatbot that needs to pull from a knowledge base of 50,000 documents, or designing an agent that can navigate a company's internal tools to complete multi-step tasks. The common thread is taking a foundation model and making it do something useful, reliably, at scale, without bankrupting the company on API costs.

Across the 4,133 AI roles we're tracking, LLM Engineer positions make up 0% of the market. At Apple, this role fits into their broader AI and engineering organization.

LLM Engineer is one of the fastest-growing AI job titles. Every company building AI-powered products needs people who understand the full stack: from embedding models to vector stores to inference optimization. The supply of experienced LLM engineers is thin because the field is so new, which keeps compensation high and demand strong.

What the Work Looks Like

A typical week includes: building and testing RAG pipelines (chunking strategies, embedding models, retrieval evaluation), debugging why the agent took a wrong action path, optimizing inference costs (caching, batching, model selection), and working with the product team on new LLM-powered features. You'll context-switch between deep technical work and cross-functional collaboration.

LLM Engineer is one of the fastest-growing AI job titles. Every company building AI-powered products needs people who understand the full stack: from embedding models to vector stores to inference optimization. The supply of experienced LLM engineers is thin because the field is so new, which keeps compensation high and demand strong.

Skills in Demand for This Role

Python (51% of roles) Aws (32% of roles) Azure (24% of roles) Rag (22% of roles) Gcp (20% of roles) Pytorch (16% of roles) Prompt Engineering (15% of roles) Claude (14% of roles)

RAG and vector databases are the most common requirements. Expect to work with LangChain or LlamaIndex, embedding models, and at least one vector store (Pinecone, Weaviate, Chroma). Python is non-negotiable. Understanding the cost/latency/quality tradeoffs between different model providers and architectures is what separates senior from junior engineers.

Fine-tuning experience is valuable for specific use cases but most production LLM work is RAG-based. Agent frameworks (LangGraph, CrewAI, custom orchestration) are increasingly important as companies move beyond simple chat interfaces. Evaluation and observability tools (LangSmith, Arize, custom dashboards) are essential for production deployments.

Look for roles that specify the production stack, mention specific use cases, and talk about cost optimization. Companies that understand LLM engineering will mention evaluation methodology, latency requirements, and scale targets. Vague 'build AI features' postings often mean they haven't figured out their architecture yet.

Compensation Benchmarks

LLM Engineer roles pay a median of $162,240 based on 11 positions with disclosed compensation. Mid-level AI roles across all categories have a median of $165,778. This role's midpoint ($209K) sits 29% above the category median. Disclosed range: $147K to $272K.

Across all AI roles, the market median is $200,700. Top-quartile compensation starts at $254,000. The 90th percentile reaches $307,500. For comparison, the highest-paying categories include AI Safety ($274,200) and AI Engineering Manager ($268,700). By seniority level: Entry: $97,760; Mid: $165,778; Senior: $227,400; Director: $250,000; VP: $250,000.

Apple AI Hiring

Apple has 62 open AI roles right now. They're hiring across AI/ML Engineer, LLM Engineer, AI Product Manager, AI Software Engineer. Positions span Cupertino, CA, US, San Diego, CA, US, Seattle, WA, US. Compensation range: $190K - $487K.

Location Context

Across all AI roles, 14% (583 positions) offer remote work, while 3,532 require on-site attendance. Top AI hiring metros: New York (2,760 roles, $211,000 median); San Francisco (2,258 roles, $253,000 median); Los Angeles (1,841 roles, $195,000 median).

Career Path

Common paths into LLM Engineer roles include Software Engineer, ML Engineer, Data Engineer.

From here, career progression typically leads toward AI Architect, Principal Engineer, AI Engineering Manager.

The fastest path is through software engineering. If you can build production systems and you understand LLM capabilities and limitations, you're already qualified for most roles. Build a portfolio project that demonstrates RAG implementation, evaluation, and cost optimization. Open-source contributions to LLM frameworks are strong signals to hiring managers.

What to Expect in Interviews

Technical screens cover RAG architecture design, embedding model selection, chunking strategies, and retrieval evaluation. Expect questions about cost optimization: how you'd reduce inference costs by 50% without degrading quality. System design rounds often present scenarios like 'design a customer support chatbot that can access 100K documents' and evaluate your understanding of the full stack from embedding to serving.

When evaluating opportunities: Look for roles that specify the production stack, mention specific use cases, and talk about cost optimization. Companies that understand LLM engineering will mention evaluation methodology, latency requirements, and scale targets. Vague 'build AI features' postings often mean they haven't figured out their architecture yet.

AI Hiring Overview

The AI job market has 4,133 open positions tracked in our dataset. By seniority: 106 entry-level, 1,901 mid-level, 1,663 senior, and 463 leadership roles (Director, VP, C-Level). Remote roles make up 14% of the market (583 positions). The remaining 3,532 roles require on-site or hybrid attendance.

The market median for AI roles is $200,700. Top-quartile compensation starts at $254,000. The 90th percentile reaches $307,500. Highest-paying categories: AI Safety ($274,200 median, 57 roles); AI Engineering Manager ($268,700 median, 42 roles); Research Engineer ($260,000 median, 442 roles).

LLM Engineer is one of the fastest-growing AI job titles. Every company building AI-powered products needs people who understand the full stack: from embedding models to vector stores to inference optimization. The supply of experienced LLM engineers is thin because the field is so new, which keeps compensation high and demand strong.

The AI Job Market Today

The AI job market spans 4,133 open positions across 15 role categories. The largest categories by volume: AI/ML Engineer (2,865), Data Scientist (339), AI Software Engineer (313). These three account for the majority of open positions, though smaller categories often have higher per-role compensation because of specialized skill requirements.

The seniority mix tells a story about where AI teams are in their maturity. Entry-level roles (106) are outnumbered by mid-level (1,901) and senior (1,663) positions, reflecting that most companies are past the 'build a team from scratch' phase and need experienced engineers who can ship production systems. Leadership roles (Director, VP, C-Level) total 463 positions, representing the bottleneck between technical execution and organizational strategy.

Remote work availability sits at 14% of all AI roles (583 positions), with 3,532 requiring on-site or hybrid attendance. The remote share has stabilized after the post-pandemic correction. Senior and specialized roles (Research Scientist, ML Architect) are more likely to be remote-eligible than entry-level positions, partly because experienced hires have more negotiating power and partly because these roles require less hands-on mentorship.

AI compensation is structured in clear tiers. The market median sits at $200,700. Top-quartile roles start at $254,000, and the 90th percentile reaches $307,500. These figures include base salary with disclosed compensation. Total compensation (including equity, bonuses, and sign-on) runs 20-40% higher at companies that offer those components.

Category matters for compensation. AI Safety roles lead at $274,200 median, while Prompt Engineer roles sit at $140,000. The spread between highest and lowest-paying categories reflects the premium on specialized technical skills versus broader analytical roles.

The most in-demand skills across all AI postings: Python (2,128 postings), Aws (1,324 postings), Azure (1,003 postings), Rag (916 postings), Gcp (817 postings), Pytorch (655 postings), Prompt Engineering (639 postings), Claude (571 postings). Python dominates, appearing in the vast majority of role descriptions regardless of category. Cloud platform experience (AWS, GCP, Azure) is the second most common requirement. The newer entrants to the top skills list (RAG, vector databases, LLM APIs) reflect the shift from traditional ML toward generative AI applications.

Frequently Asked Questions

Based on 11 roles with disclosed compensation, the median salary for LLM Engineer positions is $162,240. Actual compensation varies by seniority, location, and company stage.
RAG and vector databases are the most common requirements. Expect to work with LangChain or LlamaIndex, embedding models, and at least one vector store (Pinecone, Weaviate, Chroma). Python is non-negotiable. Understanding the cost/latency/quality tradeoffs between different model providers and architectures is what separates senior from junior engineers.
About 14% of the 4,133 AI roles we track offer remote work. Remote availability varies by company and seniority level, with senior and leadership roles more likely to offer location flexibility.
Apple is among the companies actively hiring for AI and ML talent. Check our company profiles for detailed breakdowns of open roles, salary ranges, and hiring trends.
Common next steps from LLM Engineer positions include AI Architect, Principal Engineer, AI Engineering Manager. Progression depends on whether you lean toward technical depth, people management, or product strategy.

Get Weekly AI Career Intelligence

Salary data, skills demand, and market signals from 16,000+ AI job postings. Every Monday.