Interested in this Data Engineer role at McKinsey & Company?
Apply Now →Skills & Technologies
About This Role
Job ID: 109242
Atlanta
Boston
Chicago
New Jersey
New York City
Raleigh
San Francisco
\+ 2 More
Do you want to do work that matters, alongside supportive leaders who will help you grow faster than you ever thought possible? Are you a creative problem\-solver who is energized by challenges? You’ve come to the right place.
YOUR IMPACT
You will build the foundational data infrastructure that powers cutting\-edge AI applications leveraging LLMs, retrieval systems, workflows, and emerging agentic architectures.
You will design and maintain scalable data pipelines, manage secure data environments, and prepare data for AI\-driven systems while collaborating with cross\-functional teams and clients. You’ll tackle real\-world challenges by contributing to the development of next\-generation AI systems and grow as a technologist by working alongside diverse experts across industries.
You will design and build the scalable, reproducible data components essential for machine learning, agentic, and autonomous AI systems. You’ll assess data landscapes, apply data quality fundamentals, and prepare data for AI solutions. Additionally, you’ll learn to translate simple hypotheses into engineered features, manage secure data environments, and contribute to R\&D initiatives focused on innovating and scaling next\-generation AI capabilities.
Your work will help solve some of the most complex and high\-impact challenges facing clients across industries. Collaborating across QuantumBlack, AI by McKinsey and QuantumBlack Labs teams, you’ll help develop innovative AI capabilities and scalable enterprise solutions. You’ll help build robust data foundations for scalable, production\-ready AI systems. Your contributions will directly support the firm’s ability to accelerate AI adoption, solve complex business problems at scale, and enable clients to achieve meaningful, lasting impact.
You’ll be based in one of our North American offices as part of our global Data Engineering community. Working in cross\-functional Agile teams, you’ll collaborate closely with Data Scientists, Machine Learning Engineers, and industry experts to deliver AI solutions. By partnering with clients—from data owners to C\-level executives—you’ll help solve complex problems that drive tangible business value and begin to build your skills as a client\-facing technologist.
You will grow at the forefront of AI, data engineering, and emerging agentic technologies. You’ll develop expertise at the intersection of technology and business by tackling diverse challenges in the evolving AI landscape. Working alongside multidisciplinary teams, you’ll gain a holistic understanding of how data engineering enables advanced AI while collaborating with leading AI and data experts in the industry.
YOUR GROWTH
Driving lasting impact and building long\-term capabilities with our clients is not easy work. You are the kind of person who thrives in a high performance/high reward culture \- doing hard things, picking yourself up when you stumble, and having the resilience to try another way forward.
In return for your drive, determination, and curiosity, we'll provide the resources, mentorship, and opportunities you need to become a stronger leader faster than you ever thought possible. Your colleagues—at all levels—will invest deeply in your development, just as much as they invest in delivering exceptional results for clients. Every day, you'll receive apprenticeship, coaching, and exposure that will accelerate your growth in ways you won’t find anywhere else.
When you join us, you will have:
- Continuous learning: Our learning and apprenticeship culture, backed by structured programs, is all about helping you grow while creating an environment where feedback is clear, actionable, and focused on your development. The real magic happens when you take the input from others to heart and embrace the fast\-paced learning experience, owning your journey.
- A voice that matters: From day one, we value your ideas and contributions. You’ll make a tangible impact by offering innovative ideas and practical solutions, all while upholding our unwavering commitment to ethics and integrity. We not only encourage diverse perspectives, but they are critical in driving us toward the best possible outcomes.
- Global community: With colleagues across 65\+ countries and over 100 different nationalities, our firm’s diversity fuels creativity and helps us come up with the best solutions for our clients. Plus, you’ll have the opportunity to learn from exceptional colleagues with diverse backgrounds and experiences.
- World\-class benefits: On top of a competitive salary (based on your location, experience, and skills), we provide a comprehensive benefits package to enable holistic well\-being for you and your family.
YOUR QUALIFICATIONS AND SKILLS
Degree in Computer Science/Engineering, or equivalent experience
2\+ years of relevant professional experience building and deploying data solutions
Strong proficiency in Python and SQL for data engineering and experience writing robust, production\-grade code, including deploying code across environments
Proven experience building end\-to\-end data pipelines and platforms for Agentic AI, Generative AI, Machine Learning, or Business Intelligence, covering data preparation, embeddings generation, vector search, and system integration using modern frameworks (Spark, dbt, LangChain)
A strong foundation in system design, data storage, and reliability with commonly used data platforms (Databricks, Snowflake, BigQuery, PSQL, etc.) and data engineering tools (e.g., Pandas, Spark, dbt, etc.)
Hands\-on experience with MLOps/LLMOps principles, including CI/CD for data workflows, automated agent evaluation (LangSmith, Opik, Langfuse), and infrastructure as code (Terraform)
Experience building systems with different data formats (structured vs unstructured) and data processing methods (streaming vs batch) and deploying across major cloud platforms (AWS, Azure, GCP)
Exceptional time management in a complex and largely autonomous work environment
Commercial client\-facing or senior stakeholder management experience is beneficial
Experience using coding agents (Cursor, Claude Code, Codex, etc.) is a plus
Strong communication skills, both verbal and written, in English and local office language(s)
Please review the additional requirements regarding essential job functions of McKinsey colleagues.
Our unwavering commitment to integrity drives everything we do, guiding us to always act in the best interests of our clients, our people, and the communities we serve.
Role Details
About This Role
Data Engineers build the pipelines that feed AI models. They design ETL workflows, manage data lakes, and ensure training and inference data is clean, timely, and accessible. Without good data engineering, AI projects fail. It's that simple.
The AI era has expanded the data engineer's scope far beyond batch ETL jobs. You're building real-time embedding pipelines for RAG systems, managing vector databases, ensuring training data quality at scale, and building the infrastructure that lets ML teams iterate on data as fast as they iterate on models. Data quality is the biggest predictor of model quality, and you're the person responsible for it.
Across the 3,823 AI roles we're tracking, Data Engineer positions make up 1% of the market. At McKinsey & Company, this role fits into their broader AI and engineering organization.
Data Engineer demand in AI contexts is strong and growing. Every company building AI needs clean, reliable data pipelines. The shift toward real-time AI applications (chatbots, recommendation engines, agent systems) means data engineering is more critical than ever. Companies are willing to pay premium salaries for data engineers with AI/ML pipeline experience.
What the Work Looks Like
A typical week includes: debugging a data pipeline that's producing stale embeddings for the RAG system, optimizing a Spark job that processes training data, building a data quality monitoring dashboard, meeting with the ML team to understand their next data requirements, and writing dbt models that transform raw event data into ML-ready features. The work is deeply technical and high-impact.
Data Engineer demand in AI contexts is strong and growing. Every company building AI needs clean, reliable data pipelines. The shift toward real-time AI applications (chatbots, recommendation engines, agent systems) means data engineering is more critical than ever. Companies are willing to pay premium salaries for data engineers with AI/ML pipeline experience.
Skills Required
SQL, Python, and distributed systems (Spark, Airflow, dbt) are core. Cloud data platforms (Snowflake, BigQuery, Redshift) are increasingly standard. Many AI-focused roles also want familiarity with vector databases and embedding pipelines. Understanding data modeling, pipeline orchestration, and data quality frameworks covers the essentials.
AI-specific data engineering skills include: building feature stores, managing training data versioning, implementing data lineage tracking, and building real-time embedding pipelines. Experience with streaming systems (Kafka, Flink) is valuable for real-time AI applications. Understanding ML data requirements (balanced datasets, data augmentation, evaluation set construction) makes you much more effective working with ML teams.
Strong postings specify the data stack, mention ML pipeline work, and describe the scale of data you'll be working with. Look for companies that understand the connection between data quality and model quality. Avoid roles that conflate data engineering with data analysis.
Compensation Benchmarks
Data Engineer roles pay a median of $208,300 based on 266 positions with disclosed compensation. Mid-level AI roles across all categories have a median of $165,000.
Across all AI roles, the market median is $200,100. Top-quartile compensation starts at $253,500. The 90th percentile reaches $307,500. For comparison, the highest-paying categories include AI Engineering Manager ($275,000) and AI Safety ($274,200). By seniority level: Entry: $97,880; Mid: $165,000; Senior: $227,400; Director: $247,800; VP: $250,000.
McKinsey & Company AI Hiring
McKinsey & Company has 3 open AI roles right now. They're hiring across AI/ML Engineer, Data Engineer. Based in Atlanta, GA, US.
Location Context
Across all AI roles, 15% (590 positions) offer remote work, while 3,217 require on-site attendance. Top AI hiring metros: New York (2,643 roles, $211,000 median); San Francisco (2,168 roles, $253,000 median); Los Angeles (1,792 roles, $191,580 median).
Career Path
Common paths into Data Engineer roles include Backend Engineer, Database Administrator, Analytics Engineer.
From here, career progression typically leads toward Senior Data Engineer, ML Engineer, Data Platform Lead.
Master SQL and Python first. Then learn a distributed processing framework (Spark or its modern alternatives) and a pipeline orchestrator (Airflow, Dagster, Prefect). Build a portfolio project that demonstrates end-to-end pipeline construction: ingest, transform, validate, serve. If you want to specialize in AI data engineering, add vector databases and embedding pipelines to your skill set.
What to Expect in Interviews
Expect SQL deep-dives (query optimization, partitioning strategies, data modeling), Python coding focused on data pipeline patterns, and system design questions about building scalable ETL workflows. Companies with ML teams will ask about feature stores, embedding pipelines, and training data management. Be ready to discuss data quality monitoring, pipeline orchestration, and how you'd handle schema evolution in a production data lake.
When evaluating opportunities: Strong postings specify the data stack, mention ML pipeline work, and describe the scale of data you'll be working with. Look for companies that understand the connection between data quality and model quality. Avoid roles that conflate data engineering with data analysis.
AI Hiring Overview
The AI job market has 3,823 open positions tracked in our dataset. By seniority: 112 entry-level, 1,798 mid-level, 1,516 senior, and 397 leadership roles (Director, VP, C-Level). Remote roles make up 15% of the market (590 positions). The remaining 3,217 roles require on-site or hybrid attendance.
The market median for AI roles is $200,100. Top-quartile compensation starts at $253,500. The 90th percentile reaches $307,500. Highest-paying categories: AI Engineering Manager ($275,000 median, 41 roles); AI Safety ($274,200 median, 55 roles); Research Engineer ($260,000 median, 434 roles).
Data Engineer demand in AI contexts is strong and growing. Every company building AI needs clean, reliable data pipelines. The shift toward real-time AI applications (chatbots, recommendation engines, agent systems) means data engineering is more critical than ever. Companies are willing to pay premium salaries for data engineers with AI/ML pipeline experience.
The AI Job Market Today
The AI job market spans 3,823 open positions across 15 role categories. The largest categories by volume: AI/ML Engineer (2,629), Data Scientist (322), AI Software Engineer (279). These three account for the majority of open positions, though smaller categories often have higher per-role compensation because of specialized skill requirements.
The seniority mix tells a story about where AI teams are in their maturity. Entry-level roles (112) are outnumbered by mid-level (1,798) and senior (1,516) positions, reflecting that most companies are past the 'build a team from scratch' phase and need experienced engineers who can ship production systems. Leadership roles (Director, VP, C-Level) total 397 positions, representing the bottleneck between technical execution and organizational strategy.
Remote work availability sits at 15% of all AI roles (590 positions), with 3,217 requiring on-site or hybrid attendance. The remote share has stabilized after the post-pandemic correction. Senior and specialized roles (Research Scientist, ML Architect) are more likely to be remote-eligible than entry-level positions, partly because experienced hires have more negotiating power and partly because these roles require less hands-on mentorship.
AI compensation is structured in clear tiers. The market median sits at $200,100. Top-quartile roles start at $253,500, and the 90th percentile reaches $307,500. These figures include base salary with disclosed compensation. Total compensation (including equity, bonuses, and sign-on) runs 20-40% higher at companies that offer those components.
Category matters for compensation. AI Engineering Manager roles lead at $275,000 median, while Prompt Engineer roles sit at $140,000. The spread between highest and lowest-paying categories reflects the premium on specialized technical skills versus broader analytical roles.
The most in-demand skills across all AI postings: Python (1,979 postings), Aws (1,190 postings), Azure (899 postings), Rag (839 postings), Gcp (726 postings), Pytorch (595 postings), Prompt Engineering (595 postings), Claude (540 postings). Python dominates, appearing in the vast majority of role descriptions regardless of category. Cloud platform experience (AWS, GCP, Azure) is the second most common requirement. The newer entrants to the top skills list (RAG, vector databases, LLM APIs) reflect the shift from traditional ML toward generative AI applications.
Frequently Asked Questions
Get Weekly AI Career Intelligence
Salary data, skills demand, and market signals from 16,000+ AI job postings. Every Monday.