NVIDIA: Software Engineer, DGX Cloud AI

Interested in this AI Software Engineer role at NVIDIA?

Apply Now →

Skills & Technologies

PythonPytorch

About This Role

AI job market dashboard showing open roles by category

NVIDIA is at the forefront of the generative AI revolution, building the software and systems that power the world’s most advanced large language model workloads. We are looking for a Software Engineer focused on bring\-up, triage, benchmarking, analysis, and optimization of distributed training and inference workloads across NVIDIA GPU platforms at the largest scales we run.

In this role you will help bring up, benchmark, and debug distributed LLM workloads on multi\-GPU and multi\-node deployments, and own the design and implementation of the benchmarking tooling, automation, and debugging workflows that support them. This is a hands\-on role for an engineer who enjoys deep technical problems across deep learning systems, GPU performance, distributed computing, and large\-scale operations.

What you’ll be doing:

Bring up, validate, and debug large\-scale AI clusters, infrastructure, and end\-to\-end workloads.
Bring up, tune, and benchmark AI pre\-training, post\-training, and inference workloads using PyTorch, NeMo / Megatron, TensorRT\-LLM, and adjacent NVIDIA AI software stacks.
Perform root\-cause analysis of failures in large distributed environments
Contribute to the resilience and failure\-attribution tooling that detects, triages, and attributes node, fabric, and workload failures across the cluster.
Build and maintain repeatable benchmark suites, automation, acceptance criteria, and qualification workflows on new platforms.
Tune runtime settings, communication parameters, and deployment configurations in close partnership with framework, systems, and platform teams.
Deliver actionable, data\-driven recommendations based on profiling, benchmark results, and cluster characterization.

What we need to see:

Bachelor’s or Master’s in Computer Science or a related technical field (or equivalent experience).
3\+ years of experience developing software for AI, HPC, or systems\-level applications.
Hands\-on experience with multi\-GPU or multi\-node workloads and CUDA\-aware distributed execution.
Backgroun with debugging and scaling distributed systems.
Experience debugging and triaging AI applications across the full stack, from the application level toward the hardware.
Experience operating workloads in scheduled, containerized cluster environments.
Excellent analytical, debugging, and communication skills, and a collaborative approach across teams.
Strong Python and C/C\+\+ programming skills.

Ways to stand out from the crowd:

Hands\-on experience with NCCL and CUDA\-aware distributed execution.
Deep familiarity with the RDMA software stack (NCCL, IB verbs, UCX, libfabric) and with InfiniBand / RoCE congestion debugging.
Experience building acceptance tests, benchmark harnesses, regression gates, or cluster qualification tooling for AI platforms, including MLPerf.
Experience diagnosing performance jitter
Experience building resilience, fault\-detection, or failure\-attribution systems for datacenter\-scale infrastructure.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward\-thinking and hardworking people in the world working for us. If you’re creative, autonomous, and love a challenge, we want to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 116,000 USD \- 189,750 USD for Level 2, and 140,000 USD \- 224,250 USD for Level 3\.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 7, 2026\.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Salary Context

This $116K-$224K range is below the median for AI Software Engineer roles in our dataset (median: $190K across 193 roles with salary data).

Role Details

Company NVIDIA

Title Software Engineer, DGX Cloud AI Infrastructure

Location Austin, TX, US

Category AI Software Engineer

Experience Mid Level

Salary $116K - $224K

Remote No

About This Role

AI Software Engineers build the applications and systems that AI models run inside. They own the API layers, data pipelines, frontend integrations, and infrastructure that turn a model into a product users interact with. Every AI company needs engineers who can build the software around the AI.

The challenge is building reliable systems around inherently unreliable components. Models are probabilistic. They'll give different answers to the same question. They hallucinate. They're slow. They're expensive. Your job is to build an application layer that handles all of this gracefully while delivering a product that users trust and enjoy.

Across the 3,824 AI roles we're tracking, AI Software Engineer positions make up 7% of the market. At NVIDIA, this role fits into their broader AI and engineering organization.

AI Software Engineer roles are among the most numerous in the AI job market. Every company deploying AI needs software engineers who understand AI integration patterns. The demand is broad, spanning startups to enterprises, across every industry adopting AI capabilities.

What the Work Looks Like

A typical week includes: building API endpoints that serve model inference with caching and fallback logic, designing the data pipeline that feeds context to a RAG system, implementing streaming responses in the frontend, debugging a race condition in the async inference pipeline, and optimizing database queries for the vector search layer. It's full-stack engineering with AI at the center.

Skills Required

Python (51% of roles) Pytorch (15% of roles)

Full-stack engineering skills with AI integration experience. Python and TypeScript are the most common requirements. You'll need to understand API design, database architecture, and how to build reliable systems around probabilistic outputs. Experience with streaming, async processing, and caching patterns is increasingly important as real-time AI applications proliferate.

Knowledge of vector databases, embedding APIs, and LLM integration patterns (function calling, structured outputs, retry logic) differentiates AI software engineers from general software engineers. Understanding cost optimization (caching strategies, model routing, batched inference) is valuable since inference costs can dominate application economics.

Strong postings describe the product you'll be building, the AI integration patterns you'll work with, and the scale requirements. Look for companies that have existing AI features and need engineers to improve and expand them, not companies that are 'planning to add AI' someday.

Compensation Benchmarks

AI Software Engineer roles pay a median of $234,620 based on 682 positions with disclosed compensation. Mid-level AI roles across all categories have a median of $160,000. This role's midpoint ($170K) sits 27% below the category median. Disclosed range: $116K to $224K.

Across all AI roles, the market median is $200,000. Top-quartile compensation starts at $253,000. The 90th percentile reaches $307,500. For comparison, the highest-paying categories include AI Engineering Manager ($293,500) and AI Safety ($274,200). By seniority level: Entry: $97,380; Mid: $160,000; Senior: $227,400; Director: $243,000; VP: $250,000.

NVIDIA AI Hiring

NVIDIA has 22 open AI roles right now. They're hiring across AI Software Engineer, AI/ML Engineer, AI Product Manager, MLOps Engineer. Positions span Austin, TX, US, Santa Clara, CA, US, CA, US. Compensation range: $224K - $379K.

Location Context

AI roles in Austin pay a median of $218,800 across 493 tracked positions. That's 9% above the national median.

Career Path

Common paths into AI Software Engineer roles include Software Engineer, Full-Stack Developer, Backend Engineer.

From here, career progression typically leads toward Staff Engineer, AI Architect, Engineering Manager.

If you're a software engineer, you're already 80% there. Learn the AI integration patterns: RAG, streaming inference, function calling, structured outputs. Build a project that demonstrates you can wrap an AI model in a production-quality application with proper error handling, caching, and user experience. That's the portfolio piece that gets you hired.

What to Expect in Interviews

Technical screens look like standard software engineering interviews with an AI twist. Expect system design questions about building reliable applications around probabilistic models: handling streaming responses, implementing retry logic for API failures, and designing caching strategies for LLM outputs. Coding rounds test standard algorithms plus practical integration patterns like async processing and rate limiting.

When evaluating opportunities: Strong postings describe the product you'll be building, the AI integration patterns you'll work with, and the scale requirements. Look for companies that have existing AI features and need engineers to improve and expand them, not companies that are 'planning to add AI' someday.

AI Hiring Overview

The AI job market has 3,824 open positions tracked in our dataset. By seniority: 119 entry-level, 1,813 mid-level, 1,472 senior, and 420 leadership roles (Director, VP, C-Level). Remote roles make up 16% of the market (613 positions). The remaining 3,187 roles require on-site or hybrid attendance.

The market median for AI roles is $200,000. Top-quartile compensation starts at $253,000. The 90th percentile reaches $307,500. Highest-paying categories: AI Engineering Manager ($293,500 median, 31 roles); AI Safety ($274,200 median, 51 roles); Research Engineer ($260,000 median, 401 roles).

The AI Job Market Today

The AI job market spans 3,824 open positions across 15 role categories. The largest categories by volume: AI/ML Engineer (2,702), Data Scientist (281), AI Software Engineer (258). These three account for the majority of open positions, though smaller categories often have higher per-role compensation because of specialized skill requirements.

The seniority mix tells a story about where AI teams are in their maturity. Entry-level roles (119) are outnumbered by mid-level (1,813) and senior (1,472) positions, reflecting that most companies are past the 'build a team from scratch' phase and need experienced engineers who can ship production systems. Leadership roles (Director, VP, C-Level) total 420 positions, representing the bottleneck between technical execution and organizational strategy.

Remote work availability sits at 16% of all AI roles (613 positions), with 3,187 requiring on-site or hybrid attendance. The remote share has stabilized after the post-pandemic correction. Senior and specialized roles (Research Scientist, ML Architect) are more likely to be remote-eligible than entry-level positions, partly because experienced hires have more negotiating power and partly because these roles require less hands-on mentorship.

AI compensation is structured in clear tiers. The market median sits at $200,000. Top-quartile roles start at $253,000, and the 90th percentile reaches $307,500. These figures include base salary with disclosed compensation. Total compensation (including equity, bonuses, and sign-on) runs 20-40% higher at companies that offer those components.

Category matters for compensation. AI Engineering Manager roles lead at $293,500 median, while Prompt Engineer roles sit at $142,800. The spread between highest and lowest-paying categories reflects the premium on specialized technical skills versus broader analytical roles.

The most in-demand skills across all AI postings: Python (1,968 postings), Aws (1,203 postings), Azure (882 postings), Rag (877 postings), Gcp (735 postings), Prompt Engineering (587 postings), Pytorch (586 postings), Claude (554 postings). Python dominates, appearing in the vast majority of role descriptions regardless of category. Cloud platform experience (AWS, GCP, Azure) is the second most common requirement. The newer entrants to the top skills list (RAG, vector databases, LLM APIs) reflect the shift from traditional ML toward generative AI applications.

Frequently Asked Questions

Based on 682 roles with disclosed compensation, the median salary for AI Software Engineer positions is $234,620. Actual compensation varies by seniority, location, and company stage.

About 16% of the 3,824 AI roles we track offer remote work. Remote availability varies by company and seniority level, with senior and leadership roles more likely to offer location flexibility.

NVIDIA is among the companies actively hiring for AI and ML talent. Check our company profiles for detailed breakdowns of open roles, salary ranges, and hiring trends.

Common next steps from AI Software Engineer positions include Staff Engineer, AI Architect, Engineering Manager. Progression depends on whether you lean toward technical depth, people management, or product strategy.

Get Weekly AI Career Intelligence

Salary data, skills demand, and market signals from 16,000+ AI job postings. Every Monday.

Skills & Technologies

About This Role

What you’ll be doing:

What we need to see:

Ways to stand out from the crowd:

Salary Context

Role Details

About This Role

What the Work Looks Like

Skills Required

Compensation Benchmarks

NVIDIA AI Hiring

Location Context

Career Path

What to Expect in Interviews

AI Hiring Overview

The AI Job Market Today

Frequently Asked Questions

Get Weekly AI Career Intelligence

Related AI Jobs