Member of Technical Staff, Research Engineer

San Francisco, CA, US Senior Research Engineer

Interested in this Research Engineer role at Plato?

Apply Now →

About This Role

AI job market dashboard showing open roles by category

Introduction

================

Plato is an applied research lab building the foundational infrastructure to train specialized AI agents.

We turn real\-world data streams into high\-fidelity simulated environments that generate the training signal needed to make capable models. Our work supports frontier labs, hyperscalers, and enterprises building AI systems for complex, high\-stakes work.

Today, only a handful of players can train models for capable work. Compute and algorithms are rapidly commoditizing, but reinforcement learning data remains the bottleneck. Plato is changing that by automatically scaling training environments from proprietary real\-world data.

Why This Role Matters

=========================

Research engineering is central to Plato's product and research loop.

The hard part of training specialized agents is not producing tasks that look plausible. It is finding tasks that are grounded in real workflows, difficult for current models, resistant to reward hacking, and useful as training signal. To do that, we need research engineers who can turn messy traces, model failures, and researcher hypotheses into environments, verifiers, rewards, evaluations, and curricula that improve continuously.

As a Member of Technical Staff, Research Engineer, you will own the loop that discovers high\-signal training targets for frontier models.

Role Description

====================

You will design experiments, build task generation systems, run evaluations, inspect model failures, and develop methods for mining tasks that are just out of reach of today's agents.

The work is empirical, systems\-heavy, and close to the frontier. You will consume real\-world trajectories or researcher hypotheses, materialize realistic data, propose candidate tasks, benchmark those tasks against frontier computer\-use and agent models, and hill\-climb until you find the failures that produce useful learning signal.

This is not a role for someone who only wants to run experiments or only wants to write research code. You will own the full loop: hypothesis, implementation, evaluation, analysis, iteration, and productionalization.

You Will Work On

====================

  • Discover model failure modes from real\-world traces, agent telemetry, targeted researcher hypotheses, and customer workflows.
  • Generate realistic curricula grounded in actual workflows rather than toy synthetic benchmarks.
  • Benchmark candidate tasks against frontier CUA and agent models using pass rates, rollouts, and behavioral traces as difficulty signals.
  • Build hill\-climbing loops that mutate, filter, and rescore tasks until they surface high\-signal targets.
  • Study reward hackability, distribution mismatch, task realism, long\-horizon failures, and transfer from simulation to deployed agents.
  • Turn research prototypes into reliable internal systems for continuous curriculum generation.

What We're Looking For

==========================

We're looking for someone who is excited to work at the intersection of empirical AI research, systems engineering, and model evaluation.

You may be a strong fit if you:

  • Have strong implementation ability and can turn ambiguous research ideas into working systems.
  • Have experience with RL, LLM agents, computer\-use agents, evals, post\-training, synthetic data, simulation, or model behavior analysis.
  • Care deeply about whether a task is grounded, difficult, reward\-hack\-resistant, and capable of producing actual learning signal.
  • Are comfortable interpreting ambiguous model behavior and negative results.
  • Enjoy building continuous research loops rather than static benchmark artifacts.

How We Work

===============

Being an engineer at an early\-stage AI startup is not easy. These are the values we care about.

Ownership

-------------

We value teammates who bring novel ideas to the table, experiment, and see results through end to end. You'll have access to massive compute budgets to test large scale experiments.

Move Fast, Build Durable

----------------------------

Demand is growing faster than our team. We move quickly, prioritize ruthlessly, and ship systems that keep working under load.

Reality Over Narratives

---------------------------

Training data is incredibly fragile and prone to reward\-hacking. We prioritize digging deep through data, manually if we have to, to garner deep intuition on retaining high quality throughput.

Stay Close to the Frontier

------------------------------

New AI capabilities rapidly change how we think about problems and what doors open. We stay close to the frontier of model capability, and encourage teammates to constantly share new findings and update their world model of what's possible.

Get In Touch

================

Send us an email at [email protected] if you're interested in building the simulation infrastructure of the future!

Role Details

Company Plato
Title Member of Technical Staff, Research Engineer
Location San Francisco, CA, US
Experience Senior
Salary Not disclosed
Remote No

About This Role

Research Engineers bridge the gap between research and production. They implement papers, build experiment infrastructure, optimize training pipelines, and make research prototypes production-ready. They're the engineers who make research work at scale.

The role sits at a unique intersection. You need to understand the math well enough to implement novel architectures correctly, and you need the engineering chops to make them run efficiently on distributed systems. When a research scientist has a breakthrough idea, you're the person who turns it from a notebook prototype into a training pipeline that runs on 256 GPUs.

Across the 3,824 AI roles we're tracking, Research Engineer positions make up 2% of the market. At Plato, this role fits into their broader AI and engineering organization.

Research Engineer roles are growing as AI labs recognize that research velocity depends on engineering quality. The role is less competitive than Research Scientist (no PhD required), but the bar for engineering skill is very high. These roles are concentrated at major labs and well-funded startups.

What the Work Looks Like

A typical week involves: implementing a new attention mechanism from a recent paper, profiling and optimizing a training pipeline that's bottlenecked on data loading, building evaluation infrastructure for a new benchmark, debugging distributed training issues across a GPU cluster, and pair-programming with a research scientist on their latest experiment. The work is deeply technical.

Research Engineer roles are growing as AI labs recognize that research velocity depends on engineering quality. The role is less competitive than Research Scientist (no PhD required), but the bar for engineering skill is very high. These roles are concentrated at major labs and well-funded startups.

Skills in Demand for This Role

Python (51% of roles) Aws (31% of roles) Azure (23% of roles) Rag (23% of roles) Gcp (19% of roles) Prompt Engineering (15% of roles) Pytorch (15% of roles) Claude (14% of roles)

Strong software engineering fundamentals plus ML knowledge. Python, C++, and CUDA experience are common requirements. You'll need to read papers and turn ideas into working code. Distributed systems experience (especially distributed training) is highly valued. Performance optimization skills separate great candidates from good ones.

Experience with large-scale training infrastructure (FSDP, DeepSpeed, Megatron), GPU programming (CUDA, Triton), and the internals of ML frameworks (PyTorch internals, custom autograd functions) is what makes candidates stand out. The best research engineers can debug issues that span the full stack from GPU memory management to numerical precision to algorithmic correctness.

Strong postings mention the team's recent research, the infrastructure scale, and the specific technical challenges. They often list the research areas you'd support. Look for roles that emphasize both implementation quality and research understanding.

Compensation Benchmarks

Research Engineer roles pay a median of $260,000 based on 401 positions with disclosed compensation. Senior-level AI roles across all categories have a median of $227,400.

Across all AI roles, the market median is $200,000. Top-quartile compensation starts at $253,000. The 90th percentile reaches $307,500. For comparison, the highest-paying categories include AI Engineering Manager ($293,500) and AI Safety ($274,200). By seniority level: Entry: $97,380; Mid: $160,000; Senior: $227,400; Director: $243,000; VP: $250,000.

Plato AI Hiring

Plato has 1 open AI role right now. They're hiring across Research Engineer. Based in San Francisco, CA, US.

Location Context

AI roles in San Francisco pay a median of $253,000 across 1,990 tracked positions. That's 26% above the national median.

Career Path

Common paths into Research Engineer roles include Software Engineer, ML Engineer, Research Intern.

From here, career progression typically leads toward Senior Research Engineer, Research Scientist, ML Architect.

This is one of the best entry points into AI research without a PhD. Build a strong engineering portfolio with ML projects, contribute to open-source ML frameworks, and demonstrate that you can implement complex ideas correctly and efficiently. The transition to Research Scientist is possible with published first-author work, which some research engineer roles support.

What to Expect in Interviews

Technical screens test both engineering skill and research understanding. Expect coding rounds with performance-critical implementations (GPU optimization, efficient data loading). Be prepared to discuss papers relevant to the team's research area and explain how you'd implement key ideas. System design questions focus on training infrastructure: distributed training, experiment tracking, and compute resource management.

When evaluating opportunities: Strong postings mention the team's recent research, the infrastructure scale, and the specific technical challenges. They often list the research areas you'd support. Look for roles that emphasize both implementation quality and research understanding.

AI Hiring Overview

The AI job market has 3,824 open positions tracked in our dataset. By seniority: 119 entry-level, 1,813 mid-level, 1,472 senior, and 420 leadership roles (Director, VP, C-Level). Remote roles make up 16% of the market (613 positions). The remaining 3,187 roles require on-site or hybrid attendance.

The market median for AI roles is $200,000. Top-quartile compensation starts at $253,000. The 90th percentile reaches $307,500. Highest-paying categories: AI Engineering Manager ($293,500 median, 31 roles); AI Safety ($274,200 median, 51 roles); Research Engineer ($260,000 median, 401 roles).

Research Engineer roles are growing as AI labs recognize that research velocity depends on engineering quality. The role is less competitive than Research Scientist (no PhD required), but the bar for engineering skill is very high. These roles are concentrated at major labs and well-funded startups.

The AI Job Market Today

The AI job market spans 3,824 open positions across 15 role categories. The largest categories by volume: AI/ML Engineer (2,702), Data Scientist (281), AI Software Engineer (258). These three account for the majority of open positions, though smaller categories often have higher per-role compensation because of specialized skill requirements.

The seniority mix tells a story about where AI teams are in their maturity. Entry-level roles (119) are outnumbered by mid-level (1,813) and senior (1,472) positions, reflecting that most companies are past the 'build a team from scratch' phase and need experienced engineers who can ship production systems. Leadership roles (Director, VP, C-Level) total 420 positions, representing the bottleneck between technical execution and organizational strategy.

Remote work availability sits at 16% of all AI roles (613 positions), with 3,187 requiring on-site or hybrid attendance. The remote share has stabilized after the post-pandemic correction. Senior and specialized roles (Research Scientist, ML Architect) are more likely to be remote-eligible than entry-level positions, partly because experienced hires have more negotiating power and partly because these roles require less hands-on mentorship.

AI compensation is structured in clear tiers. The market median sits at $200,000. Top-quartile roles start at $253,000, and the 90th percentile reaches $307,500. These figures include base salary with disclosed compensation. Total compensation (including equity, bonuses, and sign-on) runs 20-40% higher at companies that offer those components.

Category matters for compensation. AI Engineering Manager roles lead at $293,500 median, while Prompt Engineer roles sit at $142,800. The spread between highest and lowest-paying categories reflects the premium on specialized technical skills versus broader analytical roles.

The most in-demand skills across all AI postings: Python (1,968 postings), Aws (1,203 postings), Azure (882 postings), Rag (877 postings), Gcp (735 postings), Prompt Engineering (587 postings), Pytorch (586 postings), Claude (554 postings). Python dominates, appearing in the vast majority of role descriptions regardless of category. Cloud platform experience (AWS, GCP, Azure) is the second most common requirement. The newer entrants to the top skills list (RAG, vector databases, LLM APIs) reflect the shift from traditional ML toward generative AI applications.

Frequently Asked Questions

Based on 401 roles with disclosed compensation, the median salary for Research Engineer positions is $260,000. Actual compensation varies by seniority, location, and company stage.
Strong software engineering fundamentals plus ML knowledge. Python, C++, and CUDA experience are common requirements. You'll need to read papers and turn ideas into working code. Distributed systems experience (especially distributed training) is highly valued. Performance optimization skills separate great candidates from good ones.
About 16% of the 3,824 AI roles we track offer remote work. Remote availability varies by company and seniority level, with senior and leadership roles more likely to offer location flexibility.
Plato is among the companies actively hiring for AI and ML talent. Check our company profiles for detailed breakdowns of open roles, salary ranges, and hiring trends.
Common next steps from Research Engineer positions include Senior Research Engineer, Research Scientist, ML Architect. Progression depends on whether you lean toward technical depth, people management, or product strategy.

Get Weekly AI Career Intelligence

Salary data, skills demand, and market signals from 16,000+ AI job postings. Every Monday.