AI Evaluation & Testing Jobs

Find AI jobs focused on model evaluation, benchmarking, and quality assurance, including ML testing and AI safety positions.

20 Open Positions
$257K Avg. Salary
3 Remote Roles

Data updated weekly. Last refreshed 2026-03-08.

Prompt Engineer
Senior Software Engineer - Retrieval-Augmented Generation (RAG) System
RELX Group
$86K - $156K Philadelphia, PA, US
View Role →
Prompt Engineer
Senior Software Engineer - Retrieval-Augmented Generation (RAG) System
Elsevier
$86K - $156K Philadelphia, PA, US
View Role →
AI/ML Engineer
Member of Technical Staff - Enterprise Model Evaluation
xAI
$180K - $440K Palo Alto, CA, US
View Role →
AI/ML Engineer
2026 Summer Intern - DDC - Applied AI Engineer - Agents & Evaluation
Genentech
San Francisco, CA, US
View Role →
AI/ML Engineer
2026 Summer Intern - DDC - Applied AI Engineer - Agents & Evaluation
Genentech
San Francisco, CA, US
View Role →
AI/ML Engineer
Machine Learning Engineer - Model Evaluations, Public Sector
Scale AI
$208K - $300K New York, NY, US
View Role →
Research Engineer
Research Engineer - Safety System, Evaluation and Foundation
Meta
Menlo Park, CA, US
View Role →
AI/ML Engineer
Applied Science Manager - Web Search/Retrieval/Ranking, AGI Info - Web Information Systems
Amazon.com
$165K - $286K Sunnyvale, CA, US
View Role →
AI/ML Engineer
Applied Science Manager - Web Search/Retrieval/Ranking, AGI Info - Web Information Systems
Amazon.com
$165K - $286K El Segundo, CA, US
View Role →
AI/ML Engineer
ML Engineer, Foundation Model Evaluation
Waymo
$170K - $216K Remote
View Role →
Research Scientist
AIML - Sr Applied AI Scientist - GenAI Model Autograding, Evaluation
Apple
Cupertino, CA, US
View Role →
Research Scientist
AIML - Sr Applied AI Scientist - GenAI Model Autograding, Evaluation
Apple
Cupertino, CA, US
View Role →
Data Scientist
Machine Learning Evaluation Engineer
Bedrock Robotics
San Francisco, CA, US
View Role →
RAG Engineer
Senior Product Manager, Retrieval Engine Platform
Google
$183K - $271K San Francisco, CA, US
View Role →
RAG Engineer
Senior Product Manager, Retrieval Engine Platform
Google
$183K - $271K Sunnyvale, CA, US
View Role →
LLM Engineer
Head of Evaluation and Oversight Research
Scale AI
$252K - $315K New York, NY, US
View Role →
AI/ML Engineer
Applied ML Engineer - AI/ML Evaluation & Simulation
Apple
Seattle, WA, US
View Role →
AI/ML Engineer
Senior Test & Evaluation Engineer - AIS
Anduril
$146K - $183K Remote
View Role →
AI/ML Engineer
Senior Test & Evaluation Engineer - AIS
Anduril
$146K - $183K Remote
View Role →
AI/ML Engineer
Applied Science Manager - Web Search/Retrieval/Ranking, AGI Info - Web Information Systems
Amazon.com
$165K - $286K El Segundo, CA, US
View Role →

Frequently Asked Questions

AI Pulse currently tracks 20 AI job openings that require AI Evaluation & Testing skills. Three of these are remote positions.
AI roles requiring AI Evaluation & Testing skills pay an average of $257K, based on disclosed compensation. Specialized skills like AI Evaluation & Testing, combined with production experience, typically command a 10-20% premium over general AI roles.
