Position: Senior Staff Machine Learning Engineer, Data & Eval
Senior Staff Machine Learning Engineer, Data & Eval
United States
AI and ML are at the heart of the Airbnb product. From Trust to Payments, and from Customer Service to Marketing, we rely on ML to ensure that guests and hosts have the best possible experience with Airbnb.
The Core ML team is responsible for driving CSxAI (Customer Support x Artificial Intelligence) initiatives by adopting Generative AI technologies to enable an intelligent, scalable, and exceptional service experience. The team develops and enhances AI models, ML services, and tools including LLM fine‑tuning and optimization, RAG/Search, LLM evaluation and testing automation, feedback‑based learning, and guardrails for a wide range of applications at Airbnb.
The Difference You Will Make:
In this Senior Staff role, you will set technical direction and lead execution for ML evaluation and the end‑to‑end data flywheel powering CSxAI products (e.g., assistive agents, issue resolution, and tooling). Your work will define how we measure quality, how we turn feedback into learning signals, and how we continuously improve models and products safely and efficiently. You will partner closely with product, engineering, design, operations to build evaluation systems that are trusted, scalable, and actionable—connecting offline metrics to online outcomes.
A Typical Day:
: instrumentation, feedback collection, data quality checks, labeling strategy, dataset versioning, and governance to support continuous improvement.
Minimum Qualifications:
:
PhD in Computer Science, Mathematics, Statistics, or related technical field (or equivalent practical experience).
: 10+ years building, testing, and shipping ML/AI systems end‑to‑end; including 2+ years of experience with GenAI/LLM systems in production.
: 5+ years leading large, ambiguous technical initiatives as a senior IC, influencing roadmap and engineering/science direction across teams.
:
Preferred Qualifications:
:
Experience applying ML/AI to customer support workflows (e.g., agent assist, classification/routing, resolution recommendation, QA).
:
Experience building robust evaluation platforms for agent behavior validation, safety/guardrails, and continuous improvement.
:
Proven ability to take evaluation and data flywheel work from incubation to production; iterating quickly while maintaining scientific rigor.
Your
Location:
This position is US - Remote Eligible. The role may include occasional work at an Airbnb office or attendance at offsites, as agreed to with your manager. While the position is Remote Eligible, you must live in a state where Airbnb, Inc. has a registered entity. for the up‑to‑date list of excluded states. This list is continuously evolving, so please check back with us if the state you live in is on the exclusion list.
If your position is employed by another Airbnb entity, your recruiter will inform you what states you are eligible to work from.
How We’ll Take Care of You:
Our job titles may span more than one career level. The actual base pay is dependent upon many factors, such as training, transferable skills, work experience, business needs and market demands. The base pay range is subject to change and may be modified in the future. This role may also be eligible for bonus, equity, benefits, and Employee Travel Credits.
#J-18808-Ljbffr
FULL TIME
senior
5/7/2026
You will be redirected to airbnb, Inc.'s application portal.
Sign in and we'll score your resume against this role.
Browse roles in the same category, level, and remote setup.
Sign in to open the target role workbench.