We are looking for a Senior MLE Tech Lead to join a centralized evaluation organization and define the next generation of autograder quality across 20+ of Apple's most visible generative AI features. You will own the end-to-end technical vision for how we evaluate model outputs at scale - pioneering state-of-the-art methods, raising the technical bar, and leading a team of talented MLEs to build a robust autograder training and hillclimbing system from the ground up.\\n\\nThis is a high-impact, hands-on leadership role at the intersection of model evaluation, data quality, and ML systems engineering. You will work closely with model developers, data teams, and product partners to ensure our autograders are fast, accurate, and continuously improving - directly shaping the quality of AI experiences used by hundreds of millions of people.
In this role you will focus on:
Technical Leadership
People & Collaboration
Master's or PhD in Computer Science, Machine Learning, Artificial Intelligence, or a related field.
5+ years of industry experience in machine learning, with a strong focus on LLM or VLM systems.
Deep expertise in prompt-tuning and fine-tuning techniques (SFT, RLHF, DPO, or equivalent), with proven experience of model calibration and uncertainty estimation.
Familiarity with data flywheel design - leveraging model outputs to continuously improve future training data.
Proficiency in Python and ML frameworks (PyTorch preferred).
Strong ML systems instincts - you care deeply about data quality, reproducibility, latency, and scale.
Background in human-in-the-loop annotation pipelines and inter-annotator agreement analysis.
Prior experience on an evaluation infrastructure or model quality team.
FULL TIME
senior
4/27/2026
You will be redirected to Apple's application portal.
Sign in and we'll score your resume against this role.