Research Engineer: AI Model Efficiency & Speed

Cohere

San Diego, US

Remote

Job Description

Position: Staff Research Engineer: AI Model Efficiency & Speed

A leading AI research company in San Francisco is seeking a Staff Research Engineer to enhance the efficiency of large language models. In this role, you will develop and implement advanced techniques to optimize model performance in production. Ideal candidates will hold a PhD in Machine Learning and have experience with model architecture and inference optimization. Join a diverse team committed to innovation within a collaborative and remote-friendly work culture, complete with generous benefits and vacation time.

#J-18808-Ljbffr

Skills & Requirements

Technical Skills

Machine learningModel architectureInference optimizationAi researchMachine learning

Employment Type

FULL TIME

Level

senior

Posted

5/1/2026

Apply Now

You will be redirected to Cohere's application portal.