Staff Research Engineer: AI Model Efficiency & Speed (Hiring Immediately)

Cohere
San Quentin, US
On-site

Job Description

A leading AI research company in San Francisco is seeking a Staff Research Engineer to enhance the efficiency of large language models. In this role, you will develop and implement advanced techniques to optimize model performance in production. Ideal candidates will hold a PhD in Machine Learning and have experience with model architecture and inference optimization. Join a diverse team committed to innovation within a collaborative and remote-friendly work culture, complete with generous benefits and vacation time. #J-18808-Ljbffr

Skills & Requirements

Technical Skills

Machine learningModel architectureInference optimizationAi researchModel efficiencyModel performance

Level

Mid-Level

Posted

5/8/2026

Apply Now

You will be redirected to Cohere's application portal.

Sign in and we'll score your resume against this role.

Find Similar Jobs

Browse roles in the same category, level, and remote setup.

Sign in to open the target role workbench.