Machine Learning Engineer/Senior Machine Learning Engineer - Devops

Genentech
Washington, US
Remote

Job Description

The Position

A healthier future. It’s what drives us to innovate. To continuously advance science and ensure everyone has access to the healthcare they need today and for generations to come. Creating a world where we all have more time with the people we love. That’s what makes us Roche.

Advances in AI, data, and computational sciences are transforming drug discovery and development. Roche’s Research and Early Development organisations at Genentech (gRED) and Pharma (pRED) have demonstrated how these technologies accelerate R&D, leveraging data and novel computational models to drive impact. Seamless data sharing and access to models across gRED and pRED are essential to maximising these opportunities. The new Computational Sciences Center of Excellence (CoE) is a strategic, unified group whose goal is to harness this transformative power of data and Artificial Intelligence (AI) to assist our scientists in both pRED and gRED to deliver more innovative and transformative medicines for patients worldwide.

The Opportunity

At Roche's AI for Drug Discovery (AIDD) group (Prescient Design), we are revolutionizing drug discovery with cutting-edge machine learning techniques. We are seeking a highly motivated and skilled ML Infrastructure DevOps Engineer to join our growing team within Genentech Research and Early Development AI Drug Development (gRED AIDD). This role is crucial for building and maintaining the scalable and robust infrastructure that powers our machine learning initiatives. The ideal candidate will be proactive, user-facing, and possess a "get-it-done" attitude, while consistently adhering to corporate standards and best practices.

What you'll do

Machine Learning Engineer - DevOps

  • Design, implement, and maintain scalable and reliable ML infrastructure on AWS.
  • Automate deployment, monitoring, alerting, and operational tasks using tools like Terraform and Helm.
  • Manage and optimize CI/CD pipelines and Git repositories for ML projects, ensuring efficient version control to support collaboration and deployment.
  • Collaborate closely with ML engineers and data scientists to understand their infrastructure needs and provide solutions.
  • Troubleshoot and resolve infrastructure-related issues in a timely manner.
  • Implement and enforce security best practices for ML infrastructure.
  • Document infrastructure designs, processes, and operational procedures.
  • Contribute to initiatives independently as part of a team, delivering assigned outputs.
  • Proactively identify issues and gaps, proposing ideas and suggestions for improvements.

Senior Machine Learning Engineer - DevOps

  • Lead the architecture and delivery of significant technical solutions
  • Mentor junior engineers and drive technical alignment and influence across teams
  • Design, implement, and maintain scalable and reliable ML infrastructure on AWS.
  • Demonstrated ability to lead technical projects from conception to completion and deliver high-quality, scalable, and reliable software." to "Who you are
  • Automate deployment, monitoring, alerting, and operational tasks using tools like Terraform and Helm.
  • Manage and optimize CI/CD pipelines and Git repositories for ML projects, ensuring efficient version control to support collaboration and deployment.
  • Collaborate closely with ML engineers and data scientists to understand their infrastructure needs and provide solutions.
  • Troubleshoot and resolve infrastructure-related issues in a timely manner.
  • Implement and enforce security best practices for ML infrastructure.
  • Document infrastructure designs, processes, and operational procedures.
  • Contribute to initiatives independently as part of a team, delivering assigned outputs.
  • Proactively identify issues and gaps, proposing ideas and suggestions for improvements.

Who you are

Machine Learning Engineer - DevOps

  • BS/MS with 2-3 years of industry experience required
  • Proven experience in designing, deploying, and managing infrastructure on Amazon Web Services (AWS), including services such as EC2, S3, RDS, EKS, SageMaker, etc.
  • Strong proficiency with Git and Git repository management.
  • Hands-on experience with Terraform for infrastructure provisioning and management.
  • Experience with Helm for deploying and managing applications on Kubernetes.
  • Proficiency in scripting languages (e.g., Python, Bash) for automation.
  • Excellent problem-solving skills and a strong ability to debug complex issues.
  • Strong communication and interpersonal skills to effectively collaborate with cross-functional teams and user-facing interactions.
  • Demonstrated ability to take initiative, anticipate needs, and drive projects to completion.
  • Ability to thrive in a fast-paced environment and adapt to evolving requirements while adhering to corporate guidelines.
  • Ability to write clean code with little syntax/convention feedback.
  • Applies software engineering best practices (linting automation, unit testing, documentati

Skills & Requirements

Technical Skills

AwsTerraformHelmCi/cd pipelinesGit repositoriesMl infrastructureSecurity best practicesDocumentationSoftware engineering best practicesCollaborationInterpersonal skillsInitiativeAdaptabilityClean codeFinanceHealthcare

Employment Type

FULL TIME

Level

Mid-Level

Posted

4/18/2026

Apply Now

You will be redirected to Genentech's application portal.