Machine Learning Engineer - Generative AI & Full-Stack

CVS Health

Boston, US

On-site

Job Description

Position: Staff Machine Learning Engineer - Generative AI & Full-Stack Applications

We’re building a world of health around every individual — shaping a more connected, convenient and compassionate health experience. At CVS Health®, you’ll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold ourselves accountable and prioritize safety and quality in everything we do. Join us and be part of something bigger – helping to simplify health care one person, one family and one community at a time.

At CVS Health, our purpose is to deliver better health outcomes by meeting consumers where they are—through local care, digital experiences, and a nationwide team committed to quality, safety, and affordability.

Our Solutions Engineering and Infrastructure organization is building an enterprise AI/ML capability that delivers reliable, responsible, and secure AI‑powered platforms and solutions at Fortune 5 scale, and this role is foundational to developing that capability.

This is a senior individual-contributor role focused on identifying, evaluating, and documenting high-value use cases; designing and prototyping AI-powered solutions; and evolving them into secure, resilient, enterprise‑ready products and platform components.

Key Responsibilities:

AI Solution Design & Prototyping

Partner with stakeholders to identify, evaluate, document, and shape GenAI use cases (copilots, automation, decision support, and insight generation) with clear success metrics.
Design solution architectures that integrate LLMs with enterprise systems, data sources, and tool/function calling while meeting latency and reliability expectations.
Develop prototypes rapidly and validate them through evaluation, red‑teaming, and user feedback; document tradeoffs and recommendations.

Production Engineering & Enterprise Readiness

Build production‑grade services and full‑stack experiences (APIs, UIs, workflows) with secure authentication/authorization, audit logging, and scalable deployment patterns.
Implement safety, privacy, and compliance controls (e.g., PHI/PII protection, prompt injection defenses, data residency constraints, and policy‑based filtering).
Instrument solutions end‑to‑end with metrics, traces, logs, and model/app observability; contribute to SLOs, error budgets, and operational runbooks.

Model Enablement & Evaluation

Build and maintain evaluation harnesses for LLM quality, safety, and business outcomes (offline tests, golden sets, regression suites, and online experiments).
Implement RAG pipelines (chunking, embedding, vector search, reranking) and optimize for accuracy, cost, and latency.
Collaborate with platform teams on deployment, monitoring, drift/quality detection, and incident response for model‑backed services.

Reusable Components & Engineering Excellence

Contribute reusable libraries and patterns for prompt management, retrieval, tool calling, and policy enforcement.
Participate in design reviews and code reviews; mentor senior and mid‑level engineers on GenAI engineering practices.
Continuously improve developer experience through templates, CI/CD automation, and documentation that accelerates safe adoption.

Required Qualifications:

7+ years of software engineering supporting Data or AI/ML initiatives, including building and operating production services.
3+ years applying ML/AI in production; demonstrated hands‑on GenAI delivery (LLMs, RAG, evaluation, and safety controls).
3+ years of experience delivering solutions in high‑scale, high‑availability environments with strong security and compliance requirements.

Preferred Qualifications:

Strong full‑stack engineering skills (backend services, APIs, and modern web application development) with a focus on reliability and security.
Hands‑on expertise with LLM application patterns: RAG, tool/function calling, prompt management, evaluation, and guardrails.
Experience with Python and at least one additional backend language; familiarity with common ML libraries and serving frameworks.
Working knowledge of containerization and Kubernetes, CI/CD, infrastructure‑as‑code concepts, and production observability.
Ability to communicate clearly, influence across teams, and translate business needs into implementable technical plans.

Education:

Bachel…

Skills & Requirements

Technical Skills

Python, LLMs, RAG, Kubernetes, CI/CD, Infrastructure-as-code, Production observability, Communication, Influence, Teamwork, Healthcare

Employment Type

FULL TIME

Level

senior

Posted

4/20/2026
