Senior ML Engineer I

Patientco
Atlanta, US
On-site

Job Description

ABOUT THIS POSITION

We are seeking a highly skilled and innovative Senior ML Engineer with a passion for building robust, efficient, and domain-specific AI systems using Language Models (LMs) and agentic architectures. As a core member of the team, you will be instrumental in developing the entire ML pipeline, from sophisticated data extraction techniques to fine-tuning specialized LMs and orchestrating their interactions within a multi-agent framework.

This is a unique opportunity to apply state-of-the-art Generative AI and NLP techniques to a real-world, high-impact problem, leveraging the latest research in agentic AI and LMs to deliver economical and powerful solutions.

WHAT YOU'LL DO

Data Pipeline & Knowledge Base Construction:

  • Design, implement, and optimize robust pipelines for ingesting, parsing, and extracting structured information from complex documents, leveraging OCR, document layout analysis, Named Entity Recognition (NER), and Relationship Extraction (RE).
  • Develop rich, nested JSON schemas for representing structured data and ensure scalable storage.
  • Generate and manage high-quality vector embeddings for efficient retrieval-augmented generation (RAG) within a Vector Database.

Language Model (LM) Development & Fine-tuning:

  • Research, select, and experiment with appropriate open-source Language Models, both large and small (e.g., Phi-3, Mistral, Llama, Nemotron-H families), for specialized tasks.
  • Design and execute efficient fine-tuning strategies (e.g., LoRA, QLoRA, full fine-tuning) on curated, domain-specific datasets to achieve precise performance for tasks like coverage determination, code lookups, and policy rule application.
  • Explore and implement knowledge distillation techniques to transfer capabilities from larger models to smaller, more efficient LMs.

Agentic System Design & Implementation:

  • Build and maintain the core agentic framework, including the orchestrator that intelligently routes queries and coordinates interactions between various specialized LM tools.
  • Develop and integrate "tools" (specialized LMs and external APIs) that perform atomic medical necessity tasks, ensuring strict behavioral alignment and structured outputs.

MLOps & Deployment:

  • Deploy, manage, and monitor LMs and agentic components on Google Cloud Platform (GCP) using services like Vertex AI, GKE, Cloud Functions, and Cloud Run.
  • Implement robust MLOps practices for continuous integration, continuous delivery (CI/CD), model versioning, and performance monitoring (latency, throughput, accuracy).

Continuous Improvement & Research:

  • Establish effective feedback loops from end-user interactions and system logs to identify areas for model improvement.
  • Curate and expand training datasets, ensuring data privacy (PHI/PII masking) and legal compliance.
  • Stay abreast of the latest research in LMs, agentic AI, NLP, and document understanding, applying relevant advancements to our system.

Collaboration:

  • Work closely with subject matter experts, product managers, and other engineers to translate complex requirements into technical solutions and evaluate system performance.

WHAT YOU'LL NEED

  • Bachelor's or Master's degree in Computer Science, Machine Learning, Artificial Intelligence, or a related quantitative field.
  • 3+ years of professional experience in Machine Learning Engineering, with a strong focus on NLP.
  • Proven experience with Language Models (LMs), including model selection, fine-tuning, and deployment.
  • Strong proficiency in Python and familiarity with ML frameworks (e.g., PyTorch, TensorFlow, Hugging Face Transformers).
  • Solid understanding and hands-on experience with core NLP techniques and architectures, especially Transformers.
  • Experience with cloud platforms, particularly Google Cloud Platform (GCP), including services like Vertex AI, Cloud Storage, and compute services.
  • Familiarity with MLOps principles and tools for model serving, monitoring, and pipeline automation.
  • Excellent problem-solving skills, attention to detail, and ability to work independently and collaboratively.
  • Active use of artificial intelligence (AI) tools and platforms to streamline workflows, enhance performance, improve decision-making, and drive innovation across business functions.
  • Curiosity and adaptability in exploring emerging AI technologies, with a mindset for continuous learning and experimentation.

What Will Make You Stand Out (Preferred Qualifications):

  • Hands-on experience building or contributing to agentic AI systems or multi-agent frameworks.
  • Direct experience with document processing technologies such as OCR, layout parsing, Document AI, or custom information extraction from unstructured text.
  • Experience with Vector Databases (e.g., pgvector, Pinecone, Weaviate, Qdrant) and RAG architectures.
  • Exposure to the healthcare domain, particularly understanding medical

Skills & Requirements

Technical Skills

Language models (LMs), NLP, Agentic AI, OCR, Document layout analysis, NER, RE, Vector databases, RAG, LoRA, QLoRA, MLOps, Google Cloud Platform (GCP), Vertex AI, GKE, Cloud Functions, Cloud Run, Data privacy, PHI/PII masking, Legal compliance, Healthcare

Employment Type

FULL TIME

Level

Senior

Posted

4/24/2026
