Senior Machine Learning Systems Engineer (Ads Infrastructure)

Mintegral
San Francisco, US

Job Description

About Us

Mintegral is a leading programmatic and interactive mobile advertising platform, starting from the APAC region and radiating out globally. Powered by advanced AI technology, we provide global advertisers and developers with innovative, comprehensive experiences. With our efficient mobile marketing and monetization solutions, we help our clients exceed their marketing goals.

As Mobvista’s self-developed programmatic platform, since Launched in 2015, Mintegral has quickly grown to become one of the largest mobile advertising platform in Asia. We offer a full stack of programmatic products and services including our Self-service Platform, DSP, SSP, Ad Exchange and DMP. We have also created the Mindworks Creative Studio, which offers publishers and brands cutting-edge creative solutions, from traditional creative right through to the latest interactive ad formats. For more information, please visit our website: https://www.mintegral.com/en/

About the Role

We are seeking a Senior Machine Learning Systems Engineer to architect and scale next-generation advertising ranking and serving infrastructure.

You will build large-scale real-time ML inference systems powering ad ranking, retrieval, and prediction across global traffic, focusing on distributed systems, ML infrastructure, and high-performance computing.

Responsibilities

  • Design end-to-end ads ranking and serving architectures for real-time bidding and recommendation systems
  • Build decoupled and disaggregated inference pipelines across CPU and GPU layers
  • Optimize latency for high-QPS ad delivery systems
  • Develop and optimize ML deployment pipelines for heterogeneous CPU/GPU environments
  • Improve model freshness and inference performance
  • Enable rapid iteration of ML models in production
  • Design large-scale embedding storage and retrieval systems
  • Build adaptive sharding strategies across heterogeneous hardware
  • Improve load balancing and system stability
  • Optimize QPS, latency, and throughput
  • Identify bottlenecks in inference pipelines
  • Improve end-to-end ranking performance
  • Build AOT compilation frameworks for ML models
  • Translate models into optimized C++/CUDA/ROCm execution
  • Improve inference efficiency across hardware backends
  • Work with engineers, and product teams
  • Productionize ML models
  • Define scalability and reliability standards

Required Qualifications

  • 4+ years in distributed systems or ML infrastructure
  • Experience in ML serving or recommendation systems
  • Strong distributed systems and performance optimization background
  • Experience with CPU/GPU systems
  • C++ / Python proficiency
  • ML frameworks experience

Preferred Qualifications

  • Ads tech or recommendation systems experience
  • ML compiler or inference runtime experience
  • GPU optimization (CUDA/ROCm)
  • High-scale system experience
  • High-level ownership experience

Impact

  • Enable large-scale real-time ads ranking
  • Improve inference efficiency and cost
  • Accelerate ML deployment cycles
  • Build foundational ads infrastructure

Skills & Requirements

Technical Skills

Distributed systemsMl infrastructurePerformance optimizationCpu/gpu systemsC++PythonMl frameworksAds infrastructureMachine learning

Level

senior

Posted

4/21/2026

Continue to LinkedIn

You will be redirected to the job posting on LinkedIn.