AI Platform Systems Engineer

NVIDIA
Toronto, CA; US
On-site

Job Description

Drive the development of a cutting-edge AI platform as a Systems Engineer. Focus on building efficient solutions for inference and training of large-scale models using advanced technologies.

This role involves collaborating on a unified AI platform, enhancing NVIDIA technologies including inference frameworks and ML compilers. You'll design solutions for scheduling AI workloads on GPU clusters and tackle industry-scale challenges like resource management and GPU scheduling. Your contributions will extend to collaboration with teams focusing on workload management and performance prediction technologies.

Key Responsibilities :

  • Develop AI platform for training and serving AI models
  • Design scheduling solutions for GPU cluster workloads
  • Explore solutions for resource management and GPU scheduling
  • Collaborate with adjacent teams on platform development
  • Contribute to live workload migration strategies

Requirements :

  • Bachelor’s degree in Computer Science or related field
  • 5+ years of relevant experience in large-scale systems
  • Strong coding skills in Python, Go, Rust, or C / C++
  • Experience with container-based deployments like Kubernetes
  • Solid understanding of algorithms, operating systems, and AI technologies

Utilize your systems engineering expertise in AI to innovate and optimize model performance on our advanced platform.

J-18808-Ljbffr

Skills & Requirements

Technical Skills

PythonGoRustC / c++KubernetesAlgorithmsOperating systemsAi technologiesCollaborationInnovationOptimizationAiPlatform development

Employment Type

FULL TIME

Level

senior

Posted

4/12/2026

Apply Now

You will be redirected to NVIDIA's application portal.