Senior AI Systems Development Engineer

Advanced Micro Devices
Markham, US
On-site

Job Description

Shape cutting-edge AI initiatives as a dedicated Senior Software Development Engineer. This role focuses on model execution, optimizing training infrastructure, and enhancing inference serving on advanced GPU architectures.

In this position, you will be responsible for managing the entire model execution stack on high-performance GPU systems. Key responsibilities include refining large-scale training processes for LLMs and addressing computational challenges with innovative solutions. Your expertise will directly influence the performance and efficiency of frontier models, making a significant impact in the AI domain.

Key Responsibilities :

  • Optimize large-scale model training on GPU clusters
  • Develop and maintain job orchestration and storage solutions
  • Resolve training-specific issues across GPU architectures
  • Write high-performance GPU kernels in relevant frameworks
  • Collaborate on next-gen GPU integration and design

Requirements :

  • Hands-on AI / ML infrastructure experience
  • Familiarity with frontier models and AMD hardware
  • Background in validation frameworks and proxies
  • Experience with large-scale distributed GPU systems
  • Advanced degree in related technical fields

Leverage your talents in GPU optimization and AI infrastructure to drive breakthrough solutions in advanced computing applications.

J-18808-Ljbffr

Skills & Requirements

Technical Skills

AiMlGpuPython

Employment Type

FULL TIME

Level

senior

Posted

4/11/2026

Apply Now

You will be redirected to Advanced Micro Devices's application portal.