Senior Deep Learning Tools Engineer – CUDA Tile

NVIDIA
Seattle; Washington, US
On-siteCareer-pivot friendly

Why this role

Pace
Fast Paced
Collaboration
High
Autonomy
Medium
Decision Impact
Team
Role Level
Individual Contributor
Career Pivot Friendly
Welcomes transferable skills

What success looks like

  • design performance testing frameworks
  • build automated pipelines
  • implement benchmarking systems
  • analyze performance trends
  • develop tools and dashboards
Typical background
5+ years of software engineering experiencestrong programming skills in Python

Transferable backgrounds

Skills & requirements

Required

performance testing frameworksautomated pipelinesbenchmarking systemsperformance analysisdebuggingtools developmentdashboardsscalable testinginfrastructure improvement

Preferred

GPU performance analysiscompiler internalsperformance dashboardshardware/software co-designdistributed testing infrastructure

Stack & domain

Performance testing frameworksCi/cd systemsAutomation frameworksHardware-aware performance analysisDeep learning frameworksData analysisProfilingRegression trackingGpu performance analysisCompiler internalsPerformance dashboardsLarge-scale telemetry systemsHardware/software co-designLow-level performance tuningDistributed testing infrastructureLarge-scale benchmarking systemsCollaborationDebuggingProblem-solvingInnovationDeep learningAiHpcPerformance engineeringBenchmarkingSystems optimization

About the role

NVIDIA is building advanced compiler technologies to accelerate AI workloads, and we are looking for an engineer focused on performance validation, analysis, and tracking. In this role, you will work at the intersection of deep learning compilers, GPU systems, and automation infrastructure, ensuring that performance improvements are measurable, scalable, and continuously validated over time.

Do you want to help drive the performance of next-generation compilers? Are you excited by how GPU performance powers breakthroughs in deep learning, autonomous systems, and high-performance computing?…

Similar roles