Research Engineer

VirtualVocations
Santa Ana, US

Job Description

A company is looking for a Research Engineer - RL Infrastructure.

Key Responsibilities

Build and optimize systems infrastructure for large-scale RL and distributed training workloads

Improve training efficiency across compute, memory, networking, and scheduling layers

Design and implement performance optimizations and contribute to open-source libraries

Required Qualifications

Strong systems engineering experience in AI / ML infrastructure

Familiarity with PyTorch and distributed training frameworks

Experience optimizing training performance across various dimensions

Hands-on experience with large-scale training techniques

Understanding of GPU architecture and performance debugging

Skills & Requirements

Technical Skills

Systems EngineeringAI / ML InfrastructurePyTorchDistributed Training FrameworksGPU ArchitecturePerformance DebuggingProblem SolvingTeamworkCommunicationMachine LearningArtificial IntelligenceDistributed TrainingPerformance OptimizationOpen-Source Libraries

Level

mid

Posted

3/26/2026

Apply Now

You will be redirected to VirtualVocations's application portal.