A company is looking for a Research Engineer - RL Infrastructure.
Key Responsibilities
Build and optimize systems infrastructure for large-scale RL and distributed training workloads
Improve training efficiency across compute, memory, networking, and scheduling layers
Design and implement performance optimizations and contribute to open-source libraries
Required Qualifications
Strong systems engineering experience in AI / ML infrastructure
Familiarity with PyTorch and distributed training frameworks
Experience optimizing training performance across various dimensions
Hands-on experience with large-scale training techniques
Understanding of GPU architecture and performance debugging
mid
3/26/2026
You will be redirected to VirtualVocations's application portal.