A company is looking for a Staff AI / ML Infrastructure Engineer to drive the design, performance, and reliability of their AI infrastructure platform.
Key Responsibilities
Design and maintain GPU and bare metal infrastructure in containerized and physical environments
Build scalable GPU clusters in partnership with networking and provisioning teams
Ensure reliable, high-performance provisioning of GPU infrastructure
Qualifications
5+ years experience working with bare metal infrastructure and hardware automation
Hands-on experience with modern NVIDIA / AMD GPU platforms and high-performance networking
Deep knowledge of BIOS, BMC, firmware, NICs, Redfish / IPMI, and PCIe systems
Strong Linux systems experience including device drivers and package management
Experience building infrastructure automation using Python and Bash
mid
4/14/2026
You will be redirected to VirtualVocations's application portal.