Staff AI/ML Infrastructure Engineer

VirtualVocations
Miami, US

Job Description

A company is looking for a Staff AI / ML Infrastructure Engineer to drive the design, performance, and reliability of their AI infrastructure platform.

Key Responsibilities

Design and maintain GPU and bare metal infrastructure in containerized and physical environments

Build scalable GPU clusters in partnership with networking and provisioning teams

Ensure reliable, high-performance provisioning of GPU infrastructure

Qualifications

5+ years experience working with bare metal infrastructure and hardware automation

Hands-on experience with modern NVIDIA / AMD GPU platforms and high-performance networking

Deep knowledge of BIOS, BMC, firmware, NICs, Redfish / IPMI, and PCIe systems

Strong Linux systems experience including device drivers and package management

Experience building infrastructure automation using Python and Bash

Skills & Requirements

Technical Skills

bare metal infrastructurehardware automationNVIDIA / AMD GPU platformshigh-performance networkingBIOSBMCfirmwareNICsRedfish / IPMIPCIe systemsLinux systemsdevice driverspackage managementPythonBash

Level

mid

Posted

4/14/2026

Apply Now

You will be redirected to VirtualVocations's application portal.