LLM Inference Systems Engineer (Zero-to-One, High-Performance)

Adaption Labs
New York, US
On-site

Job Description

A leading AI company located in New York is seeking to hire an experienced engineer for building and optimizing LLM inference systems. This role entails designing and implementing advanced inference techniques while collaborating closely with founders to optimize software–hardware co-design. The ideal candidate should possess strong programming skills in Python and have hands-on experience with inference frameworks. The company offers flexible work arrangements and a variety of benefits to support professional and personal growth. #J-18808-Ljbffr

Skills & Requirements

Technical Skills

Python

Employment Type

FULL TIME

Level

mid

Posted

4/9/2026

Apply Now

You will be redirected to Adaption Labs's application portal.