AI Specialist (AI Engineering)

Hyphen Connect

Washington, US

Remote

Job Description

We are looking for an AI Specialist to improve the performance of large language and vision models for on-device inference. You will develop and deploy AI solutions that run efficiently across diverse hardware architectures.

Responsibilities

Compress and optimize large language and vision models for on-device inference.
Develop pipelines for model distillation and hardware-specific compilation.
Benchmark performance across various NPU/GPU architectures.

Qualifications

Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques.
Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.
Strong C++ and Python skills.

Skills & Requirements

Technical Skills

TensorRT
ONNX Runtime
C++
Python
AI
Large language models
Vision models
On-device inference

Employment Type

Full-time

Level

Mid-Level

Posted

4/24/2026
