Senior AI Engineer — Inference & Agent Systems - NY/SF

RS Global Services
San Francisco, US
Hybrid | Visa Sponsorship

Job Description

What our client is looking for:

Our client is seeking a Senior AI Engineer with 3-7 years of experience who has a proven track record of building and shipping AI applications used daily by real, paying customers. You should be comfortable in a fast-paced startup environment and have a history of building production systems resilient to LLM non-determinism. Bonus points if you have experience in the finance domain.

What you'll do:

  • Drive inference optimization to bring Time to First Token (TTFT) below 400 ms for multi-step agent pipelines.
  • Take ownership of our evaluation framework end-to-end, building ground truth datasets and automated scoring pipelines to detect regressions.
  • Work closely with the CTO to design and implement Plan-Execute-Synthesize agent pipelines that run sub-agents in parallel.
  • Integrate the latest state-of-the-art models (such as Claude, GPT, and Gemini) into our system and configure them for our analytics engine.
  • Lead the development of reliable orchestration on top of Temporal, implementing retries, timeouts, and partial failure recovery.
  • Build and maintain observability infrastructure to trace every token, tool call, and synthesis step.

Seniority

  • 3-7 years of experience in AI/ML engineering, shipping AI applications in production.

Work experience

  • Experience at an AI company that has seen scale with real, paying customers: Series A+ or demonstrated product-market fit required.
  • Experience integrating LLMs (Claude, GPT, Gemini) into production applications, with depth in at least one or two of the following: inference optimization, agent architecture, or evals.
  • Finance or fintech experience (portfolio analytics, trading systems, or financial data applications; not HFT or pure quant engineering).

Education

  • CS or closely related technical degree

Hard skills

  • Proficiency in Python for AI/ML development.
  • Experience with LLM orchestration frameworks and production AI infrastructure.
  • Experience with Go (Golang) is a plus.

Soft skills

  • Strong individual contributor (IC) focused on building.

Miscellaneous

  • Located in or willing to relocate to NY or SF; open to candidates in other Tier 1 tech hubs (e.g., Seattle).

Salary: $175K - $250K

Equity: Competitive equity

Visa sponsorship available: Open to sponsoring visas on a case-by-case basis.

Hybrid work policy: This role is hybrid, based in our New York or San Francisco offices. NY/SF is strongly preferred, but we are willing to consider strong candidates for a remote position if they are based in a Tier 1 city (e.g., Seattle).

Skills & Requirements

Technical Skills

Python, Go (Golang), Problem-solving, Communication, AI, ML, Finance

Salary

$175,000 - $250,000 per year

Employment Type

Full-time

Level

Senior

Posted

4/27/2026
