Data Acquisition Engineer

Abaka AI
Mountain View, US
On-site

Job Description

About Abaka AI

Abaka AI is built on one mission: to be the world’s most trusted data partner for AI companies. More than 1,000 industry leaders across Generative AI, Embodied AI, and Automotive AI rely on us to power their data pipelines. With our headquarters in Silicon Valley—and teams in Paris, Singapore, and Tokyo—we support global partners with fast, reliable, and scalable data solutions.

Our offerings include a diverse catalog of off-the-shelf datasets (image, video, multimodal, reasoning, 3D, and beyond) as well as comprehensive data collection and annotation services. Whether teams need raw data, curated datasets, or full-cycle data engineering, Abaka AI provides the foundation for building high-performance AI systems.

About The Role

As a Data Acquisition Engineer at Abaka AI, you will own and scale our raw data supply ecosystem by combining technical systems building with hands-on supplier sourcing and management. This is a 0→1 builder role focused on creating scalable, AI-native infrastructure for discovering, evaluating, onboarding, and managing data suppliers globally.

You will design and implement automation, internal tools, and AI-driven workflows to increase sourcing leverage—while also directly identifying, engaging, and managing external data partners. You will work closely with leadership to develop commercial instincts and supplier negotiation skills as you take full ownership of the data supply pipeline.

This is a high-impact role at the intersection of engineering, growth, and operations.

Responsibilities

  • Build automated pipelines and AI-driven workflows to discover and evaluate new raw data sources
  • Design and implement internal tooling for supplier tracking, scoring, and performance management
  • Experiment with scraping, APIs, enrichment tools, and automation platforms to increase sourcing efficiency
  • Aggressively identify and outreach to new data suppliers across global markets
  • Evaluate supplier quality, reliability, and scalability in partnership with internal teams
  • Manage ongoing vendor relationships, ensuring quality, cost, and delivery standards are met
  • Track supplier performance using quantitative metrics and continuously improve processes
  • Collaborate cross-functionally with Data Engineering, Research, Product, GTM, Legal, and Finance to align supply with business needs
  • Support commercial discussions and contract processes with guidance from leadership
  • Build scalable systems that increase data throughput without increasing headcount

Qualifications

  • Strong technical foundation (engineering, data, scripting, automation, or systems building)
  • Experience building projects, tools, or pipelines from 0→1
  • Comfortable using AI-native tools (e.g., LLM agents, Cursor, automation platforms, workflow builders)
  • High ownership mindset with the ability to operate independently in ambiguous environments
  • Strong written and verbal communication skills
  • Interest in AI, machine learning, and data infrastructure
  • Growth-oriented mindset with bias toward experimentation and rapid iteration
  • Experience in startup or high-growth environments preferred
  • Exposure to data pipelines, scraping, APIs, or automation workflows is a strong plus
  • Prior vendor management experience is not required

Compensation & Benefits

The base salary range for this position is $110,000 - $160,000 USD annually.

Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work at Abaka AI. This role is eligible for equity, as well as a comprehensive benefits package (health, dental, vision, PTO, flexible work schedule).

Skills & Requirements

Technical Skills

Data engineeringData pipelinesScrapingApisAutomationAi-native toolsCommunicationLeadershipStrategic thinkingAiData infrastructureData supply pipeline

Salary

$110,000 - $160,000

year

Employment Type

FULL TIME

Level

mid

Posted

5/6/2026

Continue to LinkedIn

You will be redirected to the job posting on LinkedIn.

Sign in and we'll score your resume against this role.

Find Similar Jobs

Browse roles in the same category, level, and remote setup.