Data Engineer — Analytics Infrastructure (Foundational Hire)

Vast.ai
Los Angeles, US
On-site

Job Description

Vast.ai is dedicated to democratizing and decentralizing AI computing, aiming to optimize the world's computation. They are seeking a Data Engineer to build and manage their data platform, focusing on ingestion, modeling, governance, and analytics for various departments. This role involves designing schemas, implementing ETL processes, and ensuring data quality and accessibility across the company.

Responsibilities

  • Own the data pipeline: design, build, and operate batch/streaming ingestion from product, billing, CRM, support, and marketing/ad platforms into a central warehouse
  • Model the data: create clean, well‑documented staging and business marts (dimensional/star schemas) that map to the needs of Marketing, Sales, Accounting/Finance, and Operations
  • Enable: publish certified datasets with row‑/column‑level security, manage refresh SLAs, and make it easy for teams to self‑serve
  • Collaborate cross‑functionally: intake requirements, translate them into data contracts and models, and partner with Engineering on event/telemetry capture
  • Document & scale: maintain clear docs, lineage, and a pragmatic data catalog so others can discover and trust the data

Skills

  • 3+ years (typically 3–6) in a Data Engineering role building production ELT/ETL on a cloud platform (AWS strongly preferred)
  • Expert SQL and solid Python for data processing/automation
  • Proven experience designing data models (staging, marts, star schemas) and standing up a warehouse/lakehouse
  • Orchestration, scheduling, and operational ownership (SLAs, alerting, runbooks)
  • Experience enabling a BI layer (ideally QuickSight) with secure, governed datasets
  • Strong collaboration and communication; able to gather requirements from non‑technical stakeholders and translate to data contracts
  • Marketing/Sales/RevOps data (CRM, ads, attribution), Accounting/Finance integrations, or product telemetry/event pipelines
  • Stream processing (Kafka/Kinesis), CDC, or near‑real‑time ingestion
  • Data privacy/security best practices (e.g., CPRA), partitioning/performance tuning, and cost management on AWS

Benefits

  • Comprehensive health, dental, vision, and life insurance
  • 401(k) with company match
  • Meaningful early-stage equity
  • Onsite meals, snacks, and close collaboration with founders/tech leaders
  • Ambitious, fast-paced startup culture where initiative is rewarded

Company Overview

  • Vast.ai is a global GPU rental platform for saving 5-6X on GPU compute through one simple interface. Founded in 2016 and headquartered in Los Angeles, California, USA, it has a workforce of 11-50 employees. Website: https://vast.ai.

Skills & Requirements

Technical Skills

AWS, SQL, Python, ETL, Kafka, Kinesis, CDC, QuickSight, Data privacy, Security, Data engineering, Data modeling, Data governance, Analytics

Employment Type

FULL TIME

Level

mid

Posted

4/7/2026
