Data Platform Engineer

Trulioo
San Diego, US
Hybrid

Job Description

Are you ready to embark on a career that truly affects people around the world? Trulioo invites you to be a catalyst for change in the dynamic realm of digital identity verification. As the global front-runner in our industry, we are redefining how businesses grow, innovate and comply online.

Picture yourself at the forefront of innovation, contributing to our award-winning platform that enables organizations worldwide to quickly onboard customers, optimize costs and combat fraud. Fueled by Silicon Valley support, Trulioo stands as the trusted platform that can verify more than 5 billion people and 700 million business entities spanning 195 countries.

But Trulioo is more than a tech company. We are a united force of dedicated experts committed to establishing trust online - and we’re proud to be recognized as a BC Top Employer for the second consecutive year, reflecting our commitment to an inclusive, collaborative, and people-first workplace.

Headquartered in Vancouver and with strategic hubs in San Diego and Dublin, we foster a culture of collaboration and open communication. Our offices support a hybrid model and staff typically work three days per week at a hub location. Join us where excitement meets innovation and contribute to a world where trust and technology unite.

What We Offer

Comprehensive Benefits: We provide a robust benefits package for full-time, permanent employees, including health, dental, and vision coverage, retirement plans with company match, paid time off, parental leave, and an annual education & training stipend (equivalent to $1,000 in local currency). Specific benefits may vary by location and will be discussed further during the interview process.

Flexible Hybrid Working Environment: Our offices are designed to support both collaboration and flexibility. Enjoy weekly lunches, quality coffee, and regular social events. Many locations also feature parent rooms, on-site gyms, comfortable lounges, and adaptable workstations to support your comfort and productivity.

Wellness: We care about your well-being. Team members have access to wellness workshops and events, as well as a complimentary Headspace subscription to help you stay focused, grounded, and energized.

Employee Resource Groups: Belonging is an important part of doing your best work. Our ERGs provide an inclusive space, support and community for employees of diverse backgrounds and allies. We host informative, fun sessions and celebrations that are often open to the entire organization.

Position Summary

We are seeking a skilled Data Platform Engineer with a proven track record of innovation to design, build, and maintain robust data systems that power person and business search and verification services. The ideal candidate has experience integrating diverse data sources, developing scalable ETL pipelines, and applying ML techniques to improve data quality, entity resolution, and semantic search.

This role spans multiple data systems — relational, NoSQL, vector databases, search engines, and optionally graph databases — offering opportunities to tackle complex data challenges at scale.

This will be a full-time, permanent position, working out of the San Diego office on a hybrid model (3 days per week in the office).

What You’ll Be Doing:

Build, optimize, and maintain data ingestion and transformation pipelines from multiple sources (internal systems, vendor data, web data, APIs).

Design and implement data models using the most suitable tool for the task — SQL, NoSQL, GraphDBs, or VectorDBs.

Integrate machine learning models into pipelines for entity resolution, de-duplication, semantic enrichment, and embedding generation.

Work with Vector Databases (e.g., AWS S3 Vector, PostgresVectorDb, OpenSearch) to support similarity and semantic search applications.

Collaborate with data scientists, software engineers, and analysts to deliver reliable, high-performance data infrastructure.

Ensure data quality, consistency, and performance monitoring across all pipelines and systems.

What You’ll Bring:

5+ years of professional software development or data engineering experience.

Strong programming skills in Python.

Experience with data modeling and schema design in SQL and NoSQL systems.

Experience designing and maintaining data pipelines (Airflow, Dagster, Prefect, or similar).

Proficiency with cloud-based data services (AWS, GCP, Azure).

Proficiency in multiple programming languages.

Experience with entity resolution or record linkage algorithms.

Experience incorporating ML workflows into ETL pipelines.

Hands-on experience with Vector Databases and embedding-based search pipelines.

Familiarity with graph databases (Neo4j, Neptune, or Gremlin) for ETL, modeling, and querying.

Experience with OpenSearch / Elasticsearch, including index creation, tuning, and advanced queries.

Familiarity with streaming data systems (Kafka, Kinesis) or distributed processing fra

Skills & Requirements

Technical Skills

Etl pipelinesData modelsEntity resolutionRecord linkageMl techniquesVector databasesEmbedding-based search pipelinesGraph databasesOpensearchElasticsearchStreaming data systemsKafkaKinesisData engineeringData warehousing

Salary

$1,000+

week

Employment Type

FULL TIME

Level

mid

Posted

4/25/2026

Continue to Indeed

You will be redirected to the job posting on Indeed.