Job Summary
We are seeking a Senior Data Engineer with strong hands-on experience in Databricks, PySpark, and Spark-based big data processing to design, develop, and maintain scalable data pipelines and platforms. The role involves working with large-scale structured and unstructured datasets, building robust ETL pipelines, and supporting data science and analytics initiatives. The candidate will also mentor junior engineers and collaborate with cross-functional teams to deliver data-driven insights.
Key Responsibilities
Data Engineering & Pipeline Development
- Design and develop scalable data pipelines using Databricks, PySpark, and Spark SQL.
- Build pipelines to ingest, clean, transform, and aggregate data from multiple heterogeneous sources.
- Implement and maintain Delta Lake, Delta Live Tables, and Databricks notebooks for efficient data processing.
- Develop high-performance ETL/ELT workflows for large-scale datasets (a minimal pipeline sketch follows this list).
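To give a concrete flavor of the pipeline work, here is a minimal PySpark sketch: ingest raw data, clean and aggregate it, and persist it as a Delta table. All paths, column names, and the aggregation logic are hypothetical, and a Delta-enabled Spark session is assumed.

```python
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("orders-etl").getOrCreate()

# Ingest raw JSON; the source path and columns are placeholders.
raw = spark.read.json("s3://example-bucket/raw/orders/")

cleaned = (
    raw.dropDuplicates(["order_id"])                     # de-duplicate on the business key
       .filter(F.col("order_total") > 0)                 # drop invalid records
       .withColumn("order_date", F.to_date("order_ts"))  # normalize timestamp to date
)

daily = cleaned.groupBy("order_date").agg(
    F.count("order_id").alias("order_count"),
    F.sum("order_total").alias("revenue"),
)

# Delta Lake provides ACID writes and time travel on top of cloud storage.
daily.write.format("delta").mode("overwrite").save("s3://example-bucket/gold/daily_orders/")
```

In practice, the same logic would typically be packaged as Delta Live Tables or scheduled Databricks jobs, with configuration, tests, and monitoring around it.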
Big Data Processing
- Work with big data technologies such as Hadoop, Spark, Kafka, Hive, and HDFS, as well as cloud platforms.
- Implement high-volume stream processing solutions using Apache Kafka and Spark Structured Streaming (see the sketch after this list).
- Ensure efficient data storage and retrieval for analytics, machine learning, and reporting.
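As an illustration of the streaming side of the role, the sketch below consumes a Kafka topic with Spark Structured Streaming, applies a watermarked windowed aggregation, and writes the result to Delta. The broker address, topic name, schema, and paths are all assumptions, not a prescribed design.

```python
from pyspark.sql import SparkSession, functions as F
from pyspark.sql.types import StructType, StructField, StringType, DoubleType, TimestampType

spark = SparkSession.builder.appName("events-stream").getOrCreate()

# Hypothetical event schema; broker and topic are placeholders.
schema = StructType([
    StructField("event_id", StringType()),
    StructField("amount", DoubleType()),
    StructField("event_ts", TimestampType()),
])

events = (
    spark.readStream.format("kafka")
         .option("kafka.bootstrap.servers", "broker:9092")
         .option("subscribe", "events")
         .load()
         .select(F.from_json(F.col("value").cast("string"), schema).alias("e"))
         .select("e.*")
)

# Watermark bounds the state kept for late-arriving data.
per_minute = (
    events.withWatermark("event_ts", "10 minutes")
          .groupBy(F.window("event_ts", "1 minute"))
          .agg(F.sum("amount").alias("total"))
)

query = (
    per_minute.writeStream.format("delta")
              .option("checkpointLocation", "s3://example-bucket/chk/events/")
              .outputMode("append")
              .start("s3://example-bucket/silver/events_per_minute/")
)
```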
Data Quality & Governance
- Define and implement data validation, quality checks, and normalization procedures (a sketch follows this list).
- Develop data policies, retention models, and anonymization frameworks.
- Maintain governance standards for secure and reliable data access.
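A bare-bones example of the kind of validation this involves, as a PySpark sketch. The table name, columns, and checks are illustrative; a production pipeline would more likely use a framework such as Great Expectations or Delta Live Tables expectations.

```python
from pyspark.sql import SparkSession, functions as F

spark = SparkSession.builder.appName("dq-checks").getOrCreate()

# Hypothetical input table; thresholds and rules are illustrative.
df = spark.read.table("silver.customers")

checks = {
    "no_null_keys": df.filter(F.col("customer_id").isNull()).count() == 0,
    "unique_keys":  df.count() == df.select("customer_id").distinct().count(),
    # Format check applies to non-null emails only; null emails pass silently here.
    "valid_email":  df.filter(~F.col("email").rlike(r"^[^@]+@[^@]+$")).count() == 0,
}

failed = [name for name, passed in checks.items() if not passed]
if failed:
    # A real pipeline might quarantine bad rows or alert an on-call engineer instead.
    raise ValueError(f"Data quality checks failed: {failed}")
```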
Data Modeling & Analytics Support
- Work closely with Data Science and Business Intelligence teams to design data models.
- Prepare datasets for analytics, BI reporting, machine learning, and advanced insights generation.
- Support visualization platforms such as Tableau, Power BI, Spotfire, or Oracle Analytics Cloud (OAC).
Collaboration & Leadership
- Engage with business teams to gather requirements and design data solutions.
- Lead and mentor junior data engineers and provide guidance on best practices.
- Collaborate across projects to provide data engineering expertise and strategic insights.
Required Skills & Experience
- 7+ years of overall IT experience.
- 5+ years of experience in Data Engineering or ETL development.
- Strong hands-on experience with Databricks and PySpark.
- Expertise in Apache Spark, Spark SQL, and big data frameworks.
- Experience with Delta Lake, Unity Catalog, and Databricks notebooks.
- Strong SQL skills with the ability to write intermediate-to-advanced queries (see the example after this list).
- Experience building data ingestion pipelines and large-scale data architectures.
- Familiarity with Agile methodologies (Scrum, Kanban, SAFe).
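For a sense of the expected SQL level, the following Spark SQL query (run through PySpark) uses a CTE and a window function to find each customer's largest order. The table and column names are hypothetical.

```python
from pyspark.sql import SparkSession

spark = SparkSession.builder.appName("sql-example").getOrCreate()

top_orders = spark.sql("""
    WITH ranked AS (
        SELECT customer_id,
               order_date,
               order_total,
               ROW_NUMBER() OVER (
                   PARTITION BY customer_id
                   ORDER BY order_total DESC
               ) AS rn
        FROM gold.orders
    )
    SELECT customer_id, order_date, order_total
    FROM ranked
    WHERE rn = 1  -- each customer's single largest order
""")
top_orders.show()
```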
Preferred Skills
- Programming in Python and/or Scala.
- Experience with cloud platforms (AWS, Azure, or GCP).
- Experience with messaging systems such as Kafka, Amazon MSK, IBM MQ, or TIBCO EMS.
- Experience with data platforms and databases such as Databricks, Teradata, DB2, BigQuery, or mainframe systems.
- Knowledge of serverless AWS technologies such as S3, Lambda, Glue, and Kinesis.
- Experience with Git-based version control systems.
Competencies
- Strong expertise in Databricks and the Spark ecosystem.
- Ability to work with large-scale distributed data systems.
- Excellent problem-solving and analytical skills.
- Strong communication and leadership capabilities.