AWS Data & Solutions Engineer

EXL
New York, US
On-site

Job Description

Key Responsibilities

  • Design, build and unit test applications on the Spark framework in Python.
  • Build Python and PySpark-based applications using data from relational databases (e.g., Oracle), NoSQL databases (e.g., DynamoDB, MongoDB) and filesystems (e.g., S3, HDFS).
  • Build AWS Lambda functions on the Python runtime leveraging awswrangler, pandas, json and requests.
  • Build PySpark-based data pipeline jobs on AWS Glue ETL or EMR Clusters.
  • Build Python-based event-driven integration with Kafka Topics, leveraging Confluent libraries.
  • Leverage Apache Iceberg to manage schema evolution and ACID-compliant CDC merges within the data lake.
  • Design and build API services using FastAPI, understand Swagger metadata files and implement OAuth2/JWT authentication for protected endpoints.
  • Build process orchestration pipelines using AWS Step Functions and EventBridge rules.
  • Optimize performance for data access by choosing appropriate native Hadoop file formats (Avro, Parquet, ORC) and compression codecs.
  • Deploy containerized applications using Docker and Kubernetes.
  • Leverage Copilot/GPT for agentic coding of the above tech stack.
  • Optimize Spark performance in Hadoop by tuning configurations for SparkContext, Spark SQL, DataFrames, and pair RDDs.
  • Set up Glue crawlers to catalog Oracle database tables, MongoDB collections and S3 objects.
  • Monitor, troubleshoot and debug failures using AWS CloudWatch and Datadog.
  • Resolve complex data-driven scenarios and triage production issues.
  • Participate in code release and production deployment.
  • Create documentation for user adoption, deployments, and runbooks, and support client users with enablement and issue resolution.
  • Perform code reviews with the team and enable development of code for complex scenarios.
  • Participate in the agile development process; document and communicate issues and bugs related to data standards in scrum meetings.
  • Work collaboratively with onsite and offshore teams.
  • Communicate recommendations to multiple teams and drive initiatives with strong leadership.

Education & Experience

  • Bachelor’s degree or equivalent in computer science or a related field, and a minimum of 10 years of experience.
  • AWS certifications: Solutions Architect, Data Engineer or Data Analytics Specialty.
  • Hands-on experience with Python and PySpark programming.
  • Hands-on experience with AWS S3, Glue ETL & Catalog, Lambda Functions, EventBridge, Step Functions, Athena.
  • Hands-on experience with Kafka integrations.
  • Hands-on experience with Python pandas, requests, boto3.
  • Hands-on experience in writing complex SQL queries.
  • Hands-on experience using REST APIs with FastAPI or Flask.
  • Hands-on experience building Agentic AI workflows.
  • Preferred: expertise in Snowflake, AWS Redshift and DynamoDB.
  • Ability to use AWS services, predict application issues and design proactive resolutions.
  • Technical coordination skills to drive requirements and technical design.
  • Aptitude for helping build skill sets within the organization.

Knowledge, Skills & Abilities

  • Data pipelines using Python and PySpark on AWS Glue, EMR and Lambda functions.
  • Develop and secure RESTful APIs (FastAPI) on Docker/EKS containers and implement OAuth2/JWT authentication for protected endpoints.
  • Hands-on experience with Apache Iceberg tables for CDC and latest snapshots.
  • Event-based pipelines for consuming from and publishing to Apache Kafka/MSK.
  • Lead and communicate complex technical designs and leverage Copilot/GPT for agentic coding of the above tech stack.

EXL Overview: EXL (NASDAQ: EXLS) is a leading operations management and analytics company that designs and enables agile, customer-centric operating models. For more information, visit www.exlservice.com.

EEO/Minorities/Females/Vets/Disabilities: EXL is an equal opportunity employer and will provide reasonable accommodation to those individuals who are unable to be vaccinated consistent with federal, state, and local law.

Skills & Requirements

Technical Skills

Python, PySpark, AWS S3, AWS Glue ETL & Catalog, AWS Lambda Functions, AWS EventBridge, AWS Step Functions, AWS Athena, Apache Kafka/MSK, Python pandas, requests, boto3, SQL, REST APIs, FastAPI, Flask, Snowflake, AWS Redshift, DynamoDB, Apache Iceberg, AWS CloudWatch, Datadog, Swagger metadata files, OAuth2/JWT authentication, Docker, Kubernetes, EKS, Copilot/GPT, Leadership, Communication, AWS Solutions Architect, AWS Data Engineer, AWS Data Analytics Specialty, Finance, Healthcare

Employment Type

Full-time

Level

Senior

Posted

4/14/2026
