Individuals must be legally authorized to work in the United States without the need for immigration support or sponsorship from Milliman now or in the future.
Milliman’s Health Research Team (HRT) is seeking an enthusiastic and detail-oriented Senior Data Engineer to join our team, under the guidance of the Senior Manager, Health Data Analytics. In this role, you will design, maintain, and enhance data ETL pipelines, ensuring the smooth ingestion and transformation of data ultimately destined for Databricks. You will primarily work with Python, PySpark, and SQL, using AWS services like EC2 and S3. Additionally, you will process healthcare data through a variety of Milliman tools.
This role is based out of the Milliman office in Chicago, IL, but candidates hired into this role may work remotely anywhere in the US or in the office on a weekly basis with flexible work arrangements. Travel to Milliman’s Chicago office for three consecutive days each quarter is required.
Who we are:
The HRT supports Milliman products and practices in a variety of capacities. These include but are not limited to financial functions, client support, project management, data engineering, and data analytics services. As a shared service team, we pride ourselves on delivering high-quality, timely support to Milliman's clients, consultants, and leadership teams.
Job Responsibilities:
- Work closely with key stakeholders to ensure data pipelines meet business objectives and data transformations meet data analyst requirements
- Write, incorporate, and maintain Databricks’ notebook scripts within our standard data and tool processing jobs
- Ensure the integrity of the HRT’s data pipelines, Databricks lakehouse, APIs, and Milliman’s products by conducting in-depth validation reviews and by applying data governance and retention policies
- Lead comprehensive data pipeline reviews across HRT's data assets, ensuring accuracy, completeness, consistency, and timeliness of healthcare data pipelines and downstream products
- Design, develop, and execute systematic data validation frameworks to identify anomalies, discrepancies, and integrity issues within our data pipelines and product tools
- Investigate and document root causes of data pipeline failures, partnering with data engineering and analytics teams to implement corrective actions and preventive controls
- Establish and maintain data quality metrics, scorecards, and dashboards to monitor ongoing pipeline health and communicate findings to stakeholders at all levels
- Deploy workflows into production environments, ensuring robust operation, monitoring performance, and promptly addressing any pipeline issues, to maintain reliability and stability
- Collaborate with other HRT data analytics, engineering, and project management professionals to identify, propose, and implement process enhancements
- Assist in developing ad hoc data workflows, which support Health Research Board (HRB) approved initiatives and Milliman practice client projects
- Provide support to Milliman’s practices via our help desk system by answering a variety of questions, most often about our data assets and data use policies
- Collaborate with HRT finance, operations, and project management teammates by providing technical and healthcare data expertise
- Communicate progress, status, issues, and decisions on a regular cadence
Minimum requirements:
- A bachelor’s degree in Computer / Software Engineering, Computer Science, Information Technology, Electrical Engineering, Data Science, Actuarial Science, or similar fields
- 6+ years of relevant experience in data engineering or related roles, with advanced programming in SQL or Python languages
- Demonstrates a strong capability to consistently produce high-quality, accurate deliverables on schedule and effectively communicate progress, outcomes, and recommended next actions
- Four or more years of experience in the following areas:
  - Building data pipelines for enrollment, claims (medical or pharmacy), risk scores, or other common healthcare data sources
  - Data modeling and architecture within Databricks or similar cloud-based analytics and AI platforms
  - Utilizing distributed data processing frameworks, preferably Apache Spark (PySpark/Spark SQL)
  - Using GitHub for version control, code review, and branching strategies
  - Applying best data governance and retention practices
- Practicing intermediate or higher knowledge of other Microsoft Office applications, including Teams, Word, PowerPoint, SharePoint, and Outlook, as well as Adobe Acrobat
- Travel to Milliman’s Chicago office for three consecutive days each quarter is required
Competencies and Behaviors that Support Success in this Role
- Ability to use REST API to perform standard database functions
- Experience migrating between AWS and Azure
- Competent with coding in the SAS Windowing Environment
- General business knowledge of at least two healthcare lines of business (e.g. individual, employer group, Medicare, Part D, Medica