Job Title: Clinical Data Analyst
Job Description
The Clinical Data Analyst designs, builds, and maintains clinical data systems and pipelines that support high‑quality clinical trials and advanced analytics. This role develops study-specific documentation and edit checks, programs SAS-based data validations and reports, and creates robust ETL/ELT workflows for both structured and unstructured data. The Clinical Data Analyst collaborates closely with biostatistics, data science, and other stakeholders to ensure data quality, integrity, and security across platforms while enabling AI/ML model development and deployment.
Responsibilities
- Develop study-specific annotated case report forms (CRFs), database documentation, edit check specifications, data handling conventions, and data entry instructions to support clinical trials.
- Program SAS edit checks and SAS macros for clinical trials, including ETL processes, data validations, statistical analyses, and report generation.
- Design and develop robust ETL/ELT pipelines for structured and unstructured data to ensure efficient data ingestion, transformation, and loading.
- Apply hands-on database administration skills to manage structured and unstructured data sources, identify root causes of data issues, and resolve them promptly.
- Conduct data automation tasks, including software development, testing, and validation, to streamline clinical data workflows.
- Use tools such as Power BI, R, HTML, JavaScript, and REST APIs for data extraction, visualization, and reporting, where applicable.
- Collaborate with biostatisticians, data scientists, analysts, and other stakeholders to support AI/ML model development, validation, and deployment.
- Build and optimize data systems for performance, scalability, and reliability across multiple platforms and environments.
- Ensure data quality, integrity, and security throughout the data lifecycle, from capture and storage to processing and reporting.
- Apply knowledge of the software development life cycle (SDLC) to design, code, test, and maintain data-related applications and tools.
- Develop and maintain clear documentation for data architecture, data flows, processes, and workflows to support transparency and reproducibility.
- Establish and maintain REST API connections to integrate external data sources and support automated data downloading and integration.
- Perform data entry and data management tasks as needed to support clinical trial operations and reporting.
Essential Skills
- Proficiency in SAS programming, including SAS Base and related SAS components used for data management, analysis, and reporting.
- Strong experience developing and executing SAS edit checks and SAS macros for clinical trial data.
- Hands-on experience designing and implementing ETL/ELT pipelines for structured and unstructured data.
- Practical database administration skills, including managing data sources and troubleshooting data-related issues.
- Advanced SQL knowledge and experience working with relational databases, including writing complex queries and working with a variety of database systems.
- Ability to manipulate, process, and extract value from large datasets.
- Experience working with REST APIs, including establishing and managing Web REST API connections.
- Familiarity with the software development life cycle (SDLC) and experience in coding, testing, and design of data solutions.
- Strong analytical and problem-solving skills with attention to data quality, integrity, and security.
Additional Skills & Qualifications
- Minimum of a bachelor’s degree in Computer Science, Computer Engineering, Data Science, Statistics, Informatics, Information Systems, or a related quantitative field; an advanced degree is preferred.
- Experience with data visualization tools such as Power BI and R Shiny.
- Exposure to AI/ML concepts and experience supporting AI/ML model pipelines.
- Familiarity with R and other statistical or scripting languages used in data analysis.
- Experience using HTML and JavaScript for data presentation or integration is beneficial.
- Ability to work collaboratively with cross-functional teams including biostatistics, data scientists, and business stakeholders.
- Strong written and verbal communication skills, including the ability to document processes and present findings clearly.
Work Environment
This role operates in a technology-driven, data-focused environment that supports clinical research and advanced analytics. You will work extensively with SAS, SQL, relational databases, ETL/ELT tools, and data visualization platforms such as Power BI and R Shiny. The position involves regular collaboration with biostatisticians, data scientists, and analysts to design and maintain data pipelines and systems that enable AI/ML initiatives. Work is primarily computer-based in a professional office or remote setting, with a focus on accuracy, data security, and adherence to documented processes and the software developme