Zephyr is building an innovative AI platform to change the way we treat cancer, diabetes, and other chronic diseases. By aggregating massive data sets and harnessing advanced technologies and AI to increase our understanding of biology, Zephyr discovers insights that will transform how new therapies are developed and how we treat patients. We will use that knowledge to devise interventions that enable people to live longer and healthier lives. Working in close partnership with industry-leading institutions across academia, biopharma, and care delivery, Zephyr is advancing our understanding of how to characterize and treat chronic diseases. With an initial focus on cancer and diabetes, Zephyr is working to revolutionize drug development, reform clinical trials, and change healthcare to impact patient lives. Zephyr is based in Tysons Corner, VA, and currently operates as a remote-first organization.




We are hiring a Clinical Informatics Analyst to join our growing team. As a Clinical Informatics Analyst at Zephyr AI, you will be responsible for conducting complex analysis of clinical and biomedical data for integrity, trends and improvement opportunities which support effective and efficient development of machine learning models and achievement of strategic, clinical and operational goals. You will leverage clinical knowledge of processes, workflows and evidence-based practice to report information in a meaningful and understandable way. You will be involved in the design, development, testing, debugging, integrating, implementing, training and support of programs and applications. Joining Zephyr AI at the current stage will give you the opportunity to help build the company’s data management infrastructure and participate in making critical early stage contributions.


  • Interrogate data coming from a variety of sources to support quality improvement initiatives, compliance with regulations/standards and organizational goals

  • Conduct thorough data analysis including statistical analysis; evaluate findings in comparison to current and past trends, benchmark data from various RWE data providers and standardize information from various sources; develop and analyze data utilizing programs and tools to facilitate data-driven organizational decisions and achieve quality goals; interpret visual and summarized data findings.

  • Ensure data integrity by following data governance principles (e.g. data transformations, data quality, metadata management, data privacy); ensure that data is entered into respective databases, establish data definitions and perform data validation by verifying accuracy and completeness of data; collaborates with the organization’s software engineers, data scientists, and cancer biologists.

  • Test and debug coded programs to meet project specification requirements; detect and analyze syntax or logic errors in code; modify code to correct errors and increase operating efficiency or adapt to new requirements.

  • Convert user requests of modifications to data into functional specifications; develop expertise with datasets, data repositories and ETL (extract, transform, load) processes.

  • Critically profile data, troubleshoot and provide fixes for data pipeline inputs and outputs.

  • Collaborate with computational biologists, software engineers, and data scientists to develop and review machine learning models using common data science principles.

  • Effectively communicate technical clinical subjects to both technical and non technical team members.

  • Use AGILE product development to create, review and iterate on data transformation requirements, mapping and ETL to deliver best in class reusable data capabilities to integrate clinical data.

  • Use cloud technologies (AWS S3, Athena) to define transformations and manipulation  of standardized (SNOMED CT, LOINC, NDC, RxNorm, ICD-10 CM/PCS, HCPCS/CPT etc) and text representations of healthcare data  to ML ready data.


 Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions 


  • BS/MS in biological sciences, medically related field or computer/database related field with direct experience in oncology data.

  • Deep understanding of the delivery of oncology care in the real world to understand and interpret biomedical data derived from oncology EHRs and associated claims.

  • 5+ years of experience with the management of biomedical data with particular focus on EHR and claims data (Real World Data). Preference will be given to individuals with experience in multiple data types and those with clinical trial data experience as well. Must have experience with genetic data.

  • Broad knowledge of existing data standards in use in healthcare and biotech. Deep knowledge of oncology data. Familiarity with multi-omics data preferred.

  • Experience with medical code sets (HCPCS/CPT, ICD-10 CM/PCS, NDC, SNOMED CT, LOINC, RxNorm, etc).

  • Direct experience in applying a variety of tools and techniques to organize, analyze and visualize RWD using relevant technologies (especially Python).

  • Experience with modern scalable database and data lake technologies (SQL, Spark, etc.).

  • Experience with github preferred.

    Valid working visa or H-1B needed for all non-US personnel.

You will be a step ahead if you have:

  • Familiarity with cloud-based systems (AWS experience preferred).

  • Experience with HIPAA expert determination and  de-identification technology.

  • Experience working in a fast-paced environment.



This is a full-time, exempt, position, reporting to the Chief Data Officer. This position requires the ability to work cross functionally within the organization. 


We offer competitive compensation as well as a comprehensive benefits package including:


  • 100% Company Paid Medical/Dental/Vision Insurance 

  • Generous paid time off

  • Paid holidays

  • 401(k) program 

  • Voluntary life and disability plans

  • Employee assistance program (EAP)

  • Opportunities for advancement



We are an equal opportunity employer


Zephyr AI provides equal employment opportunities (EEO) to all applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, disability, genetic information, marital status, amnesty, or status as a covered veteran in accordance with applicable federal, state and local laws. Zephyr AI complies with applicable state and local laws governing non-discrimination in employment in every location in which the company operates. This policy applies to all terms and conditions of employment, including, but not limited to, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training.



This position has been filled. Would you like to see our other open positions?