ABOUT US 


Zephyr is building an innovative AI platform to change the way we treat cancer, diabetes, and other chronic diseases. By aggregating massive data sets and harnessing advanced technologies and AI to increase our understanding of biology, Zephyr discovers insights that will transform how new therapies are developed and how we treat patients. We will use that knowledge to devise interventions that enable people to live longer and healthier lives. Working in close partnership with industry-leading institutions across academia, biopharma, and care delivery, Zephyr is advancing our understanding of how to characterize and treat chronic diseases. With an initial focus on cancer and diabetes, Zephyr is working to revolutionize drug development, reform clinical trials, and change healthcare to impact patient lives. Zephyr is based in Tysons Corner, VA, and currently operates as a remote-first organization.


WE ARE HIRING A 

SENIOR INFRASTRUCTURE & SECURITY ENGINEER

We are looking for a talented and driven Sr. Infrastructure & Security Engineer to join our team of innovators pushing the boundaries of precision medicine. As a member of our growing multidisciplinary team, you’ll help shape, build, and grow the foundations of our AI platform. We are excited for the perspectives you bring and looking forward to your approach to our goals. You will be responsible for contributing to, operating, and improving all aspects of our infrastructure and the developer experience with the infrastructure. You are able to maintain a holistic view of the development, testing, and production environments, while collaborating with all members of the technical team to support, maintain, and deploy the best products possible. The ideal candidate will be very experienced with Amazon Web Services, infrastructure-as-code tools like CloudFormation or Terraform, and have a strong desire to foster and grow an empathic and cooperative engineering culture.

ABOUT THE TEAM

Behind the scenes, the team is working to improve the durability and efficiency of our products by improving the tooling, deployment, and shared compute layer, while optimizing for cost, scale, and fault-tolerance on AWS. As Zephyr AI grows, these will be key initiatives enabling Zephyr AI developers the ability to deliver operationally excellent and highly available products to our customers.


The team's scope of work includes the scaling, observability and security of systems for continuous integration and deployment (CI/CD), access control, benchmarking, observability, incident response, database replication, cost engineering, and more.

ESSENTIAL DUTIES AND RESPONSIBILITIES 

  • Help improve our software development lifecycle from local development to production.

  • Actively identify, plan, and implement developer tooling and automation.

  • Participate in on-call rotations and assist on-call engineering teams with diagnostics and troubleshooting of platform and infrastructure.

  • Take an active role in defining our overall architecture and our approaches to scaling, observability and security.


What you will get from us:

  • Opportunities to solve problems of scale, debt, and security to evolve our technology and get our life changing technology into the hands of our customers.

  • A strong voice in what we work on, how it works, and how we build it.

  • Our trust in your ownership of your work.

  • Dedicated budget for training and career development.

  • Coworkers who you'll learn from and who are looking to learn from you.


 Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions 

SKILLS TO BE SUCCESSFUL IN THE ROLE 

  • 6+ years in infrastructure engineering, site reliability or roles with concentrations on developer tooling and CI/CD.

  • Track record of building great development experiences with custom tools and automation.

  • Proven ability to design and develop scalable, efficient, and cost-conscious fault-tolerant systems.

  • Experience supporting applications in one or more of Python, Typescript or Scala environments.

  • Expertise implementing infrastructure as code (Terraform, Cloudformation, Helm, etc.) and with containers and container orchestration systems (Docker, podman, K8S, ECS, etc.).

  • Expert knowledge of metrics, logging, observability systems in cloud environments.

  • Linux system administration expertise.


You will be a step ahead if you have:

  • Deep experience with IAM and implementation of hybrid RBAC/ABAC policy environments.

  • Database management expertise, especially the AWS flavors (RDS, Aurora).

  • Security and compliance operations experience, especially for SOC2, HIPAA, or HiTRUST control frameworks.

  • Any code, writing, or projects that are public or shareable demonstrating experience or understanding of how operational excellence is key to delivering great products.

  • Familiarity with pharmaceutical or biotech industry and meeting regulatory requirements.


WHAT WE OFFER 

This is a full-time, exempt, position, reporting to the Director of Engineering. This position requires the ability to work cross functionally within the organization. 


We offer competitive compensation as well as a comprehensive benefits package including:


  • 100% Company Paid Medical/Dental/Vision Insurance 

  • Generous paid time off

  • Paid holidays

  • 401(k) program 

  • Voluntary life and disability plans

  • Employee assistance program (EAP)

  • Opportunities for advancement



We are an equal opportunity employer


Zephyr AI provides equal employment opportunities (EEO) to all applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, disability, genetic information, marital status, amnesty, or status as a covered veteran in accordance with applicable federal, state and local laws. Zephyr AI complies with applicable state and local laws governing non-discrimination in employment in every location in which the company operates. This policy applies to all terms and conditions of employment, including, but not limited to, hiring, placement, promotion, termination, layoff, recall, transfer, leaves of absence, compensation, and training.




This position has been filled. Would you like to see our other open positions?