The Internet Archive is seeking a Software Engineer to join our core infrastructure team, specifically the group responsible for managing the content of our storage cluster. The software developed by this group adds new data to the cluster, verifies it, derives additional data from it, makes it available to our front-end systems for public distribution, and assures its long-term integrity.


The work involves a long-lived code base; portions have been in use, and under active development, for 15-20 years. Because this code base is also in constant high-volume use, changes must be made in a manner consistent with ongoing activity.

 

The position accordingly requires someone who will enjoy refining and improving a venerable system while it operates, and who sees more appeal in refactoring and adding features than in building entirely new systems from the ground up.


Our priority is code that will last and that will lend itself to being maintained and improved in the future: code that is clean, readable, well-structured, extensible, and highly reliable. You will effectively be collaborating not just with your contemporary colleagues, but also with your predecessors and successors, and the need is for software written with that in mind.

 

The work is primarily in PHP; preexisting deep knowledge of PHP is not required, just an interest in applying skills learned elsewhere to the PHP environment.

 

Although some organizations maintain a distinction between development and production, with code being handed off to production engineers when complete, here you will have ongoing responsibility for monitoring the behavior of your code in production and for addressing problems as they arise. That may, on occasion, involve intervening outside regular work hours.

 

It’s expected that it will take time to become familiar enough with our systems to begin contributing fully. We’re prepared to invest that time if you are.

 

This job may be performed remotely, so long as your work hours can overlap significantly with those in US Pacific time.

 

Responsibilities:


  • Work with the core infrastructure team to maintain and improve the Internet Archive’s PHP code base while keeping the system operational

  • Diagnose and fix operational issues as they arise

  • Manage the exchange of data among the Internet Archive’s servers

  • Provide back-end support for user-facing services developed by the front-end team

  • Help find ways to improve the performance of our systems, to meet the challenges posed by an ever-increasing volume of activity

 

Qualifications:


  • Ability to do patient and careful work, with attention to detail, both under supervision and independently

  • Sound technical judgment

  • Strong problem-solving skills

  • Excellent written and verbal communication skills

  • Knack for maintaining grace under pressure

  • Computer Science degree

  • At least five years of practical experience

  • Also helpful: facility with the Linux operating system (Ubuntu, specifically), relational databases (PostgreSQL), in-memory data stores (Redis), web servers (nginx), and version control systems (Git)


Benefits & Perks:

The Internet Archive is a remote-first workplace and provides a comprehensive benefits package including PTO, paid holidays, and medical benefits. Depending on where you live, we also provide these additional benefits: dental, vision, health savings accounts, flex spending accounts, commuter benefits, short-term disability, long-term disability, and retirement programs.


At the Internet Archive, we believe we do our best work when our employees bring together diverse ideas. Members of all groups underrepresented in the tech industry and library world are strongly encouraged to apply. We are proud to be an equal opportunity workplace and are committed to equal employment opportunity regardless of race, color, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, genetic information, veteran status, gender identity or expression, sexual orientation, or any other characteristic protected by applicable federal, state or local law.

The Internet Archive is a non-profit library, headquartered in San Francisco, with an ambitious mission: to provide universal access to all knowledge -- the books, web pages, audio, television, and software of our shared human culture. Our engineers, archivists, librarians, and team members have built one of the top 300 websites in the world, https://archive.org/. The Internet Archive digitizes 1,000 books a day and collects a hundred million web pages a week. In its 25-plus years, it has built one of the largest digital libraries in the world by working with hundreds of national and international libraries, archives, museums, universities, and non-profits.