Software Engineer, Platforms

Shoreline incident automation enables SREs and DevOps engineers to interactively debug at scale and quickly build remediations to eliminate repetitive work and minimize outages. As a Software Engineer at Shoreline, you’ll get to work on tools that give SREs and operators leverage in their work.

We are a small yet highly effective engineering team that truly cares about the product and each other.

As part of this team, you’ll have the opportunity to:

  • Build products that genuinely improve operational and SRE work, improve availability, and dramatically reduce time to resolution

  • Use modern development practices and get exposure to a variety of languages, platforms, and technologies

  • Gain organizational visibility and influence the product and roadmap

What you'll be doing

  • Implementing a fault-tolerant distributed runtime for Shoreline Op, a purpose-built, operations-oriented language that lets operators interactively debug operational events and automate remediations

  • Integrating with platforms (such as Kubernetes, VMware) and cloud providers (such as AWS, Azure, GCP) to provide fully automated discovery, monitoring, and management of resources across a customer’s environment

  • Building systems with arbitrary scale in mind, e.g., millions of nodes

  • Developing the Shoreline Agent, which runs in customers' environments to monitor resources and execute corrective actions with a minimal footprint, and continually pushing the envelope on performance

  • Owning end-to-end security of the Shoreline platform and developing features that make it easier for customers to manage their resources securely using Shoreline

Examples of features and projects delivered by the platforms engineering team:

  • Management of customer resources across clouds (AWS, GCP, and Azure) and on-prem environments, platforms (Kubernetes, VMs), and operating systems (Linux, Windows)

  • A routing service that automatically assigns agents to backends using rendezvous hashing, which enabled Shoreline to scale to managing 30,000 machines and 2,500 Kubernetes clusters with millions of containers

  • Observability of Shoreline through OpenTelemetry metrics, which also facilitates self-hosted Shoreline deployments: https://docs.shoreline.io/metrics/open-telemetry

  • Access control using groups, which makes fine-grained permission management much easier for large organizations: https://docs.shoreline.io/access-control/groups



Who you are

  • Hungry and passionate about building innovative software that creates large-scale impact for customers in a largely untapped space

  • Excited and eager to dive into challenging and interesting computer science problems in areas such as core backend development, distributed systems, SRE/DevOps, scalability, cloud, and performance optimization

  • Attracted to the fast pace, breadth, and depth of learning and growth at a well-poised emerging startup

Get to know us