Software Engineer, Platforms
Shoreline incident automation enables SREs and DevOps engineers to interactively debug at scale and quickly build remediations to eliminate repetitive work and minimize outages. As a Software Engineer at Shoreline, you’ll get to work on tools that give SREs and operators leverage in their work.
We are a small yet highly effective engineering team that truly cares about the product and each other.
As part of this team, you’ll have the opportunity to:
- Build products that genuinely improve operational and SRE work, improve availability, and dramatically reduce time to resolution
- Use modern development practices and get exposure to a variety of languages, platforms, and technologies
- Have organizational visibility and get to influence the product and roadmap
What you'll be doing
- Implementing a fault-tolerant distributed runtime for Shoreline Op, a purpose-built, operations-oriented language designed to let operators debug operational events and automate remediations interactively
- Integrating with platforms (such as Kubernetes and VMware) and cloud providers (such as AWS, Azure, and GCP) to provide fully automated discovery, monitoring, and management of resources across a customer’s environment
- Building systems with arbitrary scale in mind, e.g., millions of nodes
- Developing the Shoreline Agent, which runs in customers' environments to monitor and execute corrective actions with a minimal footprint, and continually pushing the envelope on performance
- Owning end-to-end security of the Shoreline platform and developing features that make it easier for customers to manage their resources securely using Shoreline
Examples of features and projects delivered by the platforms engineering team:
- Ability of Shoreline to manage customer resources across different clouds (AWS, GCP, Azure, as well as on-prem), platforms (Kubernetes, VMs), and operating systems (Linux, Windows).
- Routing service that automatically assigns agents to backends using rendezvous hashing, which enabled Shoreline to scale to managing 30,000 machines and 2,500 Kubernetes clusters with millions of containers.
- Observability of Shoreline through OpenTelemetry metrics, which also facilitates self-hosted Shoreline deployments: https://docs.shoreline.io/metrics/open-telemetry
- Access control using groups to make fine-grained permission management much easier for large organizations: https://docs.shoreline.io/access-control/groups
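To give a flavor of the routing work mentioned above, here is a minimal sketch of rendezvous (highest-random-weight) hashing in Python. It is an illustration only, not Shoreline's implementation: the function names, the agent and backend identifiers, and the choice of SHA-256 are all assumptions made for the example.

```python
import hashlib
from typing import Sequence


def score(agent_id: str, backend: str) -> int:
    """Deterministic pseudo-random weight for an (agent, backend) pair.

    SHA-256 is an assumption for this sketch; any well-mixed hash works.
    """
    digest = hashlib.sha256(f"{backend}|{agent_id}".encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def assign_backend(agent_id: str, backends: Sequence[str]) -> str:
    """Pick the backend with the highest score for this agent.

    Ties (extremely unlikely) are broken by backend name so the result
    does not depend on the order of the backends sequence.
    """
    if not backends:
        raise ValueError("no backends available")
    return max(backends, key=lambda b: (score(agent_id, b), b))


if __name__ == "__main__":
    # Hypothetical identifiers, used only for illustration.
    backends = ["backend-a", "backend-b", "backend-c"]
    for agent in ("agent-1", "agent-2", "agent-3"):
        print(agent, "->", assign_backend(agent, backends))
```

The useful property of this scheme is that when a backend is removed, only the agents that were assigned to it move; every other agent keeps its assignment, because its highest-scoring backend is still present.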
Who you are
- Hungry and passionate about building innovative software that creates large-scale impact for customers in a largely untapped space
- Excited and eager to dive into challenging and interesting computer science problems in areas such as core backend development, distributed systems, SRE/DevOps, scalability, cloud, and performance optimization
- Attracted to the fast pace, breadth, and depth of learning and growth at a well-poised emerging startup
Get to know us
- Blog post from Founder and CEO Anurag Gupta about why he started Shoreline and his vision for it: https://shoreline.io/blog/why-i-built-shoreline-incident-automation
- Shoreline culture: https://www.shoreline.io/join-us#culture
- A talk from Anurag about why systems fail and how Shoreline can help: https://shoreline.io/blog/why-systems-fail-and-what-you-can-do-about-it
- Shoreline blog with more posts about use cases and problems solved: https://shoreline.io/blog
- Shoreline product demo: https://www.youtube.com/watch?v=VA1lsA6l1ao
- Shoreline product documentation: https://docs.shoreline.io/
- Shoreline YouTube channel: https://www.youtube.com/@shoreline2556/videos