Shoreline closes $35M series B - Read the details
Pre-built automations that fix the most common issues impacting availability
Shoreline’s Op Pack library offers open source blueprints for automating away your most common incidents.
Shoreline’s Op Packs are Terraform modules that contain multiple elements:
Quickly fix the most common issues.
Shoreline provides a powerful library of alarms, debugging commands, and remediation actions that address common problems.
Save time using solutions vetted by a community of experts.
Instead of having to start from a blank slate, begin with solutions that others in the community have already solved.
Rest easy knowing safety and controls are built in.
Every Op pack has tests integrated, and safety built-in, including access controls, blast radius controls, and circuit breakers.
See below for a listing of our most popular Op Packs, and follow the link on each one to learn more about the specific issue and solution.
The community is actively adding to the library every month, so please contact us if you don’t see the issue that’s been driving you nuts. If we haven’t already built it, we’d love to add it to our list.
JVMs often face memory issues that can lead to hours of SSH-ing into box after box trying to catch the issue as it happens.
Network related issues are often hard to diagnose, and can lead to a very bad experience for customers.
When AWS Systems Manager marks a node for retirement, companies must gracefully terminate work on that node.
Disk full incidents can lead to wide-spread outages and data loss that can damage customer experiences and lose revenue.
CoreDNS, the default Kubernetes DNS service, can degrade in performance with too many calls causing massive latency.
Argo makes declaratively managing workflows easy, but it can leave behind many stale pods after workflow execution.
When Kubernetes pods won’t leave the terminating state, they must be identified and safely drained.
Unauthorized cryptocurrency miners must be stopped from abusing free tiers of cloud service providers.
Server environments can often be challenging to run. Sometimes processes silently die. Other times old versions of processes are left running.
Many production incidents are caused by issues that can be identified by analyzing log files. Unfortunately, centralized logging can be very expensive.
Many different types of application errors can lead to out of memory errors (OOMs) in Kubernetes.
Sooner or later every company gets bitten by expired certificates and when they do, it can cause a catastrophic outage.