Blog

Resources and insights

Read the latest stuff we're up to and what we're most excited about.

Mar. 12, 2024

Managing Zombie Processes in Containers

In container environments, the main process manages child processes. Poor management can lead to orphan processes, draining resources and risking operational integrity.

Nobutaka Ajito

Technical Writer

Mar. 5, 2024

3 Common Problems of a Queue and How to Fix Them

In this candid insight, Anurag shares valuable lessons on utilizing queues in system design for enhanced reliability, learned from firsthand experiences at AWS.

Anurag Gupta

CEO and Founder, Shoreline

Jan. 30, 2024

Shoreline.io CEO Anurag Gupta’s Top 5 Videos from 2023: Insights for DevOps and Cloud Operations Teams

Throughout 2023, Shoreline’s founder and CEO, Anurag Gupta, shared his thoughts on DevOps and cloud operations excellence in many short videos on our YouTube channel. We’ve selected the five insights that stood out for DevOps and Cloud teams looking to innovate tools and processes in 2024.

Shoreline.io

Jan. 22, 2024

The Cost, The Challenges, and The Conquering of On-Call Operations

Delve into our blog about the intricacies of on-call operations, drawing from Shoreline's 2022 survey insights. Over 300 experts discuss the high costs and challenges faced, with tips for reducing escalations and automating tasks. Discover how Shoreline's tools can revolutionize your on-call strategy.

Kristine Newman

VP, Product Marketing, Shoreline

Jan. 10, 2024

2023 Benchmark Report: Transforming Incident Management in the Age of AI

Constellation Research's 2023 IT Ops report shows incident resolution costs $20-100M annually for big firms, highlighting automation's necessity in modern digital incident management.

Anurag Gupta

CEO and Founder, Shoreline

Dec. 19, 2023

Boosting Developer Productivity by 25% with Incident Automation

Shoreline recently helped Razorpay, a FinTech leader in India, elevate their system reliability and improve developer productivity by 25% as part of their strategic initiative for incident automation.

Kristine Newman

VP, Product Marketing, Shoreline

Nov. 14, 2023 Video

How We Secure our Tier Zero Service

Shoreline.io, like any observability tool, any incident management tool, or any incident automation tool, is a tier zero service. Tier zero services need to stand up - even when other things are burning down.

Anurag Gupta

CEO and Founder, Shoreline

Nov. 7, 2023 Video

Minimizing Data Collection Expenses

Observability tools rank as the 3rd-highest cost for engineering teams after their people and cloud infrastructure. That’s insane! What's even more insane? You hardly ever use the collected data.

Anurag Gupta

CEO and Founder, Shoreline

Nov. 6, 2023 Podcast

The Role of Generative AI in Enhancing Production Operations

In this RedMonk Conversation, Stephen O'Grady and Anurag Gupta discuss how generative AI can help address reliability challenges and incident response.

Kristine Newman

VP, Product Marketing, Shoreline

Nov. 2, 2023 Video

4 Tactics to Ensure Power & Safety in Production Ops

How can we establish powerful production operations that avoid allowing SREs unrestricted SSH access to production environments? Here are four measures we implement to safeguard services.

Anurag Gupta

CEO and Founder, Shoreline

Sep. 21, 2023 Video

Razorpay's Advanced Approach to Threat Correlation & Remediation

Razorpay, a FinTech leader in India, built ARCTIC, a security & response solution coupling pinpoint accuracy in threat detection with rapid remediation from Shoreline. They posted a recent video about how they did it.

Anurag Gupta

CEO and Founder, Shoreline

Aug. 2, 2023 Event

DASH 2023

DASH, by Datadog, is an annual conference with two days packed with hands-on learning and inspiration. Let’s build and scale the next generation of applications, infrastructure, security, and technical teams together.

Shoreline.io

Jun. 26, 2023

Shoreline @ Monitorama and Observa-Palooza 2023

Join Shoreline.io at Monitorama and Observa-Palooza to share reliability best practices and have some fun.

Chris Newton

VP of Marketing, Shoreline

Apr. 28, 2023

Observability without Action is just Storage

Observability in software ops is key for proactive issue resolution, going beyond data collection to include decisive actions based on logs, metrics, and traces. Not acting on insights leads to reduced productivity and poor user experiences.

Kristine Newman

VP, Product Marketing, Shoreline

Apr. 24, 2023 Video

Reliability Engineering: The Southwest Debacle

Because it's less expensive and quicker for passengers, Southwest operates on a point-to-point model. Any disruptions in one route affect the entire chain. But to engineer a reliable architecture, you need to balance cost versus reliability in an economically constrained way.

Anurag Gupta

CEO and Founder, Shoreline

Apr. 24, 2023 Video

How to Solve the Challenges of MELT Data at Scale

The bigger the data set, the slower it is to analyze. For MELT, you need to be able to execute a query at scale across your fleet and see what's going on in the live environment. That’s why, at Shoreline, we favor modeling the distributed system as a distributed system.

Shoreline.io

Apr. 24, 2023 Video

How to Reduce Alarm Noise

In any company, 50-80% of the alarms are noisy. Employees get trained to snooze these alarms – which isn’t always the right thing to do. Wouldn't it be better if you could easily see which are your top issues each week, and which alarms might be set incorrectly?

Shoreline.io

Apr. 24, 2023 Video

How to Bring Continuous Improvement in Operations

I deeply believe in making things 1% better each and every week by improving the performance of the software I've been responsible for and keeping my services up. Let’s talk about bringing continuous improvement to operations.

Shoreline.io

Apr. 24, 2023 Video

Building a Culture Around Reliability

It's not some other team's job to keep your service up. Just like it's not some other team's job to fix your bugs or make sure that your system doesn't have vulnerabilities. We all have to own it. That is what a culture of reliability requires.

Shoreline.io

Apr. 24, 2023 Video

3 Challenges of Meeting 4 Nines Availability

Availability for the 4 nines is equivalent to only 4.4 minutes of downtime in a month. Here are 3 challenges that keep people from meeting customer expectations for service availability.

Shoreline.io
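
The 4.4-minute figure quoted in the entry above can be checked with a little arithmetic. The sketch below is illustrative only: the function name is hypothetical, and it assumes an average month of 30.44 days (a 30-day month gives roughly 4.3 minutes instead).

```python
def allowed_downtime_minutes(availability_percent: float, days: float = 30.44) -> float:
    """Return the downtime budget, in minutes, over a period of `days` days.

    Assumption: 30.44 days is used as the average month length.
    """
    total_minutes = days * 24 * 60
    return total_minutes * (1 - availability_percent / 100)


# Four nines (99.99%) over an average month
print(round(allowed_downtime_minutes(99.99), 1))  # 4.4
```

Each additional nine divides the budget by ten, so five nines leaves well under a minute per month.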

Apr. 24, 2023 Video

Why You Need Automation Today

A ton of tools help you observe your environment and maybe half a ton help you route things and deduplicate them. But there's hardly anything out there that actually fixes your environment. That's the reason we need automation in production ops today.

Anurag Gupta

CEO and Founder, Shoreline

Apr. 24, 2023 Video

How to Setup Shoreline’s Incident Insights Tool

Learn step by step how to set up Shoreline's Incident Insights so that you can pinpoint the top causes of incidents, measure team health, and use trending data to drive continuous improvement. Get up and running in 2 minutes.

Shoreline.io

Apr. 24, 2023 Video

How Shoreline Helps You Get a 4 9’s SLA

Since we’re all sitting on similar infrastructure, if someone solves an issue, everyone should be able to benefit from it. That’s one of the ways we help our customers to save time, reduce errors, and get to a four 9’s SLA.

Shoreline.io

Apr. 24, 2023 Video

How Does Shoreline’s Incident Insights Work?

I know I should apply continuous improvement to operations. But where do I start? See how our free Incident Insights tool helps you remove noise and increase signal, making your team more productive and reducing costs by decreasing toil.

Shoreline.io

Apr. 24, 2023 Video

A Guide to Building Reliable Systems

Amazon S3's 11 nines claim promises near-immortal data storage, but real-world factors like solar events and correlated failures challenge this durability. Understanding the limits is crucial for robust system design.

Shoreline.io

Apr. 18, 2023 Podcast

Slight Reliability Podcast: The reliability.org Community with Anurag Gupta

Shoreline founder and CEO, Anurag Gupta, joins Stephen Townshend to discuss the value of community, collaboration between organizations, vicious versus virtuous cycles for reliability, and more.

Shoreline.io

Apr. 12, 2023 Podcast

Unlearn Podcast: Building Reliable and Resilient Systems with Anurag Gupta

Shoreline founder and CEO, Anurag Gupta, joins Barry O’Reilly to share the importance of embracing failure, creating a blameless culture, and his first-hand knowledge on how to build more reliable and resilient systems.

Shoreline.io

Mar. 30, 2023 Webinar

Cloud Native Live: Designing and Operating Reliable Cloud Services – A View from the Trenches

Shoreline founder and CEO, Anurag Gupta, joins Niall Murphy, Co-founder & CEO of Stanza, and Stephen Townshend, Host of the Slight Reliability Podcast, to discuss the importance of investing in reliability and the new reliability.org community.

Shoreline.io

Jul. 20, 2021 Podcast

Screaming in the Cloud: All Along the Shoreline.io of Automation with Anurag Gupta

Anurag Gupta joins Corey Quinn to discuss the large variety of services he helped launch and his transition back to start-ups with the founding of Shoreline.

Shoreline.io

Mar. 29, 2023 Video

Decoding Taylor Swift’s Ticketmaster Debacle

What can we learn from the Ticketmaster (Taylor Swift) Debacle? Ticketmaster experienced an unprecedented demand that resulted in their site crashing for many hours. If they had designed a reliable service with an escalator-like system instead of an elevator, this could have been avoided.

Shoreline.io

Mar. 8, 2023 Video

Shoreline on Shoreline: Idle EC2 Cost Savings Op Pack

Hear from Shoreline Op Pack Engineer, Kaustubh Prabhakar, on how valuable it is to use our Idle EC2 Cost Savings Op Pack.

Shoreline.io

Mar. 5, 2023

Top Tips for Writing an Enticing (and Honest) SRE Job Description

Learn the best practices when building an effective SRE job description, including some sample copy, to help SRE leaders improve how they recruit team members.

Kazi Zaman

Chief Development Officer, Shoreline

Feb. 16, 2023

Don’t Be Like Twitter

Twitter’s recent outage highlights the dangers of errors and staff shortages. Safeguard your business by implementing automated reliability solutions.

Chris Newton

VP of Marketing, Shoreline

Mar. 14, 2023 Event

SLC DevOpsDays 2023

Every year, we look forward to connecting with our local DevOps community at SLC DevOps Days, sharing and learning from experts in our community and working with the DevOps thought leaders who visit our event. We are very excited to be back in Salt Lake City March 14-15, 2023.

Shoreline.io

Nov. 6, 2023 Event

KubeCon '23

The Cloud Native Computing Foundation’s flagship conference gathers adopters and technologists from leading open source and cloud native communities in Chicago, Illinois from November 6-9, 2023.

Shoreline.io

Jul. 11, 2023 Event

AWS Summit New York 2023

Join us in person for AWS Summit New York to see how your peers and competitors are using the cloud to their advantage and learn all the ways you can use AWS to jump-start, grow, or supercharge your business and career to the next level. AWS Summit New York is a free event.

Shoreline.io

Jun. 26, 2023 Event

Monitorama 2023

Monitorama brings together the brightest minds among the open source development and operations communities to continue to push the boundaries of observability software and practices, all while having a great time in a casual setting. Join us in Portland, Oregon June 26-28, 2023.

Shoreline.io

Jan. 30, 2023

What Does 2023 Have in Store for Cloud Reliability?

We asked the Shoreline team what predictions they have for cloud reliability in 2023. Here’s what we learned about cloud adoption, automation, and more.

Yuvraj Mehta

Head of Product, Shoreline

Jan. 30, 2023

Shoreline Audit Integration with AWS CloudTrail Lake

Shoreline's partnership with AWS CloudTrail Lake enables Shoreline and AWS customers to use CloudTrail Lake as their single source of truth for auditing events.

Andreea Covaliu

Software Engineer, Shoreline

Jan. 18, 2023 Podcast

Unleashing the Power of AI in IT Operations

In a compelling discussion, Evan and Anurag delve into the intricacies of Shoreline's AI Ops platform for incident response. Anurag, drawing from his experience leading reliable services at AWS, highlights the challenges of maintaining high availability in the face of rapid growth. He emphasizes the role of innovative automations in ensuring consistent service for demanding customers. Anurag suggests the first step in driving reliability for cloud services is understanding the root causes of incidents. He points to Shoreline's free tool designed to aid in this process. The conversation also features a case study of a major Shoreline client managing a 30,000-node fleet across multiple clouds and regions. Anurag shares how the client efficiently handles security checks and issue detections over thousands of instances simultaneously, treating the entire fleet as a single entity. For a deeper dive into this insightful discussion, the full video podcast is available on YouTube and LinkedIn.

Kristine Newman

VP, Product Marketing, Shoreline

Jan. 17, 2023

How to Analyze Incident Data to Optimize On-Call Operations

We get it, incident data is difficult to read. Dive into three different and effective ways to categorize and filter your data to gain actionable insights.

Chris Newton

VP of Marketing, Shoreline

Jan. 16, 2023 Video

Automate Based on Frequency not Recency | Shoreline.io

Prioritize automating frequent issues for efficiency. Automate critical tasks for safety and error reduction. AWS strategy: automate one issue weekly for significant yearly impact. Shoreline simplifies automation, aiming for quick, lasting fixes, transforming work efficiency.

Anurag Gupta

CEO and Founder, Shoreline

Jan. 4, 2023 Video

Shoreline on Shoreline: Unauthorized Root Access Detector

Hear from Shoreline Op Pack Engineer, Kaustubh Prabhakar, on how valuable it is to use Shoreline Unauthorized Root Access Detector.

Shoreline.io

Jan. 4, 2023 Video

Debugging an eCommerce Microservice - High Request Latency Debugging with Shoreline

Charles Cary, Shoreline's Chief Technology Officer, walks us through Shoreline's automation runbook experience.

Shoreline.io

Jan. 2, 2023

Why We Built Incident Insights

Ticketing data is messy. This new, free tool allows leaders to contextualize data to understand what issues occur most frequently and how long they take to resolve.

Charles Cary

Chief Technology Officer, Shoreline

Dec. 15, 2022 Video

Role of Empathy in Building a Great Company Culture

Obsessing about customers is important, but so is creating a culture where people take care of others and feel cared for. That’s why we put our values right on our website.

Shoreline.io

Dec. 15, 2022 Video

About Company Values

Part of the reason to create a company is to create the environment you want to be in. So it’s important that you reflect your values in your interview process. Otherwise, the sheer number of people joining will dilute things.

Shoreline.io

Dec. 13, 2022 Video

Risks of Automation vs. Human Errors

Automation is risky. Errors in the remediation code could worsen an outage. While that’s true, we also know that human error causes 5x more incidents than automation. You can fix code. You can't fix people.

Shoreline.io

Dec. 13, 2022 Video

How Notebooks Empower Your On-Call Teams

Some issues can't be automated. For things that require human judgment, we provide on-call teams with notebooks that are optimized for operations. That way you know what action to take and when.

Shoreline.io

Dec. 13, 2022 Video

Is Automation Too Time-Consuming?

Automation takes us too much time. We're way too busy fighting fires to think about it. The problem with this approach is that 48% of incidents are straightforward and repetitive. Don't have people fix them manually. Teach the computer how to do it.

Shoreline.io

Dec. 12, 2022 Article

Shoreline Software Launches Free Tool to Organize and Analyze Incident Ticketing Data Automatically

During AWS re:Invent 2022, Shoreline Software announced its Incident Insights solution, a free, AI-powered tool intended to help CloudOps teams analyze their incidents.

Shoreline.io

Dec. 9, 2022 Video

theCUBE Interviews Shoreline CEO Anurag Gupta at AWS re:Invent

Anurag Gupta joined John Walls to discuss innovation in the cloud with DevOps teams for the Global Startup Program at AWS re:Invent 2022.

Shoreline.io

Dec. 7, 2022 Video

"The Power is Huge" with Shoreline

Hear how TigerGraph VP of Product and Innovation, Dr. Jay Yu, used Shoreline to drive continuous improvement and bring up the productivity of his DevOps teams.

Shoreline.io

Nov. 28, 2022 Article

Shoreline.io Launches Incident Insights, a Free Tool for Analyzing Incidents

AI-powered tool automates the categorization, filtering, and analysis of incidents.

Chris Newton

VP of Marketing, Shoreline

Nov. 20, 2022

3 Key Takeaways from The State of DevOps Report

The biggest takeaways and trends from Google Cloud and DORA’s 2022 State of DevOps report.

Chris Newton

VP of Marketing, Shoreline

Nov. 18, 2022 Video

Shoreline Incident Insights

Shoreline's Incident Insights turns messy incident data into insights for FREE

Shoreline.io

Nov. 15, 2022

Incident Escalation Comes at a Cost

To ensure success within your on-call operations, you’ll need to understand what causes escalations, how they affect on-call teams, and how to reduce them.

Ashley Stirrup

COO, Shoreline

Nov. 10, 2022 Video

Shoreline on Shoreline: Alarms & Actions for Release Testing

Hear from Senior Director, Haritha Gongalore, on how rewarding it is to use Shoreline Alarms and Actions to test and certify our own releases.

Shoreline.io

Nov. 4, 2022 Video

[Training] Debugging Kubernetes with Runbooks

In this training, we walk you through the common issues and challenges troubleshooting Kubernetes, and Shoreline's pre-built K8s debugging runbooks.

Shoreline.io

Nov. 2, 2022

TigerGraph Reduces Operations Team Workload by 60 Hours Per Month with Shoreline

Shoreline helps TigerGraph scale in the cloud with a small operations team by automating mundane tasks and quickly finding the root cause of degraded service.

Chris Newton

VP of Marketing, Shoreline

Oct. 25, 2022 Article

Shoreline.io Offers Free Interactive Runbooks for Kubernetes Debugging

Diagnose and Repair Kubernetes Issues Across Pods, Nodes, and Services in Minutes

Chris Newton

VP of Marketing, Shoreline

Oct. 21, 2022 Video

The Surprising Cost of On-Call at Trace3 Evolve 2022

Ashley Stirrup dives into the hidden costs of on-call and discusses how one team saved 20 hours of DevOps time per month thanks to Shoreline.io automations.

Shoreline.io

Oct. 20, 2022

Automation Anywhere Optimizes Production Operations with Sumo Logic and Shoreline

At Sumo Logic’s Illuminate conference, Automation Anywhere discussed how the company optimizes production operations with Shoreline’s cloud reliability platform.

Chris Newton

VP of Marketing, Shoreline

Oct. 20, 2022 Article

Production Operations Survey Finds Companies Suffer 8.7 Major Cloud Infrastructure Incidents Annually

With a more productive and future-focused team, you can proactively eliminate the root causes that lead to major incidents and create tools that shorten time to resolution.

Ashley Stirrup

COO, Shoreline

Oct. 18, 2022 Article

Shoreline.io Announces Availability of the Datadog Incident Repair Kit

Improve customer satisfaction and reduce toil with instant fixes to common production incidents.

Chris Newton

VP of Marketing, Shoreline

Oct. 14, 2022 Video

How to Manage Failure without Wasting Resources

How can you make better use of the resources you set aside for failover? Here's how we put failover capacity to work on useful background activity that can be paused or deferred when things hit the fan.

Shoreline.io

Oct. 14, 2022 Video

How to Reduce Waste for Unexpected Demands

Shoreline's back ends run at low utilization most of the time. But once an hour, we pull telemetry data from all agents, resulting in a CPU, memory, and network utilization spike. See how we recognize resources over-provisioned for demand spikes as waste and eliminate it.

Shoreline.io

Oct. 13, 2022 Video

Slack vs. Waste

Waste is when resources are deeply over-provisioned, underutilized, or not utilized at all. Slack appears like the same thing, but you create it with purpose. It's important to understand the difference to drive costs down.

Shoreline.io

Oct. 12, 2022 Video

Why You Should Automate Production Ops

Most on-call issues are commonplace, which means they happen again and again. It’s important to automate these issues because automation is a one-time investment, doesn’t make mistakes, and stays with you forever.

Shoreline.io

Oct. 12, 2022 Video

Shoreline Customer Spotlight: TigerGraph

Automating mundane tasks and debugging were just a few of the DevOps requirements TigerGraph VP of Product and Innovation, Dr. Jay Yu, needed to scale in the cloud with his small team. Shoreline delivered.

Shoreline.io

Oct. 12, 2022 Video

Automation Anywhere Connects Sumo Logic with Shoreline for Auto-remediation

Automation Anywhere links Sumo Logic's data and log monitoring with Shoreline's automated incident repairs to improve customer experiences and save Dev time.

Shoreline.io

Oct. 12, 2022 Video

Lessons from AWS: How to Write a Narrative

It’s not a presentation – you don’t tell people what to do. You simply put the facts on the table in a neutral tone.

Shoreline.io

Oct. 12, 2022 Video

Actively Managing Systems to Improve Utilization

We're all being asked to do more with less nowadays. For those of us in production operations, one of the best ways we can do that is to eliminate waste with automation to drive higher utilization.

Shoreline.io

Oct. 5, 2022 Video

Shoreline Datadog Incident Repair Kit Demo

Find it with Datadog. Fix it with Shoreline.

Shoreline.io

Sep. 27, 2022 Video

How to Reduce On-Call Incidents

If your on-call sucks, you must find a path to make incidents incidental.

Shoreline.io

Sep. 27, 2022

3 Ways to Reduce Downtime and Improve Site Reliability

Downtime and site reliability issues happen far too often. Operations teams need to focus on these three productivity hacks so teams can tackle more with less.

Ashley Stirrup

COO, Shoreline

Sep. 19, 2022 Video

Our Community-Driven Library of Shared Automations

Op Packs available for free with Shoreline.

Shoreline.io

Sep. 19, 2022 Video

About Shoreline’s Fleet-Wide Debugging and Repair

Debug across the fleet in about the same amount of time as an individual box.

Shoreline.io

Sep. 16, 2022 Video

The Best Way to Improve Your On-Call

Improve your on-call by building automations that eliminate common production incidents.

Shoreline.io

Sep. 13, 2022 Webinar

TigerGraph: Scaling in the Cloud with a Small Ops Team

Shoreline founder and CEO, Anurag Gupta, joins Dr. Jay Yu, TigerGraph's VP of Product and Innovation, to discuss innovative ways to scale cloud operations fast without the need to incur a lot of costs and keep expanding the cloud DevOps team in this webinar hosted by DevOps.com.

Shoreline.io

Sep. 7, 2022 Webinar

The Surprising Cost of On-Call Operations

Anurag Gupta and Ashley Stirrup presented data and insights from a survey conducted with 300 on-call practitioners, managers and executives at this DevOps.com hosted webinar on Sept 7th, 2022.

Shoreline.io

Aug. 22, 2022

What is Incident Automation?

DevOps is complicated. Incident automation is a new function within DevOps that enables engineers to optimize performance, reduce toil, and improve innovation.

Ashley Stirrup

COO, Shoreline

Aug. 17, 2022

Survey: The True Cost of Production Operations Issues

Our 2022 Benchmarking Production Operations Report reveals the leading cause of major incidents, and the impact of escalation, toil, and more.

Ashley Stirrup

COO, Shoreline

Aug. 15, 2022 Article

5 Ways to Prevent an Outage

The main challenge in preventing outages lies in the inevitable breakdown of various components like disks, nodes, and networks. To mitigate this, companies need to acknowledge human error as an unavoidable factor, especially when numerous commands are manually inputted daily. Investigating how minor errors can cause significant damage and implementing safeguards and redundancies are essential steps to reduce the risks and impacts of potential outages.

Kristine Newman

VP, Product Marketing, Shoreline

Aug. 9, 2022 Article

Production Operations Benchmark Survey Reveals Half of Incident Response Time Remains Toil Despite Millions Being Spent

On Average, Companies Spend 12 Person Years On Incident Response Annually

Shoreline.io

Aug. 7, 2022

Shoreline Enhancements Improve Safety for Cloud Production Operations

Shoreline announces customer-driven enhancements that provide enterprise customers with critical safeguards against human errors when executing large scale automations across their multi-cloud infrastructure.

Yuvraj Mehta

Head of Product, Shoreline

Jul. 29, 2022 Video

How to Do Continuous Improvement in Operations

Continuous improvement in operations is possible by automating a few IT incident tickets on a regular basis.

Shoreline.io

Jul. 28, 2022

Self-Healing: The Key to Fixing the Most Common Kubernetes Issues

Here are three tips for automatically fixing the most common Kubernetes issues through mastering Kubernetes, self-healing, and staying proactive.

Charles Cary

Chief Technology Officer, Shoreline

Jul. 14, 2022 Article

Shoreline.io Announces Open Source Solutions Library to Deliver Self-Healing Infrastructure

Shoreline’s library of pre-built Op Packs offers open source solutions for common production operations incidents, eliminating operational toil and increasing availability

Chris Newton

VP of Marketing, Shoreline

Jul. 14, 2022 Article

On-call cloud operations cost organizations an average of $2.5 million per year

The biggest opportunity to improve on-call productivity is by reducing incident escalations, which account for 78% of on-call time, according to a new report from Dimensional Research and Shoreline.io.

Shoreline.io

Jul. 14, 2022 Article

New open source solutions library from Shoreline.io aims to deliver self-healing infrastructure

Shoreline's pre-built op packs are geared at addressing common production operations incidents to increase teams’ productivity.

Shoreline.io

Jul. 13, 2022

Automation at Dataiku Eliminates DevOps Work and Improves Customer Experience

Almost 170 remediations were automatically triggered last month, conservatively saving over 20 FTE days of DevOps work, while improving app performance.

Louis-Philippe Kronek

GM Dataiku Online, Dataiku

Jul. 6, 2022 Video

3 Hacks to Reduce Your Cloud Computing Bill

1. Reduce waste, 2. Optimize what you’re using, and 3. Move towards reserved instances (RIs)

Shoreline.io

Aug. 12, 2021 Podcast

CTO Connection Short Byte: Anurag Gupta - The importance of automating production ops

Peter Bell from CTO Connection chatted with Anurag about the importance of automating production ops.

Shoreline.io

Jul. 3, 2022 Video

What I Learned at AWS About Managing SLOs

All we could do was apply automation to fix the issues when they occurred, providing customers much better availability.

Shoreline.io

Jul. 3, 2022 Video

Shoreline Fleetwide Repairs

Safely fix incidents across your entire fleet, with less overhead, and with fewer errors.

Shoreline.io

Jul. 3, 2022 Video

Shoreline End-to-end Automation

Easily and safely automate incident remediations with a few lines of code.

Shoreline.io

Jul. 3, 2022 Video

How to Fix an Incident Before It Happens

It requires predictive maintenance, including monitoring for brownouts and performing control actions.

Shoreline.io

Jul. 3, 2022 Video

Shoreline on Shoreline: Open Port Check

It's critical to close ports that can be opened unintentionally in a development environment, especially port 22 for SSH and port 3389 for remote login.

Shoreline.io

Jul. 3, 2022 Video

Shoreline Fleetwide Debugging

Run a single command across the entire fleet to diagnose incidents more quickly.

Shoreline.io

Jul. 3, 2022 Video

Debugging a Fleet as Easily as an Individual Box

Underneath the covers, the underpinning technology is a lot like a parallel SQL database.

Shoreline.io

Jul. 3, 2022 Video

Datadog + Shoreline Integration Demo

See issues and act in real-time, directly from Datadog

Shoreline.io

Jul. 3, 2022 Video

Shoreline Operations Notebooks

Record, curate, and publish incident debug and repair best practices to safely empower on-call teams.

Shoreline.io

Jun. 30, 2022 Video

Shoreline Actionable Alarms

Shoreline Alarms identify issues with high specificity so that they are immediately actionable.

Shoreline.io

Oct. 18, 2022 Event

DASH

DASH, by Datadog, is an annual conference about building and scaling the next generation of applications, infrastructure, security, and technical teams, hosted in 2022 at Javits Center North, New York.

Shoreline.io

Jun. 29, 2022 Video

How to Safely Fix Issues Without Escalation

Incident automation helps people automatically fix issues in production, and grow the number of people who can safely fix things without escalation.

Shoreline.io

Jun. 22, 2022 Video

Using Shoreline.io to root-cause transient issues (like JVM garbage collection)

Shoreline makes it easy to collect diagnostic information when you're doing a root-cause analysis of an issue.

Shoreline.io

Jun. 21, 2022 Video

Shoreline Incident Automation Demo

See Shoreline in action, debugging an incident and automating remediations in a fraction of the usual time.

Shoreline.io

Jul. 29, 2021 Podcast

Software Engineering Daily: Fleet Automation with Anurag Gupta

Anurag had the opportunity to chat with Jeff Meyerson on his podcast, Software Engineering Daily.

Shoreline.io

Oct. 24, 2022 Event

KubeCon '22

The Cloud Native Computing Foundation’s flagship conference gathers adopters and technologists from leading open source and cloud native communities in Detroit, Michigan from October 24 – 28, 2022.

Shoreline.io

Nov. 28, 2022 Event

re:Invent

For 10 years, the global cloud community has come together at re:Invent to meet, get inspired, and rethink what's possible. Join us again this year in Las Vegas for our biggest, most comprehensive, and most vibrant event in cloud computing.

Shoreline.io

Jun. 8, 2022 Video

Why We Leverage Wavelets for Data Compression

Wavelets are the best way to deal with errors in the underlying data stream

Shoreline.io

Jun. 6, 2022 Article

Making Something From Nothing: Anurag Gupta Of Shoreline On How To Go From Idea To Launch

An Interview With Fotis Georgiadis - Overcoming the struggle in taking a good idea and translating it into an actual business.

Anurag Gupta

CEO and Founder, Shoreline

Jun. 3, 2022 Video

Shoreline Makes Production-Ops Smarter and Faster

Often people try to build a solution like Shoreline on their own. Here's why they fail.

Shoreline.io

Jun. 3, 2022 Video

How to Efficiently Manage Your Operational Data

Discover how to manage operational data for debugging and trend analysis efficiently, reducing costs and enhancing real-time insights with Shoreline.io's innovative approach.

Shoreline.io

Jun. 3, 2022 Video

How to Boost Reliability Without Hiring More SREs

How can companies increase reliability without hiring an army of engineers?

Shoreline.io

Jun. 3, 2022 Video

Why I Started Shoreline

Companies spend more on the people managing their cloud infrastructure than on the cloud infrastructure itself.

Shoreline.io

Jun. 3, 2022 Video

What We Do at Shoreline (In 140 Seconds)

Shoreline helps on-call operators reduce incidents, resulting in a better on-call experience and better availability for their customers.

Shoreline.io

Jun. 3, 2022 Video

Niall Murphy on his experience with Shoreline's Incident Automation Platform

Niall Murphy, former SRE at Google and Microsoft and author of the O'Reilly book, Site Reliability Engineering, shares his experience of using Shoreline's Incident Automation Platform.

Shoreline.io

Jun. 3, 2022 Video

Shoreline Incident Automation Overview

Shoreline’s Incident Automation Platform was built to reduce manual and repetitive work, so that you can repair issues faster, increase team productivity, and eliminate thousands of hours of degraded service.

Shoreline.io

Jul. 27, 2021 Webinar

Why Automated Remediation?

RedMonk analyst Steve O'Grady sits down with Anurag Gupta to define automated remediation.

Shoreline.io

Aug. 18, 2021 Webinar

Incident Response: The job of humans or machines?

In this webinar, John Egan, CEO of Kintaba, and Anurag Gupta, CEO of Shoreline, discuss the role humans and machines play when dealing with failures and responding to disasters.

Shoreline.io

Sep. 9, 2021 Webinar

5 Technical Lessons Learned from Outages at AWS, Google and Microsoft

InfoQ webinar: 5 Technical Lessons Learned from Outages at AWS, Google and Microsoft

Shoreline.io

Apr. 6, 2021 Podcast

Anurag Gupta on day 2 operations, devops, and automated remediation

In this podcast, Anurag Gupta, founder and CEO of Shoreline.io, sat down with InfoQ podcast host Daniel Bryant to discuss the role of DevOps and site reliability engineering (SRE), day 2 operations, and the importance of building observability into applications and platforms.

Shoreline.io

Jun. 2, 2022

How to Reduce SRE Toil

Get actionable strategies for reducing toil so your SRE and DevOps teams can spend more time on projects that create net-new value for your business.

Chris Newton

VP of Marketing, Shoreline

May. 25, 2022

Operations at the Edge

Processing, analyzing, then acting on observability data entirely within your own environment offers a cheaper, faster, more fault-tolerant, and more secure alternative.

Charles Cary

Chief Technology Officer, Shoreline

Apr. 21, 2022 Article

Debug issues and automate remediation with Shoreline and Datadog

The Shoreline Datadog App enables users to leverage Shoreline's debug and repair features entirely within the Datadog UI.

Chris Newton

VP of Marketing, Shoreline

Apr. 18, 2022 Article

The missing pillar in production operations that is killing innovation

Rethink on-call with automation to tackle repetitive incidents, reduce errors, and free up time for innovation in complex production fleets.

Shoreline.io

Apr. 18, 2022 Article

Shoreline.io announces open source solutions library to deliver self-healing infrastructure

The incident automation company Shoreline.io announced the Shoreline open source solutions library, a collection of Op Packs that make it easier to diagnose and repair common infrastructure incidents in production cloud environments.

Shoreline.io

Apr. 6, 2022

What Is a Runbook?

Streamline IT ops with runbooks to efficiently solve routine problems, paving the way for automation and freeing SMEs for higher-value tasks.

Anurag Gupta

CEO and Founder, Shoreline

Mar. 28, 2022 Article

Shoreline scores $35M Series B to build automated incident response platform

With $35M in new funding, Shoreline is addressing the "missing piece" to automate incident response for production operations.

Shoreline.io

Mar. 28, 2022 Article

Shoreline.io Closes $35 Million Series B to Transform Production Operations

New funding allows company to expand its mission to help customers improve availability, reduce toil, and free up time for engineers to build

Chris Newton

VP of Marketing, Shoreline

Mar. 27, 2022

Shoreline.io Closes $35 Million Series B to Transform Production Operations

New funding allows company to expand its mission to help customers improve availability, reduce toil, and free up time for engineers to build

Chris Newton

VP of Marketing, Shoreline

Mar. 22, 2022

Code to resize a disk in Kubernetes on AWS

Learn how to manually and automatically resize a disk, including error handling

Sanjit Kalapatapu

Software Engineer, Shoreline

Mar. 20, 2022

What Is SRE (Site Reliability Engineering)?

Curious about site reliability engineering (SRE) and how it can help you iron out incidents more efficiently and consistently? Read this guide to learn more.

Chris Newton

VP of Marketing, Shoreline

Mar. 20, 2022

What Is Runbook Automation?

Runbook automation can transform your operations by eliminating repetitive tasks and late-night calls and by enhancing team efficiency and system reliability. Find out what this process is and how your team can implement it.

Gabe Wyatt

Senior Technical Writer / Content Platform Developer, Shoreline

Feb. 17, 2022 Article

Shoreline Extends Multicloud Incident Automation for CloudOps

The biggest CloudOps challenge is often just keeping things running as expected.

Shoreline.io

Feb. 16, 2022

Multi-Cloud Operations can be easy!

Shoreline’s platform hides complexity, eliminates the pain, and makes multi-cloud operations easy on the team.

Adnan Dosani

Founding Engineering Leader, Shoreline

Feb. 16, 2022 Article

Shoreline Announces Multi-Cloud Incident Automation

Added support for GCP and Azure enables Shoreline customers to debug, repair, and automate across multi-cloud environments, reducing operations complexity

Chris Newton

VP of Marketing, Shoreline

Jan. 19, 2022 Article

Tech Startups to Watch in 2022

Shoreline.io was named one of the top 15 promising startups that Database Trends and Applications is watching in 2022.

Shoreline.io

Jan. 19, 2022 Article

16 High-Impact Lessons Every Leader Should Learn From The ‘Tech Giants’

Learn from tech giants like Apple & Google: their strategic choices and a culture accepting failures fuel innovation and success.

Anurag Gupta

CEO and Founder, Shoreline

Jan. 19, 2022 Article

DevOps startup Shoreline brings multicloud incident automation to AWS, Azure and Google Cloud

With the Shoreline.io platform now available on AWS, Azure and Google Cloud, site reliability engineers can significantly improve the availability of cloud-hosted applications and services.

Shoreline.io

Jan. 17, 2022 Article

Predictions for 2022: More Outages and Less SREs to Fix Them

Shoreline COO Ashley Stirrup shares his predictions for 2022.

Shoreline.io

Jan. 6, 2022 Article

SRE hiring trends for 2022

Ashley Stirrup, COO at Shoreline.io, shares his 2022 predictions about the death of the runbook, the rising cost of outages, and SRE hiring trends.

Ashley Stirrup

COO, Shoreline

Dec. 15, 2021 Article

Shoreline launches online notebooks for site reliability engineers

Online Notebooks can be tied to alarms, making it easier to resolve incidents.

Shoreline.io

Dec. 15, 2021 Article

Shoreline.io Reinvents Runbooks with Industry’s First Purpose-Built Notebooks for On-Call Operations

Unlike runbooks, Shoreline Notebooks' real-time debug data and dynamic repair actions enable everyone on call to be as good as their best SRE.

Chris Newton

VP of Marketing, Shoreline

Dec. 2, 2021

Solving Advent of Code Puzzles with Shoreline

Using Shoreline's Oplang and Metrics System to Solve Advent of Code Puzzles

Brian Scheuermann

Software Engineer, Shoreline

Nov. 16, 2021 Article

Shoreline.io Names Chris Newton as Vice President of Marketing

Experienced Marketing Leader Joins Hyper-Growth Incident Automation Company to Scale Go-To-Market Efforts

Ashley Stirrup

COO, Shoreline

Nov. 12, 2021 Article

Shoreline adds Terraform integration to automate production operations remediations

Ensure engineering best practices including code reviews, test pipelines, and version control.

Shoreline.io

Nov. 8, 2021 Article

Centralized vs. Decentralized Operations

Explore the balance between centralized and decentralized ops, and how simple oversights can lead to major outages and urgent fixes.

Shoreline.io

Oct. 26, 2021 Article

Shoreline.io Joins the Datadog Marketplace to Provide Diagnosis, Repair, and Automation Natively Within the Datadog UI

Datadog’s cloud monitoring together with Shoreline’s incident automation provides customers with closed-loop incident detection, diagnosis, and repair

Ashley Stirrup

COO, Shoreline

Oct. 10, 2021 Article

Four lessons every company should learn from the back-to-back Facebook outages

Organizations need to have the right technical and cultural atmosphere in place to reduce the risk, duration, and impact of outages.

Anurag Gupta

CEO and Founder, Shoreline

Oct. 5, 2021 Article

Shoreline.io Delivers Remediation as Infrastructure-as-Code Through New HashiCorp Terraform Verified Provider

Handle remediation with the same Terraform workflow you use to provision and manage infrastructure.

Ashley Stirrup

COO, Shoreline

Sep. 24, 2021

Building Shoreline's Azure Agent During My Summer Internship

Important lessons and valuable experiences while developing Shoreline's Azure Agent.

Tanay Menezes

Software Engineer Intern, Shoreline

Sep. 23, 2021 Article

Shoreline.io Names Ashley Stirrup as Chief Operating Officer

Incident Automation Company Expands Executive Team to Drive Next Phase of Growth

Anurag Gupta

CEO and Founder, Shoreline

Sep. 15, 2021

Infrastructure as Code for Production Ops

DevOps leaders can apply infrastructure as code lessons and tooling to production ops, use solutions like Terraform + Shoreline to automate repeatable tasks, and make hero-level institutional knowledge accessible to anyone.

Charles Cary

Chief Technology Officer, Shoreline

Sep. 10, 2021

Analyze and act upon remediation incidents with Shoreline Events

Unlock the power of automation for fleet-wide remediation. Learn how Alarms, Actions, Bots, and Resources streamline your infrastructure management, all accessible through an intuitive CLI.

Gabe Wyatt

Senior Technical Writer / Content Platform Developer, Shoreline

Sep. 9, 2021

Intern Spotlight: Jainam Shah on Shoreline Notebooks

Jupyter Notebooks for DevOps

Jainam Shah

Software Engineer Intern, Shoreline

Sep. 1, 2021

Automatically Resolve Kubernetes DNS Issues with the CoreDNS Op Pack

Learn how to resolve Kubernetes DNS issues with Shoreline's CoreDNS Op Pack.

Joe Kuo

Solutions Architect, Shoreline

Aug. 30, 2021

Intern Spotlight: Amanda Palamar on Time Series Search

For our Intern Spotlight series, we’ll showcase the work of a summer intern at Shoreline with a technical deep dive.

Amanda Palamar

Software Engineer Intern, Shoreline

Aug. 24, 2021

Prevent Kubernetes IP Exhaustion with Shoreline’s Argo Op Pack

Shoreline’s Argo Op Pack is purpose-built to remediate IP exhaustion related to Argo workflows automatically.

Joe Kuo

Solutions Architect, Shoreline

Aug. 23, 2021

Minimizing Mean Time to Detect: Real Time Alarms with IREE

Execute 1,000s of alarms on box, with 1 second of delay.

Sergiu Iacob

Software Engineer, Shoreline

Aug. 20, 2021

Fleetwide Debugging in Three Easy Steps

Gabe Wyatt

Senior Technical Writer / Content Platform Developer, Shoreline

Jul. 27, 2021 Article

New automation platform aims to help DevOps engineers squash tickets forever

Fix once, automate the solution, and then deploy many times.

Shoreline.io

Jul. 27, 2021 Article

Shoreline Emerges from Stealth to Reduce Operator Toil and Automate the Repair of Incidents Quickly, Safely, and Securely

Company Launches New Platform Designed to Automatically Fix Common Issues that Arise in Operating Systems and to Interactively Debug Issues in Real-time, Across Fleets

Shoreline.io

Jul. 26, 2021

Why I Built Shoreline.io

Reduce on-call fatigue with Shoreline.io's automation, transforming production ops from manual toil to efficient, real-time issue resolution.

Anurag Gupta

CEO and Founder, Shoreline

Jul. 11, 2021

Shoreline Accelerates Ops with JAX & XLA

Shoreline’s metrics team uses JAX and XLA, machine learning technologies from Google, to accelerate metric queries and data analysis so SREs can run ad hoc queries in real time.

Sergiu Iacob

Software Engineer, Shoreline

Jul. 6, 2021 Article

Anurag Gupta of Shoreline.io: Five Things You Need To Create A Highly Successful Startup

An interview with Paul Moss - The intuition of when to persevere and when to pivot is the mark of a great founder.

Shoreline.io

Jul. 5, 2021

What is DevOps Automation

Explore the evolution of DevOps automation, from deployment to operational efficiency, enhancing reliability and reducing manual toil.

Austin Gunter

Director of Marketing, Shoreline

Jul. 2, 2021 Article

Anurag Gupta of Shoreline.io: Second Chapters; How I Reinvented Myself In The Second Chapter Of My Life

Discover Anurag Gupta's journey of reinvention in 'Second Chapters', revealing the pivotal moments and qualities that shaped his path.

Shoreline.io

Jun. 30, 2021 Article

Shoreline Adds Industry Luminaries, Amy Chang and Niall Murphy, to its Advisory Board

Shoreline.io boosts incident automation with new advisory members, aiming for higher uptime and reduced operator toil through cutting-edge automation.

Anurag Gupta

CEO and Founder, Shoreline

Jun. 3, 2021 Article

Shoreline.io Wins Intellyx 2021 Digital Innovator Award

Shoreline.io wins the Intellyx 2021 Digital Innovator Award, showcasing its impact in automating cloud operations and enhancing SRE efficiency.

Anurag Gupta

CEO and Founder, Shoreline

May. 18, 2021

Understanding and Mitigating System Failures

Anurag Gupta's CTO Summit talk "Why Systems Fail" covers four types of system failures and mitigation strategies, drawing from his AWS experience in analytics and database services.

Kristine Newman

VP, Product Marketing, Shoreline

Apr. 25, 2021

Tutorial: Automating Kubernetes Worker Node Retirement

Discover how Shoreline Op Packs streamline the process of retiring and replacing Kubernetes worker nodes, ensuring seamless updates and preventing outages.

Narendra Nath Challa

Apr. 1, 2021

Transitioning from SRE to Backend Engineering

SRE and Backend Engineering have a lot of overlap, and you can swap between roles relatively easily. This post addresses the pros and cons of leaving SRE for Backend work.

Charles Cary

Chief Technology Officer, Shoreline

Mar. 3, 2021

Runbooks vs Playbooks: Understanding Their Distinct Roles

Runbooks are tactical guides for specific tasks, aiming for automation. Playbooks are broader, strategizing over processes and integrating runbooks for efficiency in operations, reducing error and toil.

Anurag Gupta

CEO and Founder, Shoreline

Feb. 24, 2021

The Guide to Automating Runbook Execution

Automating runbooks streamlines operations, shifting from manual to machine-executed tasks for efficiency in complex environments like Kubernetes. Shoreline's solution, based on real-world experiences, enhances maintenance and incident handling.

Charles Cary

Chief Technology Officer, Shoreline

Feb. 15, 2021

Restarts and rollbacks don't fix everything

Automation and streamlining in operations are promoted as universal solutions, but declarative infrastructure and programmatic deployments have limits.

Charles Cary

Chief Technology Officer, Shoreline