Runbook

Elasticsearch disk out of space incident.

Back to Runbooks

Overview

In this type of incident, Elasticsearch, a search and analytics engine, has run out of disk space on a server instance. This can result in various issues such as slow performance, inability to index new data, and even system crashes. It is important to resolve this issue quickly to avoid further disruption to the system.

Parameters

Debug

1. Verify disk usage on the Elasticsearch instance

2. Identify largest directories on the Elasticsearch instance

4. Check Elasticsearch cluster health

5. Check Elasticsearch cluster status

6. Check Elasticsearch indices status

7. Check Elasticsearch shards status

The Elasticsearch retention policy is not configured correctly, leading to excessive data storage.

The Elasticsearch backups have not been performed as scheduled, leading to insufficient disk space.

Repair

Add more disk space to the Elasticsearch server to prevent future incidents of disk out of space.

Implement a mechanism to automatically delete old Elasticsearch data that is no longer required.

Add more disk space to the Elasticsearch server to prevent future incidents of disk out of space.

Learn more

Related Runbooks

Check out these related runbooks to help you debug and resolve similar issues.