Runbook

Slow Disk in Cassandra Cluster

Back to Runbooks

Overview

In this incident type, there is an issue with a Cassandra cluster where one or more disks are running slow. This can cause performance issues and potentially lead to data loss or downtime. The goal is to identify and address the specific disk(s) causing the problem in order to restore normal cluster operations.

Parameters

Debug

Check disk usage and identify high usage disks

Check disk I/O and identify slow disks

Check disk read and write performance

Check for disk errors and bad sectors

Check for file system errors and corruption

Repair

Identify the specific disk(s) causing the issue by monitoring disk usage and performance metrics.