In this incident type, there is an issue with a Cassandra cluster where one or more disks are running slow. This can cause performance issues and potentially lead to data loss or downtime. The goal is to identify and address the specific disk(s) causing the problem in order to restore normal cluster operations.
Parameters
Debug
Check disk usage and identify high usage disks
Check disk I/O and identify slow disks
Check disk read and write performance
Check for errors in the system log related to disk I/O
Check for disk errors and bad sectors
Check for file system errors and corruption
Repair
Identify the specific disk(s) causing the issue by monitoring disk usage and performance metrics.
Learn more
Related Runbooks
Check out these related runbooks to help you debug and resolve similar issues.