This incident occurs when disk space on the existing nodes of a Cassandra cluster is not reclaimed after new nodes are added. The affected nodes can run out of disk space, which degrades performance and can lead to node or cluster failure. A common cause is that nodetool cleanup was not run after the new nodes joined: Cassandra does not automatically delete data for token ranges that have moved to the new nodes. Misconfiguration or software bugs can also be responsible. Address the issue promptly to prevent data loss and keep the cluster stable.
Parameters
Debug
Check disk usage on all nodes
Check available disk space
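The two disk checks above can be sketched as a small per-node helper. This is a minimal illustration, not a prescribed procedure: the data directory path and the 50% threshold are assumptions chosen for the example, so adjust both to your installation and compaction strategy.

```python
import shutil

# Assumption: data directory of a typical package install. Use the
# data_file_directories value from your cassandra.yaml instead.
DATA_DIR = "/var/lib/cassandra/data"


def disk_report(path):
    """Return total, used, and free bytes plus the used percentage
    for the filesystem that holds `path`."""
    usage = shutil.disk_usage(path)
    used_pct = 100.0 * usage.used / usage.total if usage.total else 0.0
    return {
        "path": path,
        "total": usage.total,
        "used": usage.used,
        "free": usage.free,
        "used_pct": round(used_pct, 1),
    }


def is_low_on_space(report, max_used_pct=50.0):
    """Flag a node whose used percentage exceeds the threshold.
    The default threshold is illustrative, not a Cassandra setting."""
    return report["used_pct"] > max_used_pct
```

Run disk_report(DATA_DIR) on every node and compare the results. Pre-existing nodes that stay much fuller than the newly added ones are the likely candidates for unreclaimed space.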
Check if compaction is stalled
Check if there are any pending tasks
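One way to approach the compaction and pending-task checks is to sample nodetool compactionstats a few times and see whether the backlog drains. The sketch below assumes the output contains a "pending tasks: N" line and that nodetool is on the PATH of the node where it runs. The function names, sample count, and interval are hypothetical, and the "not draining" test is a heuristic rather than a definitive stall detector.

```python
import re
import subprocess
import time

# Matches the summary line printed by `nodetool compactionstats`.
PENDING_RE = re.compile(r"^pending tasks:\s*(\d+)", re.MULTILINE)


def pending_compactions(compactionstats_output):
    """Extract the pending task count; return None if the line is absent."""
    match = PENDING_RE.search(compactionstats_output)
    return int(match.group(1)) if match else None


def backlog_not_draining(samples):
    """Heuristic: work is still pending and the count never decreased
    between consecutive samples."""
    counts = [c for c in samples if c is not None]
    if len(counts) < 2 or counts[-1] == 0:
        return False
    return all(later >= earlier for earlier, later in zip(counts, counts[1:]))


def sample_pending(n=3, interval_s=60):
    """Run `nodetool compactionstats` n times, interval_s seconds apart."""
    samples = []
    for i in range(n):
        out = subprocess.run(
            ["nodetool", "compactionstats"],
            capture_output=True, text=True, check=True,
        ).stdout
        samples.append(pending_compactions(out))
        if i < n - 1:
            time.sleep(interval_s)
    return samples
```

Calling backlog_not_draining(sample_pending()) on a node gives a first indication. A backlog that holds steady or grows over several minutes is worth correlating with the system log.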
Check if there are any errors in the system log
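The log check can be sketched as a filter over the system log. It assumes the default log location of a package install and the default log layout in which each entry starts with its level. The hint strings are illustrative search terms, not a list of exact Cassandra messages.

```python
import re

# Assumption: default log location for package installs.
SYSTEM_LOG = "/var/log/cassandra/system.log"

# Assumption: default log layout, where each entry starts with its level.
LEVEL_RE = re.compile(r"^(ERROR|WARN)\b")

# Illustrative keywords used only to rank disk-related entries first.
DISK_HINTS = ("no space left on device", "not enough space", "disk")


def find_problems(lines, hints=DISK_HINTS):
    """Return (line_number, text) pairs for ERROR/WARN entries,
    with entries that mention a disk-related hint listed first."""
    hits = [
        (number, line.rstrip("\n"))
        for number, line in enumerate(lines, start=1)
        if LEVEL_RE.match(line)
    ]

    def is_disk_related(hit):
        text = hit[1].lower()
        return any(hint in text for hint in hints)

    # sorted() is stable, so entries keep log order within each group.
    return sorted(hits, key=lambda hit: not is_disk_related(hit))


def scan_log(path=SYSTEM_LOG):
    """Read the log file and return the ranked problem entries."""
    with open(path, errors="replace") as handle:
        return find_problems(handle)
```

Stack trace continuation lines do not start with a level and are skipped, so open the log at the reported line numbers to read the full context.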
Check if there are any pending repairs
Check if nodetool cleanup has been run on the pre-existing nodes since the new nodes joined
Check if nodetool scrub has been run
Check if nodetool compact has been run
Repair
Review the Cassandra cluster configuration to ensure that it is properly set up for the number of nodes currently in use and the expected growth of the cluster.
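Run nodetool cleanup on each node that was in the cluster before the new nodes were added, one node at a time, to remove data for token ranges the node no longer owns. Note that nodetool repair synchronizes data between replicas and does not free disk space.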
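For space left behind after adding nodes, the nodetool command that removes data a node no longer owns is nodetool cleanup. The following is a minimal sketch of running it on one node at a time. It assumes SSH access to each node with nodetool on the remote PATH; the helper names and host names are hypothetical.

```python
import subprocess


def cleanup_command(host, keyspace=None):
    """Build the argv that runs `nodetool cleanup` on `host` over SSH.
    Assumption: SSH access is available; adapt if you use another
    remote execution mechanism."""
    argv = ["ssh", host, "nodetool", "cleanup"]
    if keyspace:
        argv.append(keyspace)
    return argv


def rolling_cleanup(hosts, keyspace=None, runner=subprocess.run):
    """Run cleanup on one host at a time and stop at the first failure.
    Returns the list of hosts that completed successfully."""
    done = []
    for host in hosts:
        result = runner(cleanup_command(host, keyspace))
        if result.returncode != 0:
            break
        done.append(host)
    return done
```

A call such as rolling_cleanup(["node1", "node2"]) would process the listed nodes sequentially. Cleanup rewrites SSTables, so confirm that each node has some free disk space before starting, and skip the newly added nodes, which hold only data they own.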