Runbook

Cassandra Disk Space Not Reclaimed After Adding New Nodes

Back to Runbooks

Overview

This incident type occurs when disk space in a Cassandra cluster is not reclaimed after adding new nodes. This can lead to the cluster running out of disk space, causing performance issues or even complete failure. The root cause can vary, but it is often due to misconfigurations or software bugs. It is important to address this issue promptly to prevent data loss and ensure the stability of the Cassandra cluster.

Parameters

Debug

Check disk usage on all nodes

Check available disk space

Check if compaction is stalled

Check if there are any pending tasks

Check if there are any errors in the system log

Check if there are any pending repairs

Check if nodetool cleanup has been run

Check if nodetool scrub has been run

Check if nodetool compact has been run

Repair

Review the Cassandra cluster configuration to ensure that it is properly set up for the number of nodes currently in use and the expected growth of the cluster.

Run a repair on the Cassandra cluster to clean up any old or unused data that may be taking up disk space.