Runbook

Kafka Data Loss Incident

Back to Runbooks

Overview

The Kafka Data Loss Incident refers to an incident where there is a misconfiguration of the data retention policy in Kafka, which can result in data loss. Kafka is a distributed streaming platform that is commonly used in big data environments. When data retention policies are misconfigured, data that should be retained is deleted, or data that should be deleted is retained. This can lead to significant data loss and can have serious consequences for businesses that rely on the data stored in Kafka.

Parameters

Debug

Check if Kafka is running

Check the Kafka logs for any errors

Check if the Kafka retention policy is configured correctly

Check the disk usage of the Kafka logs

Check if the topic in question is still available

Check the status of the Kafka brokers

Repair

Review and update Kafka's data retention policy to ensure that it is correctly configured and aligned with your data retention requirements.

Learn more

Related Runbooks

Check out these related runbooks to help you debug and resolve similar issues.