Runbook

MongoDB Replication Lag Incident

Overview

A MongoDB replication lag incident occurs when secondaries apply writes noticeably later than the primary. While a secondary lags, reads routed to it return stale data, and writes that wait for acknowledgment from secondaries (for example with w: "majority") slow down. A secondary that falls further behind than the oplog window covers needs a full resync. Identify and resolve the lag promptly to limit these effects on the application.

Debug

Connect to the MongoDB instance with <host> and <port>
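
A minimal Python sketch of this step, assuming the pymongo driver is installed and the node is reachable from your host. The helper names `build_uri` and `connect` are illustrative and not part of the runbook; authentication and TLS options are omitted and would need to be added for most production deployments.

```python
def build_uri(host: str, port: int) -> str:
    """Build a URI that targets exactly one node instead of discovering the whole replica set."""
    if not 0 < port < 65536:
        raise ValueError(f"invalid port: {port}")
    return f"mongodb://{host}:{port}/?directConnection=true"


def connect(host: str, port: int, timeout_ms: int = 5000):
    """Open a client and confirm the node answers a ping (requires pymongo)."""
    from pymongo import MongoClient  # imported lazily so build_uri works without the driver

    client = MongoClient(build_uri(host, port), serverSelectionTimeoutMS=timeout_ms)
    client.admin.command("ping")  # raises if the node cannot be reached within the timeout
    return client
```

Usage would be `client = connect("<host>", <port>)`. A direct connection is used so that later checks report what this specific node sees.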

Check the status of the replica set
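
As a rough illustration, the status can be read from the `replSetGetStatus` admin command and reduced to one line per member. The helper `summarize_members` and the sample document (including its hostnames) are assumptions made for this sketch; with a live pymongo client the document would come from `client.admin.command("replSetGetStatus")`.

```python
def summarize_members(status: dict) -> list:
    """Return (name, state, healthy) for each member of a replSetGetStatus document."""
    return [
        (m["name"], m.get("stateStr", "UNKNOWN"), m.get("health", 0) == 1)
        for m in status.get("members", [])
    ]


# Illustrative document in the shape replSetGetStatus returns; hostnames are made up.
sample_status = {
    "set": "rs0",
    "members": [
        {"name": "db1:27017", "stateStr": "PRIMARY", "health": 1.0},
        {"name": "db2:27017", "stateStr": "SECONDARY", "health": 1.0},
        {"name": "db3:27017", "stateStr": "(not reachable/healthy)", "health": 0.0},
    ],
}

for name, state, healthy in summarize_members(sample_status):
    print(f"{name:12} {state:25} {'ok' if healthy else 'UNHEALTHY'}")
```

Any member that is not PRIMARY or SECONDARY, or that reports a health of 0, is worth investigating before looking at lag figures.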

Check the replication lag for all members of the replica set

Check the replication lag for a specific member of the replica set
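
The two lag checks above can be sketched as one function: lag is taken as the difference between the primary's `optimeDate` and each secondary's `optimeDate` in the `replSetGetStatus` output, which is similar to what `rs.printSecondaryReplicationInfo()` reports in mongosh. The function name, hostnames, and timestamps below are illustrative assumptions.

```python
from datetime import datetime, timedelta
from typing import Optional


def replication_lag_seconds(status: dict, member_name: Optional[str] = None) -> dict:
    """Map each secondary's name to its lag behind the primary, in seconds.

    Pass member_name to restrict the result to a single member.
    """
    members = status.get("members", [])
    primary = next((m for m in members if m.get("stateStr") == "PRIMARY"), None)
    if primary is None:
        raise RuntimeError("no primary in replSetGetStatus output")
    lags = {}
    for m in members:
        if m.get("stateStr") != "SECONDARY":
            continue
        if member_name is not None and m["name"] != member_name:
            continue
        lags[m["name"]] = (primary["optimeDate"] - m["optimeDate"]).total_seconds()
    return lags


# Illustrative data; with pymongo: status = client.admin.command("replSetGetStatus")
now = datetime(2024, 1, 1, 12, 0, 0)
sample_status = {
    "members": [
        {"name": "db1:27017", "stateStr": "PRIMARY", "optimeDate": now},
        {"name": "db2:27017", "stateStr": "SECONDARY", "optimeDate": now - timedelta(seconds=4)},
        {"name": "db3:27017", "stateStr": "SECONDARY", "optimeDate": now - timedelta(seconds=90)},
    ]
}
print(replication_lag_seconds(sample_status))               # all secondaries
print(replication_lag_seconds(sample_status, "db3:27017"))  # one member
```

On an idle replica set the computed value can look larger than the real lag, because `optimeDate` only advances when there are writes to replicate.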

Check the oplog size for all members of the replica set

Check the oplog size for a specific member of the replica set
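
A sketch of the oplog checks, assuming pymongo and a direct connection to each node being inspected. It reads the configured size from `collStats` on `local.oplog.rs` and derives the time window from the first and last oplog entries, which is roughly what `rs.printReplicationInfo()` shows in mongosh. The helper names are hypothetical, and `collStats` is deprecated in recent MongoDB releases, so treat this as a starting point rather than a definitive method.

```python
def oplog_summary(coll_stats: dict, first_ts_seconds: int, last_ts_seconds: int) -> dict:
    """Summarize oplog capacity and the time window it currently covers."""
    mb = 1024 * 1024
    return {
        "max_size_mb": coll_stats["maxSize"] / mb,  # configured oplog size
        "used_mb": coll_stats["size"] / mb,         # data currently held
        "window_hours": (last_ts_seconds - first_ts_seconds) / 3600,
    }


def fetch_oplog_summary(client) -> dict:
    """Collect the inputs from one node (requires a live pymongo client)."""
    oplog = client.local["oplog.rs"]
    stats = client.local.command("collStats", "oplog.rs")
    first = oplog.find_one(sort=[("$natural", 1)])   # oldest entry
    last = oplog.find_one(sort=[("$natural", -1)])   # newest entry
    return oplog_summary(stats, first["ts"].time, last["ts"].time)
```

To cover all members, call `fetch_oplog_summary` once per member with a client connected directly to that member; for a specific member, call it once. A window shorter than the observed lag means the secondary is at risk of needing a full resync.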

Check the slow query log for operations that may be contributing to replication lag
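
One possible way to scan for slow operations, assuming MongoDB 4.4 or later, where mongod writes structured JSON log lines and slow operations carry the message "Slow query" with a `durationMillis` attribute. The function name, threshold, and sample lines are illustrative; the log file location depends on your deployment.

```python
import json


def slow_queries(log_lines, min_millis: int = 100):
    """Yield (durationMillis, namespace) for slow-query entries at or above the threshold."""
    for line in log_lines:
        try:
            entry = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip lines that are not JSON, such as older plain-text logs
        if not isinstance(entry, dict) or entry.get("msg") != "Slow query":
            continue
        attr = entry.get("attr", {})
        duration = attr.get("durationMillis", 0)
        if duration >= min_millis:
            yield duration, attr.get("ns", "?")


# Made-up sample lines in the structured log shape.
sample_lines = [
    '{"c":"COMMAND","msg":"Slow query","attr":{"ns":"shop.orders","durationMillis":2500}}',
    '{"c":"COMMAND","msg":"Slow query","attr":{"ns":"shop.carts","durationMillis":40}}',
    '{"c":"NETWORK","msg":"Connection accepted","attr":{}}',
    "not a json line",
]
for duration, ns in slow_queries(sample_lines):
    print(f"{duration} ms  {ns}")
```

With a real log, pass an open file object as `log_lines`. Long-running writes and index builds on the primary are common contributors, since secondaries must replay them.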

Check the network latency between the replica set members
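
A rough latency probe using only the Python standard library: it times TCP handshakes to a member's MongoDB port. This approximates round-trip time between the machine running the script and the target, so it should be run from one replica set member against another. The function names and sample count are assumptions for this sketch.

```python
import socket
import statistics
import time


def tcp_connect_millis(host: str, port: int, timeout: float = 3.0) -> float:
    """Time a single TCP handshake to host:port, in milliseconds."""
    start = time.perf_counter()
    with socket.create_connection((host, port), timeout=timeout):
        pass  # connection is closed immediately; only the handshake is timed
    return (time.perf_counter() - start) * 1000.0


def summarize_samples(samples: list) -> dict:
    """Reduce a list of latency samples to min, median, and max."""
    return {
        "min": min(samples),
        "median": statistics.median(samples),
        "max": max(samples),
    }


def probe(host: str, port: int, count: int = 5) -> dict:
    """Take several handshake timings and summarize them."""
    return summarize_samples([tcp_connect_millis(host, port) for _ in range(count)])
```

For example, `probe("<host>", <port>)` run on the primary against each secondary gives a quick view of whether one link is notably slower than the others.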

Check the network throughput between the replica set members
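
One way to estimate throughput without extra tooling is to sample the `network.bytesIn` and `network.bytesOut` counters from `serverStatus` twice and convert the difference to a rate. This measures the traffic the mongod process actually handled, not the capacity of the link; measuring capacity would need a dedicated tool such as iperf3. The function name and the sample figures are illustrative.

```python
def network_throughput_mbps(before: dict, after: dict, interval_seconds: float) -> dict:
    """Convert two serverStatus 'network' samples into megabits per second."""
    if interval_seconds <= 0:
        raise ValueError("interval_seconds must be positive")

    def mbps(key: str) -> float:
        return (after[key] - before[key]) * 8 / interval_seconds / 1_000_000

    return {"in_mbps": mbps("bytesIn"), "out_mbps": mbps("bytesOut")}


# With pymongo, each sample would be: client.admin.command("serverStatus")["network"]
before = {"bytesIn": 0, "bytesOut": 0}
after = {"bytesIn": 12_500_000, "bytesOut": 25_000_000}
print(network_throughput_mbps(before, after, interval_seconds=10))
```

A sustained outbound rate on the primary close to the link's known capacity suggests the network, rather than the secondaries, is limiting replication.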

Repair

Define the hostnames or IP addresses of the primary and secondary nodes

Check the status of the primary node

Check the status of the secondary node
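
The three steps above can be sketched as follows, assuming pymongo and MongoDB versions that support the `hello` command (older servers answer `isMaster` with an `ismaster` field instead). The hostnames are placeholders and the helper names are hypothetical.

```python
PRIMARY_HOST = "mongo-primary.example.internal:27017"      # placeholder, replace with your node
SECONDARY_HOST = "mongo-secondary.example.internal:27017"  # placeholder, replace with your node


def node_role(hello: dict) -> str:
    """Classify a node from its hello (or legacy isMaster) response."""
    if "setName" not in hello:
        return "not in a replica set"
    if hello.get("isWritablePrimary") or hello.get("ismaster"):
        return "primary"
    if hello.get("secondary"):
        return "secondary"
    return "other"  # for example an arbiter, or a member that is starting up or recovering


def check_node(host: str) -> str:
    """Ask one node for its role over a direct connection (requires pymongo)."""
    from pymongo import MongoClient  # imported lazily so node_role works without the driver

    client = MongoClient(f"mongodb://{host}/?directConnection=true",
                         serverSelectionTimeoutMS=5000)
    try:
        return node_role(client.admin.command("hello"))
    finally:
        client.close()
```

Calling `check_node(PRIMARY_HOST)` and `check_node(SECONDARY_HOST)` confirms that each node currently holds the role you expect before any settings are changed.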

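Reduce the write concern (for example from w: "majority" to w: 1) so that writes no longer wait for acknowledgment from lagging secondaries. This lowers write latency for the application but does not make the secondaries catch up faster, and writes acknowledged only by the primary can be rolled back after a failover.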
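
One way to change the cluster-wide default is the `setDefaultRWConcern` admin command, available in MongoDB 4.4 and later. The sketch below only builds the command document; the helper name and the 5000 ms timeout are assumptions, and the helper deliberately accepts only "majority" or a positive integer for `w`. Lowering the write concern trades durability for latency, so it is best treated as a temporary measure.

```python
def default_write_concern_command(w, wtimeout_ms: int = 5000) -> dict:
    """Build a setDefaultRWConcern command document for the given write concern."""
    is_count = isinstance(w, int) and not isinstance(w, bool) and w >= 1
    if not (w == "majority" or is_count):
        raise ValueError("w must be 'majority' or an integer >= 1")
    return {
        "setDefaultRWConcern": 1,
        "defaultWriteConcern": {"w": w, "wtimeout": wtimeout_ms},
    }


# With pymongo, connected to the primary (or a mongos):
#   client.admin.command(default_write_concern_command(1))
print(default_write_concern_command(1))
```

Operations that set their own write concern explicitly are not affected by the default, so application code may also need to be checked.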

Increase the replication buffer size (in MongoDB, the oplog) so that a lagging secondary has a longer window in which to catch up before the entries it still needs are overwritten and a full resync becomes necessary.
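
MongoDB has no setting literally named "replication buffer size"; assuming this step refers to the oplog, its size can be changed at runtime with the `replSetResizeOplog` admin command (WiredTiger storage engine, MongoDB 3.6 and later). The command takes the size in megabytes as a double, with a documented minimum of 990. The helper name below is hypothetical, and the sketch only builds the command document.

```python
MIN_OPLOG_MB = 990  # minimum size accepted by replSetResizeOplog


def resize_oplog_command(size_mb: float) -> dict:
    """Build a replSetResizeOplog command document for the requested size in megabytes."""
    if size_mb < MIN_OPLOG_MB:
        raise ValueError(f"oplog size must be at least {MIN_OPLOG_MB} MB")
    return {"replSetResizeOplog": 1, "size": float(size_mb)}


# With pymongo, connected directly to the member being resized:
#   client.admin.command(resize_oplog_command(16384))
print(resize_oplog_command(2048))
```

The command applies only to the member it is run on, so it has to be repeated on every member; MongoDB's documentation recommends resizing the secondaries first and the primary last. Check free disk space on each node beforehand.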
