Runbook

High urgency incident related to host context switching.

Back to Runbooks

Overview

This incident type relates to a high urgency issue regarding host context switching. The incident is triggered when the context switching grows on the node beyond a certain level, typically over 10000 per second. This issue can cause performance degradation and impact the stability of the system. The incident requires immediate attention from a software engineer to identify the root cause and take the necessary steps to resolve the issue.

Parameters

Debug

Check the CPU usage of the affected instance

Check the memory usage of the affected instance

Check the number of context switches per second on the affected instance

Check the network connection status of the affected instance

Check the disk usage of the affected instance

Check the status of the affected service

The system may be overloaded with too many requests, causing the CPU to switch between different processes frequently, leading to high context switching rates.

Repair

Consider restarting the affected service or host.

Learn more

Related Runbooks

Check out these related runbooks to help you debug and resolve similar issues.