Runbook

Kubernetes CPU Availability Low

Back to Runbooks

Overview

The "Kubernetes CPU Availability Low" incident type refers to an incident where the available CPU for requests in a Kubernetes cluster is too low. This can cause performance issues and potential downtime for the affected service. This incident may be triggered by a monitoring system or an alert from Kubernetes itself. It is important to investigate and resolve this incident promptly to ensure the service is operating optimally.

Parameters

Debug

Check the CPU usage of all the pods in the Kubernetes cluster

Check the CPU usage of all the nodes in the Kubernetes cluster

Check the CPU limits and requests of the pods in the Kubernetes cluster

Check the CPU usage of the containers in a specific pod

Check the Kubernetes events to see if there are any relevant error messages

Resource limits were not set properly in containers, leading to overconsumption of CPU resources.

Inadequate resource allocation by the Kubernetes cluster, leading to insufficient CPU resources being available.

Repair

Increase the resources allocated to the Kubernetes cluster to avoid CPU contention.

Identify and terminate idle or unused containers to free up resources.

Learn more

Related Runbooks

Check out these related runbooks to help you debug and resolve similar issues.