The "Kubernetes CPU Availability Low" incident type refers to an incident where the available CPU for requests in a Kubernetes cluster is too low. This can cause performance issues and potential downtime for the affected service. This incident may be triggered by a monitoring system or an alert from Kubernetes itself. It is important to investigate and resolve this incident promptly to ensure the service is operating optimally.
Parameters
Debug
Check the CPU usage of all the pods in the Kubernetes cluster
Check the CPU usage of all the nodes in the Kubernetes cluster
Check the CPU limits and requests of the pods in the Kubernetes cluster
Check the CPU usage of the containers in a specific pod
Check the Kubernetes events to see if there are any relevant error messages
Resource limits were not set properly in containers, leading to overconsumption of CPU resources.
Inadequate resource allocation by the Kubernetes cluster, leading to insufficient CPU resources being available.
Repair
Increase the resources allocated to the Kubernetes cluster to avoid CPU contention.
Identify and terminate idle or unused containers to free up resources.
Learn more
Related Runbooks
Check out these related runbooks to help you debug and resolve similar issues.