This incident occurs when pods in a Kubernetes cluster fail to schedule, which prevents the application from scaling. Kubernetes is a container orchestration platform that automates the deployment, scaling, and management of containerized applications, and pods are its smallest deployable units. A pod that cannot be scheduled stays in the Pending state because the Kubernetes scheduler cannot find a node that satisfies its requirements, typically because no node has enough free CPU or memory, or because taints, node selectors, or affinity rules exclude every candidate node. Until the pod is placed, the application runs with fewer replicas than requested, which can degrade performance and limit scalability.
Parameters
Debug
Check if pods are pending
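One way to run this check is sketched below. It assumes `kubectl` is on the PATH and already pointed at the affected cluster; the helper names `kubectl_json` and `pending_pods` are illustrative and not part of any library.

```python
import json
import subprocess


def kubectl_json(*args):
    """Run `kubectl <args> -o json` and return the parsed output."""
    result = subprocess.run(
        ["kubectl", *args, "-o", "json"],
        check=True, capture_output=True, text=True,
    )
    return json.loads(result.stdout)


def pending_pods(pod_list):
    """Return "namespace/name" for every pod whose phase is Pending."""
    return [
        f"{pod['metadata']['namespace']}/{pod['metadata']['name']}"
        for pod in pod_list.get("items", [])
        if pod.get("status", {}).get("phase") == "Pending"
    ]
```

Calling `pending_pods(kubectl_json("get", "pods", "--all-namespaces"))` lists the pods that are still waiting for a node. The same information is available directly with `kubectl get pods --all-namespaces --field-selector=status.phase=Pending`.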
Check if there are enough resources available to schedule the pod
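The scheduler compares a pod's resource requests with each node's allocatable capacity minus the requests of pods already placed there. The sketch below approximates that calculation from the output of `kubectl get nodes -o json` and `kubectl get pods --all-namespaces -o json`, both already parsed with `json.load`. It is a simplification: init containers and pod overhead are ignored, only the most common quantity suffixes are handled, and the names `parse_quantity` and `free_capacity` are hypothetical.

```python
from fractions import Fraction

# Two-character binary suffixes are listed first so that "Mi" is matched
# before "M" (dicts preserve insertion order).
_SUFFIXES = {
    "Ki": Fraction(2**10), "Mi": Fraction(2**20),
    "Gi": Fraction(2**30), "Ti": Fraction(2**40),
    "m": Fraction(1, 1000),
    "k": Fraction(10**3), "M": Fraction(10**6),
    "G": Fraction(10**9), "T": Fraction(10**12),
}


def parse_quantity(quantity):
    """Convert a quantity such as "500m" or "4Gi" to cores or bytes."""
    for suffix, factor in _SUFFIXES.items():
        if quantity.endswith(suffix):
            return Fraction(quantity[: -len(suffix)]) * factor
    return Fraction(quantity)


def free_capacity(node_list, pod_list):
    """Return allocatable minus requested CPU and memory for each node."""
    free = {}
    for node in node_list.get("items", []):
        allocatable = node["status"]["allocatable"]
        free[node["metadata"]["name"]] = {
            "cpu": parse_quantity(allocatable["cpu"]),
            "memory": parse_quantity(allocatable["memory"]),
        }
    for pod in pod_list.get("items", []):
        node_name = pod["spec"].get("nodeName")
        phase = pod.get("status", {}).get("phase")
        # Unscheduled and finished pods do not hold resources on a node.
        if node_name not in free or phase in ("Succeeded", "Failed"):
            continue
        for container in pod["spec"].get("containers", []):
            requests = container.get("resources", {}).get("requests", {})
            for resource in ("cpu", "memory"):
                if resource in requests:
                    free[node_name][resource] -= parse_quantity(requests[resource])
    return free
```

If every node has less free CPU or memory than the pending pod requests, the pod cannot be placed. The "Allocated resources" section of `kubectl describe nodes` gives a comparable per-node summary without any scripting.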
Check if there are any taints or affinity rules preventing the pod from being scheduled
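A minimal sketch of this check follows. It takes one pod and one node as parsed JSON objects (for example from `kubectl get pod <name> -o json` and `kubectl get node <name> -o json`) and reports the taints the pod does not tolerate and the `nodeSelector` labels the node lacks. Full `nodeAffinity` and pod affinity expressions are not evaluated here, and the function names are hypothetical.

```python
def tolerates(toleration, taint):
    """Return True if a single toleration matches a single taint."""
    effect = toleration.get("effect")
    if effect and effect != taint.get("effect"):
        return False
    operator = toleration.get("operator", "Equal")
    key = toleration.get("key")
    if not key:
        # An empty key combined with Exists tolerates every taint.
        return operator == "Exists"
    if key != taint.get("key"):
        return False
    if operator == "Exists":
        return True
    return toleration.get("value", "") == taint.get("value", "")


def blocking_taints(pod, node):
    """Return the node's hard taints that the pod does not tolerate."""
    tolerations = pod["spec"].get("tolerations", [])
    return [
        taint
        for taint in node.get("spec", {}).get("taints", [])
        # PreferNoSchedule is only a preference, so it never blocks.
        if taint.get("effect") in ("NoSchedule", "NoExecute")
        and not any(tolerates(t, taint) for t in tolerations)
    ]


def unmatched_selector(pod, node):
    """Return nodeSelector entries that the node's labels do not satisfy."""
    labels = node["metadata"].get("labels", {})
    selector = pod["spec"].get("nodeSelector", {})
    return {k: v for k, v in selector.items() if labels.get(k) != v}
```

A node is a candidate for the pod only when both functions return an empty result. If every node returns something, either the pod spec needs a matching toleration or label selector, or a node needs to be relabeled or untainted.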
Check if there are any events related to the pod scheduling failure
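When the scheduler cannot place a pod it records an event with the reason `FailedScheduling`, and the event message states why each node was rejected. The sketch below collects those events, again assuming `kubectl` is on the PATH; `kubectl_json` and `scheduling_failures` are illustrative names.

```python
import json
import subprocess


def kubectl_json(*args):
    """Run `kubectl <args> -o json` and return the parsed output."""
    result = subprocess.run(
        ["kubectl", *args, "-o", "json"],
        check=True, capture_output=True, text=True,
    )
    return json.loads(result.stdout)


def scheduling_failures(event_list):
    """Return (namespace/name, message) for each FailedScheduling event."""
    failures = []
    for event in event_list.get("items", []):
        if event.get("reason") != "FailedScheduling":
            continue
        target = event.get("involvedObject", {})
        name = f"{target.get('namespace', '')}/{target.get('name', '')}"
        failures.append((name, event.get("message", "")))
    return failures
```

For example, `scheduling_failures(kubectl_json("get", "events", "--all-namespaces"))` returns one entry per failed scheduling attempt. For a single pod, the Events section at the end of `kubectl describe pod <name>` shows the same messages.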
Repair
Check whether the nodes have enough free resources to schedule the pending pods. If they do not, increase the capacity of the existing nodes or provision new ones by scaling the node group or node pool of the EKS, AKS, or GKE cluster.
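The scaling step differs by provider. The sketch below only builds the command line for each provider's CLI (`eksctl`, `az`, `gcloud`) so it can be reviewed before it is run. The function name and its parameters are hypothetical, the cluster, pool, and resource group names must come from your environment, and zone or region flags are omitted on the assumption that CLI defaults are configured.

```python
def scale_command(provider, cluster, pool, nodes, resource_group=None):
    """Build the CLI command that resizes a node group or node pool."""
    count = str(nodes)
    if provider == "eks":
        return ["eksctl", "scale", "nodegroup",
                "--cluster", cluster, "--name", pool, "--nodes", count]
    if provider == "aks":
        if not resource_group:
            raise ValueError("AKS scaling requires a resource group")
        return ["az", "aks", "scale",
                "--resource-group", resource_group, "--name", cluster,
                "--nodepool-name", pool, "--node-count", count]
    if provider == "gke":
        return ["gcloud", "container", "clusters", "resize", cluster,
                "--node-pool", pool, "--num-nodes", count]
    raise ValueError(f"unknown provider: {provider}")
```

The returned list can be passed to `subprocess.run(command, check=True)` once it has been checked. On EKS the node group's maximum size may need to be raised before a larger node count is accepted, and clusters that use an autoscaler may need its limits adjusted instead of a manual resize.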