Runbook

Host systemd service crashed (instance) incident.

Back to Runbooks

Overview

This incident type refers to the failure of a systemd service on a particular host instance. The incident could be triggered by various causes such as a software bug, hardware failure, or system overload. This type of incident can cause downtime or service disruption to the affected host instance, which may require immediate resolution to restore normal operations.

Parameters

Debug

Check the status of the systemd service on the affected host instance

Check the system logs for any relevant error messages

Check the CPU and memory usage on the affected host instance

Check the disk usage and available space on the affected host instance

Check the network connectivity on the affected host instance

Check the firewall rules on the affected host instance

Check the hardware status of the affected host instance

The host system's resources were overloaded due to high usage or traffic, causing the systemd service to fail.

Repair

Restart the systemd service on the affected host instance: This can be done to try and resolve the issue by manually restarting the systemd service on the affected host instance. If the failure was due to a temporary issue, the service should resume normal operations after restarting.

Learn more

Related Runbooks

Check out these related runbooks to help you debug and resolve similar issues.