Runbook
Long running tasks in Apache Airflow
Back to Runbooks
Overview
This incident type refers to a situation where tasks in Apache Airflow take longer to execute than expected, causing delays and potentially impacting the system's performance. This can happen due to various reasons such as a large volume of data, inefficient code, or resource constraints, among others. Identifying and addressing the root cause of this issue is crucial to ensure the smooth functioning of the system and timely completion of tasks.
Parameters
Debug
Check if Airflow workers are running
Check the logs of a worker pod
Check if the worker pod has enough resources
Check if there are any failed tasks
Check the logs of a failed task
Check if there are any pending tasks
Check if there are any issues with the Airflow scheduler
Check the logs of the Airflow scheduler
Check if there are any issues with the Airflow webserver
Check the logs of the Airflow webserver
Repair
Learn more
Related Runbooks
Check out these related runbooks to help you debug and resolve similar issues.