-
Notifications
You must be signed in to change notification settings - Fork 75
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Airflow database CPU usage #23
Comments
We experienced the same problem today. The problem is that it is also activated on the workers whenever a task instance is executed. So for each started task at plugin registration the In A simple solution may be to implement the |
It helps, yes. I applied the same solution locally, it prevents the scheduler calling the plugin. The DB operations are still heavy, but setting the Prometheus scrape interval to 5m causes only ~50% CPU usage spikes instead of 100%, so it's possible to live with that. |
Cool! @zstoth If you think clarifications in README are necessary - feel free to make a PR. Also, I'll add another bug for heavy CPU usage, maybe some extra indexes here and there will help. |
After adding the plugin to our installation the Airflow postgres DB reported ~100% CPU usage until the plugin was removed. This made it impossible for the scheduler to schedule new tasks, Airflow basically stopped functioning.
Our airflow contains around 100 dags, some with a few thousand dag runs - I think reloading this was too much for the DB. Do you know any workaround for this issue?
The text was updated successfully, but these errors were encountered: