Scheduler pod hang when K8s API call fail #28328
Labels
area:core
good first issue
kind:bug
This is a clearly a bug
provider:cncf-kubernetes
Kubernetes provider related issues
Milestone
Apache Airflow version
Other Airflow 2 version (please specify below)
What happened
Airflow version:
2.3.4
I have deployed airflow with the official Helm in K8s with
KubernetesExecutor
. Sometimes the scheduler hang when calling K8s API. The log:Then the executor process was killed and the pod was still running. But the scheduler does not work.
After restarting, the scheduler worked usually.
What you think should happen instead
When the error occurs, the executor needs to auto restart or the scheduler should be killed.
How to reproduce
No response
Operating System
Debian GNU/Linux 11 (bullseye)
Versions of Apache Airflow Providers
No response
Deployment
Official Apache Airflow Helm Chart
Deployment details
No response
Anything else
No response
Are you willing to submit PR?
Code of Conduct
The text was updated successfully, but these errors were encountered: