-
Notifications
You must be signed in to change notification settings - Fork 14.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[AIRFLOW-5660] Attempt to find the task in DB from Kubernetes pod labels #6340
Conversation
Codecov Report
@@ Coverage Diff @@
## master #6340 +/- ##
==========================================
+ Coverage 79.68% 80.06% +0.37%
==========================================
Files 616 616
Lines 35798 35803 +5
==========================================
+ Hits 28527 28666 +139
+ Misses 7271 7137 -134
Continue to review full report at Codecov.
|
LGTM. Rerunning tests. |
Looks like there is a problem with CI? I have no idea why the build failed. |
Co-Authored-By: Ash Berlin-Taylor <[email protected]>
Have you run this in a Kube cluster? I have a feeling that every task will hit the bad path because of the characters in the execution date |
We are running airflow in EKS with this patch applied and it works. We can finally scale to 100,000+ tasks with this. Previously it would choke with 5k-10k tasks. Ideally I would prefer to eliminate the bad path altogether. Currently, it requires the dag writer to write good dag id / task id which isn't a good design. I can only think of 2 solns:
|
_label_safe_datestring_to_datetime and it's reverse means it's probably okay:)
Final question: which log file does this warning show up in? |
My bad, didn't see that.
scheduler logs. They are in the console output of the scheduler pod. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(sorry for leaving this languishing for so long)
…els (#6340) Try to find the task in DB before regressing to searching every task, and explicitly warn about the performance regressions. Co-Authored-By: Ash Berlin-Taylor <[email protected]> (cherry picked from commit 0f9983f)
…els (#6340) Try to find the task in DB before regressing to searching every task, and explicitly warn about the performance regressions. Co-Authored-By: Ash Berlin-Taylor <[email protected]> (cherry picked from commit 0f9983f)
…els (#6340) Try to find the task in DB before regressing to searching every task, and explicitly warn about the performance regressions. Co-Authored-By: Ash Berlin-Taylor <[email protected]> (cherry picked from commit 0f9983f)
…els (apache#6340) Try to find the task in DB before regressing to searching every task, and explicitly warn about the performance regressions. Co-Authored-By: Ash Berlin-Taylor <[email protected]>
…els (apache#6340) Try to find the task in DB before regressing to searching every task, and explicitly warn about the performance regressions. Co-Authored-By: Ash Berlin-Taylor <[email protected]> (cherry picked from commit d8f7d25)
…els (apache#6340) Try to find the task in DB before regressing to searching every task, and explicitly warn about the performance regressions. Co-Authored-By: Ash Berlin-Taylor <[email protected]> (cherry picked from commit 66e2c22e1615c0999747d0c38355163e877872e7)
…els (apache#6340) Try to find the task in DB before regressing to searching every task, and explicitly warn about the performance regressions. Co-Authored-By: Ash Berlin-Taylor <[email protected]> (cherry picked from commit d8f7d25)
…els (apache#6340) Try to find the task in DB before regressing to searching every task, and explicitly warn about the performance regressions. Co-Authored-By: Ash Berlin-Taylor <[email protected]> (cherry picked from commit 66e2c22e1615c0999747d0c38355163e877872e7)
…els (apache#6340) Try to find the task in DB before regressing to searching every task, and explicitly warn about the performance regressions. Co-Authored-By: Ash Berlin-Taylor <[email protected]>
…ing every task
Make sure you have checked all steps below.
Jira
Description
_make_safe_label_value
function we can add a warning if a task_id or dag_id will require hashing (which will slow down processingTests
Commits
Documentation