Smart Reruns from Failures/Errors #4017

sungchun12 · 2021-10-06T21:46:30Z

resolves #3891

Description

Instead of having to manually override dbt commands with the models in scope that need to rerun, dbt can now parse the run_results.json to do so automatically.
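As a rough illustration of the idea (not the code in this PR), the sketch below reads a run_results.json-style payload and collects the unique_id of every node with a given status. The helper name `nodes_with_status`, the project name, and the sample payload are hypothetical; it assumes each entry under `results` carries `unique_id` and `status` keys.

```python
import json

# Hypothetical, trimmed-down stand-in for the contents of run_results.json.
SAMPLE_RUN_RESULTS = json.dumps({
    "results": [
        {"unique_id": "model.my_project.stg_orders", "status": "success"},
        {"unique_id": "model.my_project.orders", "status": "error"},
        {"unique_id": "model.my_project.revenue", "status": "skipped"},
    ]
})


def nodes_with_status(run_results_text, status):
    """Return the unique_ids of all results whose status equals `status`."""
    payload = json.loads(run_results_text)
    return [
        result["unique_id"]
        for result in payload.get("results", [])
        if result.get("status") == status
    ]


# Nodes a rerun would start from when selecting on result:error.
errored = nodes_with_status(SAMPLE_RUN_RESULTS, "error")
print(errored)
```

With a list like `errored` in hand, the remaining work is graph selection, which dbt already does for other selection methods.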

Example Usage
First dbt job model has an error.

Rerun dbt job errors and downstream nodes.

Result Selectors in Scope
Note: This pull request does NOT limit which selectors the user can call from the list below. There are use cases for analyzing the nodes with any of these result statuses (e.g., dbt ls --select result:skipped).

result:fail
result:error
result:warn
result:success
result:skipped
result:pass
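To make the shape of these selectors concrete, here is a minimal sketch of splitting a string such as `result:error+` into a status and a "include downstream nodes" flag. This is an illustration only, not dbt's selector parser: `ResultSelector` and `parse_result_selector` are hypothetical names, and the sketch handles only a trailing `+`.

```python
from typing import NamedTuple

# The statuses listed in this PR description.
RESULT_STATUSES = {"fail", "error", "warn", "success", "skipped", "pass"}


class ResultSelector(NamedTuple):
    status: str
    include_descendants: bool


def parse_result_selector(raw):
    """Split e.g. 'result:error+' into its status and trailing '+' flag."""
    method, sep, value = raw.partition(":")
    if method != "result" or not sep:
        raise ValueError(f"not a result selector: {raw!r}")
    include_descendants = value.endswith("+")
    status = value[:-1] if include_descendants else value
    if status not in RESULT_STATUSES:
        raise ValueError(f"unknown result status: {status!r}")
    return ResultSelector(status, include_descendants)


print(parse_result_selector("result:error+"))
print(parse_result_selector("result:skipped"))
```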

Testing Approach
We tested the result selectors relevant to the commands in scope. For example, you'll see a test for dbt run --select result:error, but you won't see a test for dbt run --select result:fail. This is because a fail status never appears in the run_results.json that dbt run produces.

For concurrent selector tests, we tested the most common use cases below.

dbt run --select state:modified+ result:error+ --defer --state ./target
- Rerun all my erroneous models AND run changes I made concurrently that may relate to the erroneous models for downstream use
dbt build --select state:modified+ result:error+ --defer --state ./target
- Rerun and retest all my erroneous models AND run changes I made concurrently that may relate to the erroneous models for downstream use
dbt build --select state:modified+ result:error+ result:fail+ --defer --state ./target
- Rerun all my erroneous models AND all my failed tests
- Rerun all my erroneous models AND run changes I made concurrently that may relate to the erroneous models for downstream use
- There's a failed test that's unrelated to modified or error nodes (think: a source test that needs a refreshed data load in order to pass)
dbt test --select result:fail --exclude <example test> --defer --state ./target
- Rerun all my failed tests and exclude tests that I know will still fail
- This can apply to updates in source data during the "EL" process that need to be rerun after they are refreshed
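The concurrent cases above rely on two behaviors: a trailing `+` pulls in downstream nodes, and space-separated selectors are combined as a union. A minimal sketch of that combination follows, using a hypothetical five-node DAG; the node names, `CHILDREN`, and `with_descendants` are made up for illustration and this is not how dbt implements selection internally.

```python
from collections import deque

# Hypothetical DAG: each node maps to its direct children.
CHILDREN = {
    "stg_orders": ["orders"],
    "orders": ["revenue"],
    "stg_customers": ["customers"],
    "customers": [],
    "revenue": [],
}


def with_descendants(seeds, children):
    """Return the seed nodes plus every node reachable downstream of them."""
    selected = set(seeds)
    queue = deque(seeds)
    while queue:
        node = queue.popleft()
        for child in children.get(node, []):
            if child not in selected:
                selected.add(child)
                queue.append(child)
    return selected


modified = {"stg_customers"}  # stands in for what state:modified matched
errored = {"orders"}          # stands in for what result:error matched

# "state:modified+ result:error+" behaves like the union of both expansions.
selection = with_descendants(modified, CHILDREN) | with_descendants(errored, CHILDREN)
print(sorted(selection))
```

In this toy graph `stg_orders` is left out: it is neither modified nor errored, and it sits upstream of the errored node.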

Checklist

I have signed the CLA
I have run this code in development and it appears to resolve the stated issue
This PR includes tests, or tests are not required/relevant for this PR
I have updated the CHANGELOG.md and added information about my change to the "dbt next" section.

sungchun12 · 2021-10-07T18:28:23Z

I noticed there's a test error that's unrelated to the changes made in this pull request.

It succeeded in a previous run with the same command syntax: https://github.com/dbt-labs/dbt/runs/3820843861#step:9:1319

sungchun12 · 2021-10-07T18:45:28Z

The same thing is happening in the latest merge to develop: https://github.com/dbt-labs/dbt/runs/3829307022#step:9:1227

sungchun12 · 2021-10-07T18:49:18Z

@fclesio, would love for you to try this out!

jtcohen6

@sungchun12 @matt-winkler This looks fantastic. I especially appreciate the thoroughness with which you've tested and vetted the array of anticipated use cases.

@leahwicz I contributed some of the initial code for this :), so I wouldn't hate it if someone on the Core team could give this a quick code review, as a complement to my functional review here. (This definitely falls under the Execution sub-team.) At the same time, it doesn't touch that much code, builds pretty firmly on top of the precedent set by the state: selection method, and the integration testing looks solid. I could go either way, depending on your comfort level.

(For the moment, I'm just going to commit a quick update to the changelog, since we've released v1.0.0-b1. I have every intention of including this feature in v1.0.0-b2.)

jtcohen6

(quick hits to get the tests passing)

test/integration/062_defer_state_test/test_run_results_state.py

Co-authored-by: Jeremy Cohen <[email protected]>

sungchun12 · 2021-10-13T17:45:24Z

Looks like the adapter tests have changed since the previous commit because they're being skipped now. Let me know if I need to pass a parameter somewhere to get things running.

gshank

The code here is not that big and is limited to the new feature implemented in this pull request, so I think the risk is pretty low. The new code follows the standard of the existing code quite well.

I'd like to have the commits squashed into one and rebased on current main before we merge this though :)

jtcohen6

Thanks so much for this contribution, @sungchun12 @matt-winkler!

@gshank I agree re: adding this to main as a single commit. I think GitHub's squash-merge capabilities could get the job done.

* Add result: selection method
* make a copy for modified state test suite
* test case notes
* remove macro tests
* add a test setup command
* copy run results state
* passing test case, todos, split work
* clean up result:success test case
* start with build command and remove previous state where needed
* add error result selector tests for seed
* add another error seed test case
* remove todo
* passing build result:error tests
* single failure build test
* add passing test
* fix node assertions for tests
* fix tests
* draft fail+ tests
* add severity to test
* result:warn passing test
* result:warn+ passing tests
* add passing concurrent selector test
* add downstream flag
* add comment
* passing test
* fix test for dynamic node selection
* add build concurrent selector passing test
* add run test cases
* add integration tests for dbt test
* fix formatting
* rename test
* remove extra comments
* add extra newline
* add concurrent selector test / build cases
* clean up todos
* test all nodes
* DRY rebuild code
* test all nodes
* add TODO update assertion code
* cleaner assert code
* fix this test to have a fixed set
* more cleanup
* add changelog
* update concurrent selectors on dbt test
* remove todo
* Update changelog
* Apply suggestions from code review

Co-authored-by: Jeremy Cohen <[email protected]>

* fix changelog
* fix Contributors headers

Co-authored-by: Jeremy Cohen <[email protected]>
Co-authored-by: Matt Winkler <[email protected]>

automatic commit by git-black, original commits: 97f31c8

jtcohen6 and others added 30 commits September 26, 2021 13:28

Add result: selection method

c23198f

Merge branch 'develop' of https://github.com/dbt-labs/dbt into feature/smart-rerun-from-failures

f1fbea9

make a copy for modified state test suite

4fd015f

test case notes

9855502

remove macro tests

1830ec0

add a test setup command

3c225f6

copy run results state

d18e8bf

passing test case, todos, split work

4b41483

clean up result:success test case

fbf5d23

start with build command and remove previous state where needed

5533172

add error result selector tests for seed

d2bc531

add another error seed test case

8992f02

remove todo

6413ab0

passing build result:error tests

46c09a7

single failure build test

e8c9fcf

Merge branch 'develop' of https://github.com/dbt-labs/dbt into feature/smart-rerun-from-failures

19c9fcf

add passing test

f48aa0b

fix node assertions for tests

eefdda5

fix tests

f2f4567

draft fail+ tests

13fd182

add severity to test

6fe4efd

result:warn passing test

00dd7fb

result:warn+ passing tests

1f55510

Merge branch 'develop' of https://github.com/dbt-labs/dbt into feature/smart-rerun-from-failures

0959e0f

add passing concurrent selector test

5d8cee4

add downstream flag

fbb8b9e

add comment

d95ca5c

passing test

ba20b48

fix test for dynamic node selection

b8e7ca7

add build concurrent selector passing test

5165749

Merge branch 'develop' of https://github.com/dbt-labs/dbt into feature/smart-rerun-from-failures

c209c25

sungchun12 marked this pull request as ready for review October 7, 2021 18:34

sungchun12 requested a review from jtcohen6 October 7, 2021 18:34

sungchun12 mentioned this pull request Oct 8, 2021

Result Selectors for Smarter Reruns for Failures/Errors dbt-labs/docs.getdbt.com#853

Merged

4 tasks

sungchun12 added 3 commits October 11, 2021 12:47

Merge branches 'feature/smart-rerun-from-failures' and 'develop' of https://github.com/dbt-labs/dbt into feature/smart-rerun-from-failures

165d831

Merge branch 'develop' of https://github.com/dbt-labs/dbt into feature/smart-rerun-from-failures

7b6e973

Merge branch 'develop' of https://github.com/dbt-labs/dbt into feature/smart-rerun-from-failures

87b5b2f

jtcohen6 mentioned this pull request Oct 13, 2021

Selection method based on source freshness: new max_loaded_at, new data #4050

Closed

jtcohen6 reviewed Oct 13, 2021

Update changelog

98de45b

jtcohen6 reviewed Oct 13, 2021

Apply suggestions from code review

624c2b4

Co-authored-by: Jeremy Cohen <[email protected]>

sungchun12 requested a review from jtcohen6 October 13, 2021 17:45

leahwicz requested a review from gshank October 14, 2021 13:31

gshank requested changes Oct 15, 2021

sungchun12 added 3 commits October 15, 2021 16:24

Merge branch 'main' of https://github.com/dbt-labs/dbt into feature/smart-rerun-from-failures

4b44a85

fix changelog

87c1023

fix Contributors headers

b992a0d

jtcohen6 approved these changes Oct 18, 2021

jtcohen6 requested a review from gshank October 18, 2021 07:42

Merge branch 'main' of https://github.com/dbt-labs/dbt into feature/smart-rerun-from-failures

0250e4f

gshank approved these changes Oct 18, 2021

jtcohen6 merged commit 97f31c8 into main Oct 18, 2021

jtcohen6 deleted the feature/smart-rerun-from-failures branch October 18, 2021 14:43

joellabes mentioned this pull request Nov 4, 2021

Add available result: options to state selectors dbt-labs/docs.getdbt.com#897

Merged

3 tasks
