Add test_all_subset_models mode #2428

kwasd · 2024-10-07T11:52:16Z

kwasd · 2024-10-07T13:05:01Z

Test runs (still pending):

.github/workflows/e2e-performance.yml

.github/workflows/e2e-accuracy.yml

pbchekin · 2024-10-07T16:35:33Z

.github/workflows/e2e-reusable.yml

@@ -225,7 +229,7 @@ jobs:
            bash -e $GITHUB_WORKSPACE/scripts/inductor_xpu_test.sh ${{ inputs.suite }} ${{ inputs.dtype }} ${{ inputs.mode }} ${{ inputs.test_mode }} xpu 0 static 1 0 ${{ inputs.only_one_model }}
          elif [[ "${{ inputs.models }}" == "subset" ]]; then
            while read model; do
-              bash -e $GITHUB_WORKSPACE/scripts/inductor_xpu_test.sh ${{ inputs.suite }} ${{ inputs.dtype }} ${{ inputs.mode }} ${{ inputs.test_mode }} xpu 0 static 1 0 $model
+              bash -e $GITHUB_WORKSPACE/scripts/inductor_xpu_test.sh ${{ inputs.suite }} ${{ inputs.dtype }} ${{ inputs.mode }} ${{ inputs.test_mode }} xpu 0 static 1 0 $model || ${{ inputs.test_all_subset_models }}


This won't work as expected: the script returns 0 even if some accuracy tests or performance runs failed. You need to check the report (CSV file) and verify that it contains every item from the corresponding subset and that every item has passed. See https://github.com/intel/intel-xpu-backend-for-triton/blob/main/.github/workflows/build-test-reusable.yml#L179-L181 for the idea.
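As a rough illustration of the check described above, the following sketch verifies that every model named in a subset file has a passing row in a CSV report. It is not the workflow's actual code: the function name `check_subset_report`, the one-model-per-line subset file, and the row layout (`<device>,<model>,<batch>,<status>` with status `pass`) are assumptions made for the example.

```shell
#!/usr/bin/env bash
# Sketch: verify that every model listed in a subset file has a passing row in a CSV report.
# Assumptions (not confirmed by the workflow): one model name per line in the subset file,
# and a report row looks like "<device>,<model>,<batch>,<status>" with status "pass".

# check_subset_report SUBSET_FILE CSV_FILE
# Prints each model that is missing from the report or did not pass.
# Returns 1 if there is at least one such model, 0 otherwise.
check_subset_report() {
  local subset_file=$1 csv_file=$2 failed=0 model
  while IFS= read -r model || [[ -n "$model" ]]; do
    if [[ -z "$model" ]]; then
      continue   # skip blank lines
    fi
    # Compare whole comma-separated fields so "bert" does not match "bert_large".
    if ! awk -F, -v m="$model" '
      {
        has_model = 0
        has_pass = 0
        for (i = 1; i <= NF; i++) {
          if ($i == m) has_model = 1
          if ($i == "pass") has_pass = 1
        }
        if (has_model && has_pass) found = 1
      }
      END { exit (found ? 0 : 1) }' "$csv_file"; then
      echo "not passed or missing: $model"
      failed=1
    fi
  done < "$subset_file"
  return "$failed"
}
```

Because the function returns a non-zero status when any model is missing or failed, a caller could decide whether that status should fail the step or only be reported. A missing report file also counts as a failure here, since `awk` exits with an error when it cannot open the file.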


kwasd · 2024-10-08T11:04:00Z

Test runs:
true: https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/11234055787
false: https://github.com/intel/intel-xpu-backend-for-triton/actions/runs/11234070592

pbchekin · 2024-10-08T15:45:16Z

.github/workflows/e2e-reusable.yml

@@ -226,6 +230,7 @@ jobs:
          elif [[ "${{ inputs.models }}" == "subset" ]]; then
            while read model; do
              bash -e $GITHUB_WORKSPACE/scripts/inductor_xpu_test.sh ${{ inputs.suite }} ${{ inputs.dtype }} ${{ inputs.mode }} ${{ inputs.test_mode }} xpu 0 static 1 0 $model
+              grep ,$model, $WORKSPACE/inductor_log/*/*/*.csv | grep -q ,pass, || ${{ inputs.test_all_subset_models }}


This will work only for accuracy. Also, for accuracy it is possible that the CSV file does not exist (a major E2E failure), in which case the one-liner above won't work. I think you need a more sophisticated script to handle accuracy/performance and all corner cases. The idea is that you need to check that every model from the subset was executed successfully.
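One possible shape for such a script is sketched below. It is only an illustration under stated assumptions: the function names `model_ok` and `check_subset` are hypothetical, and so is the report layout (a header row with a `name` column plus an `accuracy` column whose passing value is `pass`, or a `speedup` column where a value greater than 0 means the run succeeded). The real reports may differ, so the column names and success criteria would need to be adjusted.

```shell
#!/usr/bin/env bash
# Sketch only. Assumed report layout (hypothetical, adjust to the real files):
#   accuracy:    header has "name" and "accuracy" columns; a passing row has accuracy == "pass"
#   performance: header has "name" and "speedup" columns; a successful row has speedup > 0

# model_ok MODE MODEL CSV...
# Returns 0 if any of the given CSV files has a successful row for MODEL in the given MODE.
model_ok() {
  local mode=$1 model=$2
  shift 2
  local col=accuracy csv
  if [[ "$mode" == performance ]]; then
    col=speedup
  fi
  for csv in "$@"; do
    # Tolerate missing reports, e.g. an unmatched glob after a major failure.
    if [[ ! -f "$csv" ]]; then
      continue
    fi
    if awk -F, -v m="$model" -v col="$col" '
      NR == 1 {
        # Locate the columns by header name instead of hard-coding positions.
        for (i = 1; i <= NF; i++) {
          if ($i == "name") n = i
          if ($i == col) c = i
        }
        next
      }
      n && c && $n == m {
        if (col == "accuracy" && $c == "pass") ok = 1
        if (col == "speedup" && ($c + 0) > 0) ok = 1
      }
      END { exit (ok ? 0 : 1) }' "$csv"; then
      return 0
    fi
  done
  return 1
}

# check_subset MODE SUBSET_FILE CSV...
# Prints one status line per model; returns 1 if any model from the subset did not succeed.
check_subset() {
  local mode=$1 subset_file=$2
  shift 2
  local failed=0 model
  while IFS= read -r model || [[ -n "$model" ]]; do
    if [[ -z "$model" ]]; then
      continue   # skip blank lines
    fi
    if model_ok "$mode" "$model" "$@"; then
      echo "ok: $model"
    else
      echo "FAILED: $model"
      failed=1
    fi
  done < "$subset_file"
  return "$failed"
}
```

In this sketch a model counts as failed when no report exists at all, which covers the missing-CSV case: passing a glob such as `$WORKSPACE/inductor_log/*/*/*.csv` that matches nothing leaves a literal, non-existent path that is simply skipped. Running the check once after the loop over all models, rather than after each model, would also catch models that never produced a row.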

kwasd added 2 commits October 7, 2024 13:51

Add test_all_subset_models mode

1fc325d

Add test_all_subset_models mode

7eb6e85

kwasd requested a review from pbchekin October 7, 2024 13:03

vlad-penkin linked an issue Oct 7, 2024 that may be closed by this pull request

Subset of E2E models for accuracy tests #2246

Open

pbchekin reviewed Oct 7, 2024


kwasd and others added 5 commits October 7, 2024 18:59

Update .github/workflows/e2e-performance.yml

284bf46

Co-authored-by: Pavel Chekin <[email protected]>

Update .github/workflows/e2e-accuracy.yml

c47a9f8

Co-authored-by: Pavel Chekin <[email protected]>

Merge https://github.com/intel/intel-xpu-backend-for-triton into feat…

151e282

…ure/2246-e2e-accuracy-subset

Merge branch 'feature/2246-e2e-accuracy-subset' of https://github.com…

312d793

…/intel/intel-xpu-backend-for-triton into feature/2246-e2e-accuracy-subset

Check models

1818779

kwasd requested a review from pbchekin October 8, 2024 10:33

fix whitespace

3122341

pbchekin reviewed Oct 8, 2024


kwasd marked this pull request as draft October 9, 2024 11:03
