[compile] Show breakdown of graph break #6601

Merged: 17 commits, Oct 14, 2024

@delock (Collaborator) commented Oct 6, 2024

This PR extends #6570 by showing a breakdown of graph breaks, so we can see how the graph breaks are distributed among different reasons. An example of the graph break output can be seen in the following workflow run: https://github.com/microsoft/DeepSpeed/actions/runs/11199157962
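
For readers unfamiliar with the idea, here is a minimal sketch of what a per-reason breakdown could look like. It is not the code from this PR. It assumes the counters are a nested mapping of the form `{category: Counter}` keyed by break reason under a `"graph_break"` category, which is how `torch._dynamo.utils.counters` is commonly organized; a plain `defaultdict(Counter)` stands in for it here so the example runs without PyTorch. The helper names (`snapshot`, `graph_break_breakdown`, `format_breakdown`) and the reason strings are hypothetical.

```python
import collections
import copy


def snapshot(counters):
    """Deep-copy a nested {category: Counter} mapping so it can be diffed later."""
    return copy.deepcopy(counters)


def graph_break_breakdown(before, after, category="graph_break"):
    """Return {reason: count} for breaks recorded between two snapshots.

    Reasons are ordered from most to least frequent.
    """
    delta = collections.Counter(after.get(category, {}))
    delta.subtract(before.get(category, {}))
    # Keep only reasons whose count actually increased.
    return {reason: n for reason, n in delta.most_common() if n > 0}


def format_breakdown(breakdown):
    """Render the breakdown as a short plain-text report."""
    total = sum(breakdown.values())
    lines = [f"graph breaks: {total}"]
    for reason, n in breakdown.items():
        lines.append(f"  {n:>3}  {reason}")
    return "\n".join(lines)


# Stand-in for torch._dynamo.utils.counters (assumption: same nested layout).
counters = collections.defaultdict(collections.Counter)
before = snapshot(counters)

# Pretend a compiled step recorded these breaks (hypothetical reasons).
counters["graph_break"]["hypothetical reason A"] += 2
counters["graph_break"]["hypothetical reason B"] += 1

breakdown = graph_break_breakdown(before, counters)
print(format_breakdown(breakdown))
```

Diffing two snapshots rather than reading the counters directly keeps the report scoped to a single test or training step, even when the counters accumulate over the whole process.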

@delock delock requested a review from tohtana as a code owner October 8, 2024 13:42
@tohtana (Contributor) left a comment

Thank you @delock, this is a very useful feature! I left a comment, but it looks good to me.

@@ -14,22 +14,11 @@

torch._dynamo.config.cache_size_limit = 100

import collections


def get_dynamo_stats():
    # TODO: consider deepcopy'ing the entire counters struct and
    # adding a helper to do subtraction on it
Contributor

Is this comment still valid?

Collaborator, Author

Thanks for the comment. I think these TODO comments should be removed; will update.

@delock (Collaborator, Author) commented Oct 10, 2024

Hi @tohtana, the unneeded comments have been removed. I also made the workflow count graph breaks for zero2; there are no graph breaks for zero2 in this simple case.

https://github.com/microsoft/DeepSpeed/actions/runs/11269655884?pr=6601

@tohtana tohtana added this pull request to the merge queue Oct 11, 2024
@github-merge-queue github-merge-queue bot removed this pull request from the merge queue due to failed status checks Oct 11, 2024
@tohtana tohtana added this pull request to the merge queue Oct 14, 2024
Merged via the queue into master with commit cf41e8c Oct 14, 2024
12 checks passed