Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add GPU performance tips for high performance LLMs with JAX/XLA #674

Merged
merged 19 commits into from
Apr 4, 2024

Conversation

instinct79
Copy link
Contributor

@instinct79 instinct79 commented Apr 1, 2024

Documenting all JAX and XLA flags that we use for high performance LLMs with JAX/XLA.

Will update this periodically, and also link to

Per Yu-Hang's suggestion, will update https://github.com/NVIDIA/JAX-Toolbox/blob/main/README.md#environment-variables to this page as well, once the PR is merged.

@instinct79
Copy link
Contributor Author

@abhinavgoel95, can you please review and provide feedback ?

@abhinavgoel95 abhinavgoel95 self-assigned this Apr 1, 2024
rosetta/docs/GPU_performance.md Outdated Show resolved Hide resolved
rosetta/docs/GPU_performance.md Show resolved Hide resolved
rosetta/docs/GPU_performance.md Outdated Show resolved Hide resolved
rosetta/docs/GPU_performance.md Show resolved Hide resolved
rosetta/docs/GPU_performance.md Show resolved Hide resolved
abhinavgoel95
abhinavgoel95 previously approved these changes Apr 2, 2024
Copy link
Collaborator

@nouiz nouiz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Happy to seem the flag being better document. Some small comments and suggestion.

rosetta/docs/GPU_performance.md Show resolved Hide resolved
rosetta/docs/GPU_performance.md Outdated Show resolved Hide resolved
rosetta/docs/GPU_performance.md Outdated Show resolved Hide resolved
rosetta/docs/GPU_performance.md Outdated Show resolved Hide resolved
rosetta/docs/GPU_performance.md Outdated Show resolved Hide resolved
rosetta/docs/GPU_performance.md Outdated Show resolved Hide resolved
rosetta/docs/GPU_performance.md Outdated Show resolved Hide resolved
rosetta/docs/GPU_performance.md Outdated Show resolved Hide resolved
rosetta/docs/GPU_performance.md Outdated Show resolved Hide resolved
rosetta/docs/GPU_performance.md Show resolved Hide resolved
rosetta/docs/GPU_performance.md Outdated Show resolved Hide resolved
rosetta/docs/GPU_performance.md Outdated Show resolved Hide resolved
@instinct79
Copy link
Contributor Author

@abhinavgoel95 or @nouiz, all conversations are resolved. Can one of you please approve, squash commits, and merge ?

@instinct79 instinct79 changed the title [WIP] Add GPU performance tips for high performance LLMs with JAX/XLA Add GPU performance tips for high performance LLMs with JAX/XLA Apr 4, 2024
Copy link
Collaborator

@nouiz nouiz left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Small issues. Most at nits, but the cuda_graph one must be verified.

rosetta/docs/GPU_performance.md Show resolved Hide resolved
rosetta/docs/GPU_performance.md Outdated Show resolved Hide resolved
rosetta/docs/GPU_performance.md Show resolved Hide resolved
rosetta/docs/GPU_performance.md Outdated Show resolved Hide resolved
rosetta/docs/GPU_performance.md Show resolved Hide resolved
@nouiz
Copy link
Collaborator

nouiz commented Apr 4, 2024

@abhinavgoel95 anything else or I can merge now?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants