Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add a new NVTX range for task GPU ownership #11596

Open
wants to merge 1 commit into
base: branch-24.12
Choose a base branch
from

Conversation

jihoonson
Copy link
Collaborator

If you have ever been curious about the semaphore-based GPU concurrency control mechanism, you may have wondered at some point exactly what tasks are holding the semaphore at a given point of time during query processing. This PR can help you in that case. This PR adds a new NVTX range that shows what task owns GPU in the nsys profile result. This feature is off by default (since I don't think it's always useful), and can be enabled by setting spark.rapids.sql.traceTaskGpuOwnership = true. The screenshot below shows an example nsys result with the new semaphore ranges. The orange boxes with Sem-${taskAttemptId} represent the ranges in which each spark task was holding the semaphore.

Screenshot 2024-10-11 at 4 09 11 PM

@jihoonson jihoonson changed the title Add a new nvtx range for task GPU ownership Add a new NVTX range for task GPU ownership Oct 11, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant