-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GPU HW Counters via rocprofiler #84
Merged
Merged
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
jrmadsen
added
enhancement
New feature or request
perfetto
Issue affects/involves perfetto features/capabilities
timemory
Issue affects/involves timemory features/capabilities
libomnitrace
Involves omnitrace library
omnitrace-avail
Involves the omnitrace-avail executable (info tool)
roctracer
GPU kernel tracing
cmake
Modifies the CMake build system
submodule
Updates a git submodule
configuration
Changes/involves configuration options
rocprofiler
GPU kernel HW counters
labels
Jul 12, 2022
- /opt/rocm/{rocprofiler,roctracer} path is deprecated so tweak search procedure
- rocm_metrics() - minor cleanup
- hw_counter categories - init rocm
…acer.* - relocated library/components/rocprofiler.* to library/rocprofiler.* - cleaned up rocprofiler.hpp - added perfetto output of rocprofiler - added timemory output of rocprofiler - renamed omni.roctracer thread to roctracer.hip - added roctracer.hsa thread name - updated timemory submodule to support std::variant - updated timemory submodule to support = in config value - updated timemory submodule to support standalone storage - updated timemory submodule to support new hw counter apis - updated timemory submodule to prevent label/description caching in data_tracker
jrmadsen
force-pushed
the
rocprofiler-support
branch
from
July 13, 2022 02:29
9c720d3
to
14b63fc
Compare
- Add -c command-line option for --categories - support verbosity
- throw exceptions to avoid aborting on HSA_STATUS_ERROR_NOT_INITIALIZED when advantageous - removed duplicate specialization of is_available for component::rocprofiler
- std::stringstream from initializer list would use explicit constructor
jrmadsen
changed the title
[WIP] GPU HW Counters via rocprofiler
GPU HW Counters via rocprofiler
Jul 17, 2022
- added using statements from timemory - tweaked the main and thread bundle names - fixed timemory header includes
jrmadsen
force-pushed
the
rocprofiler-support
branch
from
July 18, 2022 00:28
ab0fa76
to
57952e1
Compare
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
cmake
Modifies the CMake build system
configuration
Changes/involves configuration options
enhancement
New feature or request
libomnitrace
Involves omnitrace library
omnitrace-avail
Involves the omnitrace-avail executable (info tool)
perfetto
Issue affects/involves perfetto features/capabilities
rocprofiler
GPU kernel HW counters
roctracer
GPU kernel tracing
submodule
Updates a git submodule
timemory
Issue affects/involves timemory features/capabilities
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
rocprof-SQ_WAVES-device-0.txt
OMNITRACE_ROCM_EVENTS
OMNITRACE_PAPI_EVENTS
):device=N
to entries for collection on specific devicesomnitrace-avail -H --categories GPU -d
OMNITRACE_USE_ROCPROFILER
OMNITRACE_ROCM_EVENTS
Relevant Environment Variables
OMNITRACE_ROCPROFILER_LIBRARY
libomnitrace.so
ROCM_PATH
/opt/rocm
ROCP_METRICS
:<ROCM_PATH>/rocprofiler/lib/metrics.xml
Config file example
OMNITRACE_ROCM_EVENTS = GRBM_COUNT GPUBusy SQ_WAVES SQ_INSTS_VALU VALUInsts TCC_HIT_sum TA_TA_BUSY[0]:device=0 TA_TA_BUSY[11]:device=0
omnitrace-avail examples