v1.3.0: GPU HW Counters, RCCL, ROCTx, Python User Regions
Notable New Features
- Support for collecting GPU HW counters #84
OMNITRACE_ROCM_EVENTS
configuration variable
- Support for ROCTx #87
OMNITRACE_USE_ROCTX
configuration variable
- ROCm Collective Communication Library (RCCL) Support #93
OMNITRACE_USE_RCCLP
configuration variable
- Python User API #57
What's Changed
- Fix empty OMNITRACE_CONFIG_FILE and suppressing config and parsing by @jrmadsen in #81
- pthread_rwlock deadlock fix by @jrmadsen in #82
- Improved sampling performance by @jrmadsen in #74
- Combine ubuntu-focal-external.yml and ubuntu-focal.yml by @jrmadsen in #83
- GPU HW Counters via rocprofiler by @jrmadsen in #84
- Fix statistics type and use feature name indexes by @jrmadsen in #85
- Unified setup_environ b/t libomni and libomni-dl by @jrmadsen in #86
- Support ACTIVITY_DOMAIN_ROCTX by @jrmadsen in #87
- Fixes missing call to mpi_gotcha::update() by @jrmadsen in #88
- Support for disabling perfetto categories by @jrmadsen in #72
- Remove get_perfetto_output_filename().clear() by @jrmadsen in #89
- fix omnitrace print-* with libraries by @jrmadsen in #94
- Sampling Tweaks: disable sampling itimer by @jrmadsen in #95
- Replaces OMNITRACE_CONDITIONAL_BASIC_PRINT with OMNITRACE_VERBOSE by @jrmadsen in #97
- omnitrace builds timemory with TIMEMORY_USE_ROOFLINE=0 by @jrmadsen in #96
- Updated features docs [skip ci] by @jrmadsen in #98
- Fix warnings + Werror by @jrmadsen in #101
- Sampling use SIGRTMIN + N signals by @jrmadsen in #104
- Increase build timeouts by @jrmadsen in #107
- Updated documentation for hardware counters by @jrmadsen in #108
- Pthread category region by @jrmadsen in #102
- Release 1.3.0 preparations by @jrmadsen in #109
- Added new tests validating gotcha wrappers by @jrmadsen in #105
- Fix reliability when KOKKOS_PROFILE_LIBRARY is set in env by @jrmadsen in #103
- exit gotcha + remove DelayedInit state + rocm-smi + cleanup by @jrmadsen in #110
- Docker + build-release.sh + PAPI.cmake by @jrmadsen in #111
- Fix PAPI cpack packaging by @jrmadsen in #112
- Minor fixes by @jrmadsen in #113
- ubuntu cpack was building for rocm 5.2 twice by @jrmadsen in #114
- RCCL support by @jrmadsen in #93
- Fix dockerfile.opensuse by @jrmadsen in #115
- User regions in Python by @jrmadsen in #57
Full Changelog: v1.2.0...v1.3.0