Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Report HSA_OPS activities using the ROCr driver_node_id instead of th…
…e device's index When multiple ranks are used, each rank's first logical device always has GPU ID 0, regardless of which physical device is selected with CUDA_VISIBLE_DEVICES. Because of this, when merging trace files from multiple ranks, GPU IDs from different processes may overlap. The long term solution is to use the KFD's gpu_id which is stable across APIs and processes. Unfortunately the gpu_id is not yet exposed by the ROCr, so for now use the driver's node id. Change-Id: I2f5af8d2a7e8a89efeb5e0a1b86bdfa547b25fc8 (cherry picked from commit 799f032)
- Loading branch information