
Missing GPU kernels when using @profile and -b flag #312

Open
dwchang79 opened this issue Oct 2, 2023 · 2 comments
dwchang79 commented Oct 2, 2023

I am using the @profile decorator and the -b flag to exclude the initial training section of an ML workload so that only the inference part is profiled. That part works, but the GPU kernels and related information are now missing: the call stack shows the functions, but they are not linked to the GPU, no GPU devices are shown, and nothing is shown running on them.

I have attached two screenshots. The first shows the entire run (without the profile flags), where the GPU section appears at the bottom as "HIP Activity Device 2, Queue 0". The second shows a run where only the inference part is profiled, and the GPU information is gone.

Thank you.
[Screenshot: Complete run]
[Screenshot: Inference-only run]

@jrmadsen
Collaborator

Try running the command as omnitrace-run -- python3 -m omnitrace -b -- <script> <script-args>. I suspect that the later initialization of omnitrace caused by @profile results in omnitrace being initialized after the HIP runtime, so omnitrace never gets registered as a profiling tool with the HIP runtime.
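As context for the decorator usage being discussed, a minimal sketch of a workload structured this way is below. It assumes (as with line_profiler-style tools) that omnitrace's Python instrumentation injects `profile` as a builtin and, with the -b flag, profiles only decorated functions; the fallback decorator is only there so the script also runs under plain `python3`. The function names and the training/inference split are illustrative, not from the original report.

```python
# Minimal sketch: only the inference function is decorated, so the
# training section is excluded from the profile (assumed -b behavior).
try:
    profile  # assumed to be injected as a builtin by the profiler
except NameError:
    def profile(func):
        # No-op fallback so the script runs without omnitrace.
        return func


def train(steps=1000):
    # Training section: intentionally NOT decorated, hence excluded.
    return sum(i * i for i in range(steps))


@profile
def infer(x):
    # Inference section: decorated, hence profiled.
    return x * 2


if __name__ == "__main__":
    train()
    print(infer(21))
```

Run under the profiler with the command suggested above, e.g. `omnitrace-run -- python3 -m omnitrace -b -- script.py` (hypothetical script name).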

@ppanchad-amd

Hi @dwchang79. Has your issue been resolved? If so, please close the ticket. Thanks!
