You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Attempting to profile a SYCL application using NVIDIA Nsight Compute (NCU) results in an error preventing kernel profiling. The issue persists even when LD_PRELOAD=/usr/lib64/libcuda.so is set.
==ERROR== Failed to prepare kernel for profiling
==ERROR== Unknown Error on device 0.
==ERROR== Failed to profile "const:" in process <PID>
==PROF== Trying to shutdown target application
==ERROR== The application returned an error code (9).
==ERROR== An error occurred while trying to profile.
==WARNING== No kernels were profiled.
This occurs both with and without LD_PRELOAD=/usr/lib64/libcuda.so.
Expected Behavior
Nsight Compute should successfully profile the SYCL application.
Environment
OS: openSUSE Leap 15.6
Target device and vendor: NVIDIA H100 80GB HBM3
DPC++ version: clang version 20.0.0git (https://github.com/intel/llvm fb888b857f0e3ef31a474f51d8a6018eeb521d99)
Profiler:Nsight Compute (ncu)
Application: test.e (SYCL hello world or any other SYCL app)
Dependencies version:
$>nvidia-smi
Thu Feb 13 18:06:05 2025
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 565.57.01 Driver Version: 565.57.01 CUDA Version: 12.7 |
|-----------------------------------------+------------------------+----------------------+
| GPU Name Persistence-M | Bus-Id Disp.A | Volatile Uncorr. ECC |
| Fan Temp Perf Pwr:Usage/Cap | Memory-Usage | GPU-Util Compute M. |
| | | MIG M. |
|=========================================+========================+======================|
| 0 NVIDIA H100 80GB HBM3 On | 00000000:1C:00.0 Off | 0 |
| N/A 22C P0 71W / 700W | 1MiB / 81559MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+------------------------+----------------------+
| 1 NVIDIA H100 80GB HBM3 On | 00000000:2B:00.0 Off | 0 |
| N/A 21C P0 69W / 700W | 1MiB / 81559MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+------------------------+----------------------+
| 2 NVIDIA H100 80GB HBM3 On | 00000000:AC:00.0 Off | 0 |
| N/A 20C P0 69W / 700W | 1MiB / 81559MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+------------------------+----------------------+
| 3 NVIDIA H100 80GB HBM3 On | 00000000:BC:00.0 Off | 0 |
| N/A 19C P0 69W / 700W | 1MiB / 81559MiB | 0% Default |
| | | Disabled |
+-----------------------------------------+------------------------+----------------------+
+-----------------------------------------------------------------------------------------+
| Processes: |
| GPU GI CI PID Type Process name GPU Memory |
| ID ID Usage |
|=========================================================================================|
| No running processes found |
+-----------------------------------------------------------------------------------------+
Describe the bug
Description
Attempting to profile a SYCL application using NVIDIA Nsight Compute (NCU) results in an error preventing kernel profiling. The issue persists even when
LD_PRELOAD=/usr/lib64/libcuda.so
is set.To reproduce
Steps to Reproduce
LD_PRELOAD
:Observed Behavior
LD_PRELOAD=/usr/lib64/libcuda.so
.Expected Behavior
Environment
clang version 20.0.0git (https://github.com/intel/llvm fb888b857f0e3ef31a474f51d8a6018eeb521d99)
ncu
)test.e
(SYCL hello world or any other SYCL app)Additional context
Possible Related Issues
The text was updated successfully, but these errors were encountered: