[SYCL][HIP] Memory access fault by GPU on address (nil) #4688

Closed
@zjin-lcf

https://github.com/zjin-lcf/oneAPI-DirectProgramming/tree/master/ced-sycl

./ced -a 0

Running the program produces the following error on an AMD GPU. Could you reproduce it? Thanks.

Memory access fault by GPU node-2 (Agent handle: 0x51b550) on address (nil). Reason: Page not present or supervisor privilege.

gdb message:
Thread 2 "ced" received signal SIGABRT, Aborted.
[Switching to Thread 0x7ffff2ec5700 (LWP 2852315)]
0x00007ffff766c18b in raise () from /lib/x86_64-linux-gnu/libc.so.6
(gdb) bt
#0 0x00007ffff766c18b in raise () from /lib/x86_64-linux-gnu/libc.so.6
#1 0x00007ffff764b859 in abort () from /lib/x86_64-linux-gnu/libc.so.6
#2 0x00007ffff3344a7f in rocr::core::Runtime::VMFaultHandler(long, void*) () from /opt/rocm/hip/lib/../../lib/libhsa-runtime64.so.1
#3 0x00007ffff334753b in rocr::core::Runtime::AsyncEventsLoop(void*) () from /opt/rocm/hip/lib/../../lib/libhsa-runtime64.so.1
#4 0x00007ffff32ef497 in rocr::os::ThreadTrampoline(void*) () from /opt/rocm/hip/lib/../../lib/libhsa-runtime64.so.1
#5 0x00007ffff7fa1609 in start_thread () from /lib/x86_64-linux-gnu/libpthread.so.0
#6 0x00007ffff7748293 in clone () from /lib/x86_64-linux-gnu/libc.so.6

Labels: bug, hip, runtime
