Skip to content

Segmentation faults on aarch64-linux starting from introduction of extension of KernelAbstractions #677

Closed
@giordano

Description

@giordano

Starting from #667 we have seen lots of segmentation faults on aarch64:

Julia 1.10 - integration - ubuntu-24.04-arm - aarch64 - packaged libReactant - assertions=false - push

Failed to precompile ReactantStatisticsExt [963ed91e-491b-54ce-bb4b-249dcb1ed2bb] to "/home/runner/.julia/compiled/v1.10/ReactantStatisticsExt/jl_B0YRz3".
2025-02-01 04:56:31.905782: I external/xla/xla/service/llvm_ir/llvm_command_line_options.cc:51] XLA (re)initializing LLVM with options fingerprint: 11962807958986418783

[4252] signal (11.1): Segmentation fault
in expression starting at /home/runner/work/Reactant.jl/Reactant.jl/src/Precompile.jl:60
last_fde at /workspace/srcdir/gcc-13.2.0/libgcc/unwind-dw2-fde.h:174 [inlined]
classify_object_over_fdes at /workspace/srcdir/gcc-13.2.0/libgcc/unwind-dw2-fde.c:727
init_object at /workspace/srcdir/gcc-13.2.0/libgcc/unwind-dw2-fde.c:888 [inlined]
_Unwind_Find_registered_FDE at /workspace/srcdir/gcc-13.2.0/libgcc/unwind-dw2-fde.c:1210 [inlined]
_Unwind_Find_FDE at /workspace/srcdir/gcc-13.2.0/libgcc/unwind-dw2-fde-dip.c:541
uw_frame_state_for at /workspace/srcdir/gcc-13.2.0/libgcc/unwind-dw2.c:1005
_Unwind_Backtrace at /workspace/srcdir/gcc-13.2.0/libgcc/unwind.inc:303
__backtrace at /lib/aarch64-linux-gnu/libc.so.6 (unknown line)
tsl::CurrentStackTrace[abi:cxx11]() at /home/runner/.julia/artifacts/e08cc4d821f228b8f487acd163930546f0b6ff17/lib/libReactantExtra.so (unknown line)
xla::cpu::RecordCpuCompilerStacktrace() at /home/runner/.julia/artifacts/e08cc4d821f228b8f487acd163930546f0b6ff17/lib/libReactantExtra.so (unknown line)
xla::cpu::CpuCompiler::RunBackend(std::unique_ptr<xla::HloModule, std::default_delete<xla::HloModule>>, stream_executor::StreamExecutor*, xla::Compiler::CompileOptions const&) at /home/runner/.julia/artifacts/e08cc4d821f228b8f487acd163930546f0b6ff17/lib/libReactantExtra.so (unknown line)
xla::TfrtCpuClient::Compile(xla::XlaComputation const&, xla::CompileOptions) at /home/runner/.julia/artifacts/e08cc4d821f228b8f487acd163930546f0b6ff17/lib/libReactantExtra.so (unknown line)
xla::TfrtCpuClient::Compile(mlir::ModuleOp, xla::CompileOptions) at /home/runner/.julia/artifacts/e08cc4d821f228b8f487acd163930546f0b6ff17/lib/libReactantExtra.so (unknown line)
ClientCompile at /home/runner/.julia/artifacts/e08cc4d821f228b8f487acd163930546f0b6ff17/lib/libReactantExtra.so (unknown line)
Compile at /home/runner/work/Reactant.jl/Reactant.jl/src/XLA.jl:567 [inlined]
#compile_xla#30 at /home/runner/work/Reactant.jl/Reactant.jl/src/Compiler.jl:1037
compile_xla at /home/runner/work/Reactant.jl/Reactant.jl/src/Compiler.jl:986 [inlined]
#compile#35 at /home/runner/work/Reactant.jl/Reactant.jl/src/Compiler.jl:1055
compile at /home/runner/work/Reactant.jl/Reactant.jl/src/Compiler.jl:1054

Julia 1.11 - integration - ubuntu-24.04-arm - aarch64 - packaged libReactant - assertions=false - push:

[5253] signal 11 (1): Segmentation fault
in expression starting at /home/runner/work/Reactant.jl/Reactant.jl/test/integration/cuda.jl:23
unknown function (ip: 0xffb7703830a0)
xla::cpu::CustomCallThunk::CallUntypedAPI(xla::cpu::Thunk::ExecuteParams const&) at /home/runner/.julia/artifacts/e08cc4d821f228b8f487acd163930546f0b6ff17/lib/libReactantExtra.so (unknown line)
Allocations: 222781207 (Pool: 222775637; Big: 5570); GC: 82
ERROR: LoadError: Package Reactant errored during testing (received signal: 11)

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions