
LLVM ERROR: mma16816 data type not supported #4922

Closed · mobicham opened this issue Oct 16, 2024 · 8 comments

mobicham commented Oct 16, 2024

The latest Triton build (3.1.0) throws the following error when using bitpacked data inside a loop with tl.dot:

LLVM ERROR: mma16816 data type not supported

With a build from source, I get a different error:

unimplemented code path
UNREACHABLE executed at /root/triton_op/triton/lib/Conversion/TritonGPUToLLVM/ElementwiseOpToLLVM.cpp:79!
Aborted (core dumped)

This error happens on Ampere and Hopper, but not on older GPUs like the Titan RTX / 2080 Ti.

The bitpacked data is read with indices of the form offs_k[:, None] // num_elements, which yields something like [0, 0, 0, ..., 1, 1, 1, ..., 64, 64, 64].
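
For context, a minimal sketch of that indexing pattern (stride_bk, stride_bn, and the block-size names are illustrative placeholders, not necessarily the gist's actual identifiers):

```python
import triton
import triton.language as tl

@triton.jit
def load_packed_sketch(b_ptr, stride_bk, stride_bn,
                       num_elements: tl.constexpr,
                       BLOCK_SIZE_K: tl.constexpr,
                       BLOCK_SIZE_N: tl.constexpr):
    offs_k = tl.arange(0, BLOCK_SIZE_K)
    offs_n = tl.arange(0, BLOCK_SIZE_N)
    # Several logical K rows share one packed row, so the row index is
    # divided down: e.g. [0, 0, ..., 1, 1, ..., 64, 64, ...].
    b_ptrs = b_ptr \
        + (offs_k[:, None] // num_elements) * stride_bk \
        + offs_n[None, :] * stride_bn
    b = tl.load(b_ptrs)  # packed words, each holding several low-bit values
```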

I faced this error with the previous build as well, and found that replacing for k in range(0, total_blocks_k, 1): with for k in tl.range(0, total_blocks_k, 1, num_stages=1): fixed it, but this workaround no longer works with 3.1.0.
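
Concretely, the replacement inside the kernel's K loop looked like this (the loop body is illustrative; total_blocks_k as above):

```python
# Original form, with the compiler free to software-pipeline the loop:
for k in range(0, total_blocks_k, 1):
    acc += tl.dot(a, b)  # illustrative body

# Previous workaround: tl.range with num_stages=1 disables pipelining
# for this loop. It avoided the crash on earlier builds, but not on 3.1.0.
for k in tl.range(0, total_blocks_k, 1, num_stages=1):
    acc += tl.dot(a, b)
```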

Here's a full script to reproduce it:
https://gist.github.com/mobicham/f9eba3c07f7e497ae622194a9c5e4822

lezcano (Contributor) commented Nov 6, 2024

I think #5044 may fix this issue on Ampere. Mixed-dtype tl.dot is not so well supported on Hopper yet, though #5003 is making good progress on that front.

lezcano (Contributor) commented Nov 6, 2024

Also, out of curiosity, can you post the ttgir?

mobicham (Author) commented Nov 6, 2024

Thank you @lezcano!

b.to(tl.float32).to(tl.float16) doesn't break the loop, but b.to(tl.float16) does. In the end, with or without the intermediate tl.float32 cast, tl.dot is getting fp16 inputs, which sounds kind of strange, doesn't it?
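
For reference, the two variants being compared, where b_unpacked stands for the unpacked block inside the loop (names illustrative):

```python
# Breaks the loop: cast the unpacked values directly to fp16.
b = b_unpacked.to(tl.float16)
acc += tl.dot(a, b)

# Doesn't break: round-trip through fp32 first. Either way tl.dot
# ends up with an fp16 operand, which is what makes this surprising.
b = b_unpacked.to(tl.float32).to(tl.float16)
acc += tl.dot(a, b)
```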

Here's the ttgir for the b.to(tl.float32).to(tl.float16) version which doesn't crash:
https://gist.github.com/mobicham/ae48cdf55f7062994eae3e2653d26afa#file-b-to-tl-float32-to-tl-float16-_log-txt-L148

lezcano (Contributor) commented Nov 6, 2024

Just to make sure I understand. The repro in the OP still breaks with #5044 patched in? That's rather weird. What's the crash you see? Could you also run the script with TRITON_ENABLE_PYTHON_STACKTRACE=1 and post the stacktrace for the crash?

Jokeren (Contributor) commented Nov 6, 2024

I cannot reproduce it. Maybe the author of the PR is not using the correct Triton version.

mobicham (Author) commented Nov 6, 2024

@Jokeren I just tried it on an A100, and it does throw that error. I am using 3.1.0 since the nightly builds are broken. Let me build Triton from source and re-check.

lezcano (Contributor) commented Nov 6, 2024

Note that you should build not master but the commit linked above, as it hasn't landed yet! Master will probably still break.

mobicham (Author) commented Nov 6, 2024

I can confirm that the build from https://github.com/triton-lang/triton/tree/keren/dot-mma-1 solves the issue.
Thank you both for taking the time to look into this, really appreciate it!

mobicham closed this as completed Nov 6, 2024