Fix in erroneous implementation of _mm256_bsrli_epi128 #1823

satiscugcat · 2025-06-03T15:20:13Z

This fixes the error mentioned in issue #1822

Fixing the issue mentioned in issue rust-lang#1822 of rust-lang/stdarch.

rustbot · 2025-06-03T15:20:18Z

rustbot has assigned @Amanieu.
They will have a look at your PR within the next two weeks and either review your PR or reassign to another reviewer.

Use r? to explicitly pick a reviewer

bjorn3 · 2025-06-03T15:23:48Z

crates/core_arch/src/x86/avx2.rs

@@ -191,7 +191,7 @@ pub fn _mm256_alignr_epi8<const IMM8: i32>(a: __m256i, b: __m256i) -> __m256i {
            return transmute(a);
        }

-        let r: i8x32 = match IMM8 % 16 {
+        let r: i8x32 = match IMM8 {


You also have to give the default arm a value. It is currently unreachable_unchecked(), which makes IMM8 values > 15 UB.
Edit: Maybe you should change the IMM8 == 16 check above to IMM8 >= 16? Also, there is already handling for IMM8 > 16 above, (_mm256_setzero_si256(), a), which would become unreachable in that case.

satiscugcat · Jun 3, 2025

Oh right, I was mistaken. Specifically, I think values from 17 to 31 (inclusive) are UB without the %, so it is not redundant. My mistake!
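To make the point about the `%` concrete, here is a minimal scalar sketch of one 128-bit lane. It is not the stdarch code: `alignr_lane_spec` and `alignr_lane_dispatch` are hypothetical names, byte arrays stand in for the vector types, and a safe `unreachable!()` stands in for `unreachable_unchecked()`. The control flow is assumed from the review comments above (zero result for large shifts, operand swap for IMM8 > 16, early return for IMM8 == 16, then a 16-arm match).

```rust
/// Reference model of one 128-bit lane of alignr: concatenate `a` (high half)
/// and `b` (low half) into 32 bytes, shift right by `imm` bytes, and keep the
/// low 16 bytes. Bytes shifted in from beyond the 32-byte pair are zero.
fn alignr_lane_spec(a: [u8; 16], b: [u8; 16], imm: u32) -> [u8; 16] {
    let mut cat = [0u8; 32];
    cat[..16].copy_from_slice(&b);
    cat[16..].copy_from_slice(&a);
    let mut out = [0u8; 16];
    for i in 0..16 {
        let src = i + imm as usize;
        out[i] = if src < 32 { cat[src] } else { 0 };
    }
    out
}

/// Model of the dispatch structure discussed in the review (assumed shape).
fn alignr_lane_dispatch(a: [u8; 16], b: [u8; 16], imm: u32) -> [u8; 16] {
    // Shifting by two lanes or more leaves nothing but zeros.
    if imm >= 32 {
        return [0; 16];
    }
    // Between one and two lanes: shift zeros in behind `a` instead.
    let (a, b) = if imm > 16 { ([0u8; 16], a) } else { (a, b) };
    if imm == 16 {
        return a;
    }
    // Only shifts 0..=15 have an arm. The `% 16` is what maps 17..=31 back
    // into that range; without it those values would hit the default arm.
    match imm % 16 {
        s @ 0..=15 => alignr_lane_spec(a, b, s),
        _ => unreachable!(),
    }
}
```

With the `% 16` in place, `alignr_lane_dispatch(a, b, imm)` agrees with `alignr_lane_spec(a, b, imm)` for every 8-bit `imm`. Matching on `imm` directly would send 17 to 31 to the default arm.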

bjorn3 · 2025-06-03T15:29:15Z

Did you try reverifying _mm256_alignr_epi8 with whatever tool you used to find the _mm256_bsrli_epi128 bug? And if so how did the unreachable_unchecked() bug slip past it?

satiscugcat · 2025-06-03T15:45:05Z

> Did you try reverifying _mm256_alignr_epi8 with whatever tool you used to find the _mm256_bsrli_epi128 bug? And if so how did the unreachable_unchecked() bug slip past it?

The _mm256_bsrli_epi128 bug was found manually, by noticing a discrepancy between the Intel specification and the Rust implementation and then testing it as shown in issue #1822. That is why the other bug slipped past me too. My bad!
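For reference, the behaviour Intel documents for _mm256_bsrli_epi128 can be written as a small scalar model and compared against the intrinsic's output. This is only a sketch: `bsrli_epi128_model` is a hypothetical helper, a `[u8; 32]` with byte 0 as the least significant byte stands in for `__m256i`, and it does not reproduce the test from issue #1822.

```rust
/// Scalar model of a per-lane byte shift right: each 128-bit lane of `a` is
/// shifted right by `imm8` bytes (values above 15 are clamped to 16),
/// shifting in zeros at the top of the lane.
fn bsrli_epi128_model(a: [u8; 32], imm8: u8) -> [u8; 32] {
    let shift = if imm8 > 15 { 16 } else { imm8 as usize };
    let mut dst = [0u8; 32];
    for lane in 0..2 {
        let base = lane * 16;
        // Bytes never cross the lane boundary: lane 1 does not feed lane 0.
        for i in 0..(16 - shift) {
            dst[base + i] = a[base + i + shift];
        }
    }
    dst
}
```

On a machine with AVX2, one could store the intrinsic's result into a `[u8; 32]` and compare it with this model for every IMM8 value of interest.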

sayantn · 2025-06-03T15:56:22Z

Could you check pls if this bug is also there for _mm256_bslli_epi128 and the _mm512 variants too. They might be affected as the impl is very similar

satiscugcat · 2025-06-03T16:11:21Z

> Could you check pls if this bug is also there for _mm256_bslli_epi128 and the _mm512 variants too. They might be affected as the impl is very similar

Sure! We'll start testing the variants tomorrow.

sayantn · 2025-06-03T16:22:09Z

just checked, seems like the same bug is there for _mm512_bsrli_epi128, but not for any of the bslli variants (it is actually amusing how different the implementations are for bslli and bsrli, but anyway it seems like the bslli implementation is much more robust).

Actually, would you mind changing the impl of the bsrli variants to match the bslli variants? It would also fix this bug and make the two more consistent.
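The bslli code is not shown in this thread, so the following is only a guess at the general idea: compute each shuffle index with a small `const fn` rather than writing out one match arm per shift. Every name here (`bsrli_mask`, `shuffle`, `bsrli_via_mask`) is hypothetical, and a scalar loop stands in for the real SIMD shuffle. Indices 0..32 are assumed to select from an all-zero vector and indices 32..64 from `a`.

```rust
/// Hypothetical index helper for a per-lane right byte shift.
/// Returns the shuffle index for output byte `i` (0..32).
const fn bsrli_mask(shift: u32, i: u32) -> u32 {
    let shift = shift & 0xff;
    if shift > 15 || (15 - (i % 16)) < shift {
        0 // source byte would fall outside the lane: pick a zero byte
    } else {
        32 + (i + shift) // pick byte `i + shift` of `a`, same lane
    }
}

/// Scalar stand-in for a two-input 32-byte shuffle.
fn shuffle(zero: [u8; 32], a: [u8; 32], idx: [u32; 32]) -> [u8; 32] {
    let mut out = [0u8; 32];
    for i in 0..32 {
        let j = idx[i] as usize;
        out[i] = if j < 32 { zero[j] } else { a[j - 32] };
    }
    out
}

/// Per-lane right byte shift built from the index helper above.
fn bsrli_via_mask(a: [u8; 32], shift: u32) -> [u8; 32] {
    let idx: [u32; 32] = core::array::from_fn(|i| bsrli_mask(shift, i as u32));
    shuffle([0; 32], a, idx)
}
```

Because every index comes from one formula, there is no default arm to get wrong, and shifts above 15 fall out as all zeros.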

satiscugcat · 2025-06-04T12:12:36Z

> just checked, seems like the same bug is there for _mm512_bsrli_epi128, but not for any of the bslli variants (it is actually amusing how different the implementations are for bslli and bsrli, but anyway it seems like the bslli implementation is much more robust).
>
> Actually, would you mind changing the impl of the bsrli variants to match the bslli variants? It would also fix this bug and make the two more consistent.

I've made the appropriate changes.

sayantn

LGTM. Could you also check whether this refactoring is possible for _mm{256,512}_alignr_epi8? You could look at the impl of _mm_alignr_epi8 for reference (sorry for so many requests 😅)

satiscugcat added 2 commits June 3, 2025 20:40

Fix in erroneous implementation of _mm256_bsrli_epi128

b7f345e

Fixing the issue mentioned in issue rust-lang#1822 of rust-lang/stdarch.

Removal of redundant mod operation in _mm256_alignr_epi8

21877dc

rustbot assigned Amanieu Jun 3, 2025

satiscugcat changed the title ~~Fix in erroneous implementation of _mm256_bsrli_epi128, removal of redundant operation in _mm256_alignr_epi8 issue~~ Fix in erroneous implementation of _mm256_bsrli_epi128, removal of redundant operation in _mm256_alignr_epi8 Jun 3, 2025

bjorn3 reviewed Jun 3, 2025

Fixing mistake in previous commit

fa67737

satiscugcat changed the title ~~Fix in erroneous implementation of _mm256_bsrli_epi128, removal of redundant operation in _mm256_alignr_epi8~~ Fix in erroneous implementation of _mm256_bsrli_epi128 Jun 3, 2025

Changed the implementation of bsrli to match bslli in avx2 intrinsics

311a99c

Changed implementation of bsrli in avx512.rs to match that of bslli, fixing a bug in the process.

08e8140

sayantn reviewed Jun 7, 2025
