fix scale only lop3 tensorize instructions.
LeiWang1999 committed Aug 19, 2024
1 parent 95c93b7 commit 67ad761
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions bitblas/gpu/intrin/lop3.py
@@ -1127,7 +1127,7 @@ def fast_decode_desc(compressed: T.handle, decompressed: T.handle, scale: T.hand
1,
],
dtype=target_dtype,
- scope="global",
+ scope="local",
)
with T.block("root"):
T.reads(Compressed[0:n_storage_elems], Scale[0:1])
@@ -1173,7 +1173,7 @@ def fast_decode_impl(compressed: T.handle, decompressed: T.handle, scale: T.hand
dtype=target_dtype,
offset_factor=1,
strides=[s0],
- scope="global",
+ scope="local",
)
with T.block("root"):
T.reads(Compressed[0:n_storage_elems], Scale[0:1])