Skip to content

Commit

Permalink
disable block reduce for int8
Browse files Browse the repository at this point in the history
  • Loading branch information
LeiWang1999 committed Aug 13, 2024
1 parent 1816526 commit b81a3a8
Show file tree
Hide file tree
Showing 2 changed files with 3 additions and 2 deletions.
2 changes: 1 addition & 1 deletion 3rdparty/tvm
3 changes: 2 additions & 1 deletion bitblas/gpu/matmul_analysis.py
Original file line number Diff line number Diff line change
Expand Up @@ -623,7 +623,8 @@ def check_last_trait(region: List[Range]):
# Currently, we only support block reduction depth 2 for small M
# When the func is a dequantize like ops, we should consider the M
require_block_reduce = False
if hasattr(func.attrs, "dequantize_info"):
# And we only support float16 for now
if hasattr(func.attrs, "dequantize_info") and in_dtype == "float16":
for arg in func.params:
inp_shape = func.buffer_map[arg].shape
M = inp_shape[0]
Expand Down

0 comments on commit b81a3a8

Please sign in to comment.