Commit

skip wvSpltK when m,n are both tiny
root committed Jun 20, 2024
1 parent 93aab3c commit e6c9b37
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion vllm/model_executor/layers/tuned_gemm.py
@@ -91,7 +91,7 @@ def mm(self, inp, weights):
self.tuned_df.to_csv(self.untune_path, index=False)

if ((n == 4 or n == 3 or n == 2 or n == 1) and k % 8 == 0
- and inp_view.dtype == torch.float16):
+ and m > 8 and inp_view.dtype == torch.float16):
out = torch.empty(inp_view.shape[0],
weights.shape[0],
dtype=inp_view.dtype,
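
The effect of the added `m > 8` term can be sketched as a standalone predicate. This is an illustrative sketch only: `takes_wvspltk_path` and its `is_fp16` flag are hypothetical names that do not appear in `tuned_gemm.py`, and the code assumes `m`, `n`, and `k` are the same integers tested in the hunk above. What the caller does when the guard is false is not shown in this diff.

```python
def takes_wvspltk_path(m: int, n: int, k: int, is_fp16: bool) -> bool:
    """Hypothetical mirror of the guard in mm() after this commit.

    True means the branch that allocates `out` (the wvSpltK path named in
    the commit title) would be entered for these dimensions.
    """
    return (
        n in (1, 2, 3, 4)   # same as n == 4 or n == 3 or n == 2 or n == 1
        and k % 8 == 0
        and m > 8           # term added by this commit
        and is_fp16         # stands in for inp_view.dtype == torch.float16
    )


# Before the commit only n, k and dtype were checked, so a case with a
# tiny m still entered the branch. With the new term it no longer does.
print(takes_wvspltk_path(m=4096, n=1, k=4096, is_fp16=True))
print(takes_wvspltk_path(m=8, n=1, k=4096, is_fp16=True))
```

Under these assumptions the first call reports that the branch is taken and the second that it is skipped, which matches the commit title: the path is bypassed when `m` and `n` are both tiny.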
