[Int4-AWQ] Fix AWQ Marlin check for ROCm
hegemanjw4amd committed Sep 26, 2024
1 parent 5c50fca commit dd53521
Showing 1 changed file with 9 additions and 3 deletions.
12 changes: 9 additions & 3 deletions vllm/config.py
@@ -285,9 +285,15 @@ def _verify_quantization(self) -> None:
                 quantization_override = method.override_quantization_method(
                     quant_cfg, self.quantization)
                 if quantization_override:
-                    quant_method = quantization_override
-                    self.quantization = quantization_override
-                    break
+                    if is_hip():
+                        if quantization_override in rocm_supported_quantization:
+                            quant_method = quantization_override
+                            self.quantization = quantization_override
+                            break
+                    else:
+                        quant_method = quantization_override
+                        self.quantization = quantization_override
+                        break
 
             # Verify quantization configurations.
             if self.quantization is None:
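
The behaviour this change appears to introduce can be sketched in isolation: an override returned by a quantization method is accepted unconditionally off ROCm, but on ROCm only if it is in the supported list; otherwise the loop moves on to the next method. The snippet below is a minimal standalone sketch, not the vLLM code. `pick_override`, its parameters, and the contents of `ROCM_SUPPORTED` are hypothetical names and values chosen for illustration; the real `rocm_supported_quantization` list is not shown in this diff.

```python
from typing import Iterable, List, Optional

# Hypothetical stand-in for rocm_supported_quantization in vllm/config.py.
# The real list is not visible in this diff; these values are assumptions.
ROCM_SUPPORTED: List[str] = ["awq", "gptq", "fp8"]


def pick_override(
    candidates: Iterable[Optional[str]],
    on_rocm: bool,
    rocm_supported: List[str] = ROCM_SUPPORTED,
) -> Optional[str]:
    """Return the first override usable on the current platform, else None."""
    for override in candidates:
        if not override:
            # This method proposed no override; try the next one.
            continue
        if on_rocm and override not in rocm_supported:
            # On ROCm, skip overrides the platform does not support.
            continue
        return override
    return None


# Off ROCm the Marlin override is taken; on ROCm it is skipped,
# so the caller would keep the quantization method it already had.
print(pick_override(["awq_marlin"], on_rocm=False))
print(pick_override(["awq_marlin"], on_rocm=True))
```

Returning `None` on ROCm corresponds to leaving `self.quantization` unchanged in the diff, so the later verification step sees the originally configured method rather than an unsupported override.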
