Commit 8913a0c

[Quantization] Fix AWQ supported dtypes, add BF16.
Signed-off-by: menogrey <[email protected]>
1 parent 65c6ef3 commit 8913a0c

1 file changed, +1 -1 lines changed

vllm_ascend/quantization/awq/awq.py

Lines changed: 1 addition & 1 deletion
@@ -165,7 +165,7 @@ def get_name(self) -> str:
 
     @classmethod
     def get_supported_act_dtypes(cls) -> List[torch.dtype]:
-        return [torch.half]
+        return [torch.half, torch.bfloat16]
 
     @classmethod
     def get_min_capability(cls) -> int:
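
For context, a minimal sketch of how a caller might consult the list returned by get_supported_act_dtypes before loading a quantized model. This is not code from this repository: DType is a stand-in for torch.dtype so the sketch runs without PyTorch, and AWQConfigSketch and check_act_dtype are hypothetical names introduced only for illustration.

```python
from enum import Enum
from typing import List, Type


class DType(Enum):
    """Stand-in for torch.dtype so the sketch runs without PyTorch."""
    HALF = "float16"
    BFLOAT16 = "bfloat16"
    FLOAT32 = "float32"


class AWQConfigSketch:
    """Hypothetical config class mirroring the method changed in this commit."""

    @classmethod
    def get_supported_act_dtypes(cls) -> List[DType]:
        # After the change, BF16 is listed alongside FP16.
        return [DType.HALF, DType.BFLOAT16]


def check_act_dtype(config_cls: Type[AWQConfigSketch], dtype: DType) -> None:
    """Raise ValueError if the activation dtype is not in the supported list."""
    supported = config_cls.get_supported_act_dtypes()
    if dtype not in supported:
        names = ", ".join(d.value for d in supported)
        raise ValueError(
            f"{dtype.value} is not supported; expected one of: {names}"
        )


# BF16 activations pass the check once the list includes bfloat16.
check_act_dtype(AWQConfigSketch, DType.BFLOAT16)
```

Under the old one-element list, the same check would reject bfloat16, which is the behavior this commit is presumably meant to remove.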
