Commit 63b047b

Fix typo for 16a4w_block quantization
Differential Revision: D76949897
Pull Request resolved: #11821
1 parent 6a787ce commit 63b047b

1 file changed, +1 -1 lines changed

examples/qualcomm/oss_scripts/llama/llama.py

Lines changed: 1 addition & 1 deletion
@@ -398,7 +398,7 @@ def quantize(self, quant_dtype, args, tokenizer, custom_annotations=()):
     self.llama_graph_module, self.inputs, strict=True
 ).module()
 
-if QuantDtype == QuantDtype.use_16a4w_block:
+if quant_dtype == QuantDtype.use_16a4w_block:
     conv_nodes = [
         n for n in fx_graph_module.graph.nodes if "conv" in n.name
     ]
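
The one-line change above replaces a comparison of the `QuantDtype` class itself with a comparison of the `quant_dtype` argument. A minimal sketch of why that matters, assuming `QuantDtype` is a standard Python `Enum`: the enum below is a hypothetical stand-in (only the member name `use_16a4w_block` comes from the diff), and the two helper functions are illustrative, not part of `llama.py`.

```python
from enum import Enum, auto


class QuantDtype(Enum):
    # Hypothetical stand-in for the real enum; only use_16a4w_block
    # appears in the diff, the other member is made up for illustration.
    some_other_dtype = auto()
    use_16a4w_block = auto()


def branch_taken_before_fix(quant_dtype):
    # Compares the enum *class* to one of its members. A class is never
    # equal to its member, so this is False and the argument is ignored.
    return QuantDtype == QuantDtype.use_16a4w_block


def branch_taken_after_fix(quant_dtype):
    # Compares the value actually passed in by the caller.
    return quant_dtype == QuantDtype.use_16a4w_block


if __name__ == "__main__":
    for dtype in QuantDtype:
        print(dtype.name, branch_taken_before_fix(dtype), branch_taken_after_fix(dtype))
```

Under that assumption, the pre-fix condition would evaluate to `False` for every input, so the `conv_nodes` branch guarded by it would not run even when `use_16a4w_block` was requested.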
