Skip to content

Commit 8a917e9

Browse files
rohansjoshifacebook-github-bot
authored andcommitted
Fix typo for 16a4w_block quantization (#11821)
Summary: Pull Request resolved: #11821 Differential Revision: D76949897
1 parent 496cb05 commit 8a917e9

File tree

1 file changed

+1
-1
lines changed
  • examples/qualcomm/oss_scripts/llama

1 file changed

+1
-1
lines changed

examples/qualcomm/oss_scripts/llama/llama.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -398,7 +398,7 @@ def quantize(self, quant_dtype, args, tokenizer, custom_annotations=()):
398398
self.llama_graph_module, self.inputs, strict=True
399399
).module()
400400

401-
if QuantDtype == QuantDtype.use_16a4w_block:
401+
if quant_dtype == QuantDtype.use_16a4w_block:
402402
conv_nodes = [
403403
n for n in fx_graph_module.graph.nodes if "conv" in n.name
404404
]

0 commit comments

Comments
 (0)