Skip to content

Commit 07411f9

Browse files
22dimensionswangxiaoxin (A)
authored andcommitted
[CI] remove old quantization model (vllm-project#1003)
remove old quantization model, and new models will be added to testcase later. Signed-off-by: 22dimensions <waitingwind@foxmail.com> Signed-off-by: wangxiaoxin (A) <wangxiaoxin7@huawei.com>
1 parent a3785f6 commit 07411f9

File tree

1 file changed

+4
-0
lines changed

1 file changed

+4
-0
lines changed

tests/singlecard/test_offline_inference.py

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -37,6 +37,10 @@
3737
"Qwen/Qwen3-0.6B-Base",
3838
]
3939
MULTIMODALITY_MODELS = ["Qwen/Qwen2.5-VL-3B-Instruct"]
40+
41+
QUANTIZATION_MODELS = [
42+
"vllm-ascend/Qwen2.5-0.5B-Instruct-W8A8-new",
43+
]
4044
os.environ["PYTORCH_NPU_ALLOC_CONF"] = "max_split_size_mb:256"
4145

4246
QUANTIZATION_MODELS = [

0 commit comments

Comments
 (0)