Skip to content

[CI][v0.9.1] Add qwen3_moe W8A8 quantized model test case #1874

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: v0.9.1-dev
Choose a base branch
from

Conversation

zhoux77899
Copy link

@zhoux77899 zhoux77899 commented Jul 18, 2025

What this PR does / why we need it?

Add qwen3_moe W8A8 quantized model test case

Does this PR introduce any user-facing change?

None

How was this patch tested?

Add a W8A8 quantized qwen3_moe model in tests/singlecard/test_offline_inference.py quantized models test list

Signed-off-by: ZhouXiang <zhouxiang100@huawei.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant