Enable accuracy test for PR labeled with "*accuracy-test" #1040

Yikun · 2025-05-31T15:00:55Z

What this PR does / why we need it?

This PR enables accuracy tests for PRs labeled with "*accuracy-test" and for workflow_dispatch runs.

Only one model is tested for each test type, to reduce execution time.

The dense test takes about 25 mins to complete (gsm8k 7 mins, ~~mmlu 3h24mins,~~ cEval 18 mins)
The vl test takes about 40 mins to complete

In the future, we might consider enabling all test jobs as a nightly scheduled job.

The main changes are:

the dense and vl accuracy tests are both triggered by adding the accuracy-test and ready-for-test labels
the dense accuracy test is triggered by adding the dense-accuracy-test and ready-for-test labels
the vl accuracy test is triggered by adding the vl-accuracy-test and ready-for-test labels
the accuracy tests can also be triggered by workflow_dispatch
V0 and V1 are supported for Qwen, and V0 for VL
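
The label rules above can be sketched as a small decision function. This is a minimal illustration, not the actual workflow definition: the function name `select_accuracy_jobs` is hypothetical, and the choice to run both jobs on workflow_dispatch is an assumption, since the description only says that workflow_dispatch also triggers the accuracy test.

```python
from typing import Iterable, Set


def select_accuracy_jobs(event_name: str, labels: Iterable[str]) -> Set[str]:
    """Return the accuracy jobs ("dense", "vl") that should run.

    Hypothetical helper mirroring the label rules in the PR description.
    """
    label_set = set(labels)

    # Assumption: a manual workflow_dispatch run executes both jobs.
    if event_name == "workflow_dispatch":
        return {"dense", "vl"}

    # Every label-based trigger also requires the ready-for-test label.
    if "ready-for-test" not in label_set:
        return set()

    jobs: Set[str] = set()
    if "accuracy-test" in label_set:
        jobs |= {"dense", "vl"}
    if "dense-accuracy-test" in label_set:
        jobs.add("dense")
    if "vl-accuracy-test" in label_set:
        jobs.add("vl")
    return jobs
```

For example, a PR carrying only `dense-accuracy-test` would run nothing under this sketch until `ready-for-test` is added as well.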

For PR tests, we also generate a results summary in the test summary.
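
One way such a summary could be produced is sketched below. It assumes the summary is written through GitHub Actions' `GITHUB_STEP_SUMMARY` file, which the PR description does not confirm; the helper names `render_summary` and `append_summary` are hypothetical, and any scores passed in are the caller's own data.

```python
import os
from typing import Mapping, Optional


def render_summary(results: Mapping[str, float]) -> str:
    """Render dataset scores as a Markdown table (hypothetical format)."""
    lines = ["| Dataset | Accuracy |", "| --- | --- |"]
    for dataset, accuracy in results.items():
        lines.append(f"| {dataset} | {accuracy:.4f} |")
    return "\n".join(lines) + "\n"


def append_summary(results: Mapping[str, float], path: Optional[str] = None) -> bool:
    """Append the table to the job summary file; return False if no file is set.

    Assumption: the workflow uses the GITHUB_STEP_SUMMARY environment
    variable, which GitHub Actions sets to a per-step Markdown file path.
    """
    target = path or os.environ.get("GITHUB_STEP_SUMMARY")
    if not target:
        return False
    with open(target, "a", encoding="utf-8") as handle:
        handle.write(render_summary(results))
    return True
```

Outside of GitHub Actions the environment variable is normally unset, so `append_summary` does nothing unless a path is passed explicitly.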

Does this PR introduce any user-facing change?

No

How was this patch tested?

CI passed with accuracy-test label
Preview: https://github.com/vllm-project/vllm-ascend/actions/runs/15407628722?pr=1040

Closes: #953

Signed-off-by: hfadzxy <starmoon_zhang@163.com>

Signed-off-by: Yikun Jiang <yikunkero@gmail.com>

Yikun · 2025-06-03T06:42:10Z

CI passed: https://github.com/vllm-project/vllm-ascend/actions/runs/15407628722


@wangxiyuan

### What this PR does / why we need it?
As a follow-up to #1070, this patch adds a `Nominating and Removing Maintainers` section (referencing some of the design from [PyTorch Governance](https://docs.pytorch.org/docs/stable/community/governance.html)).

Below is key info about the existing maintainers:

## @wangxiyuan:
- Very active, high-quality code reviewer: [450+ PR reviewed](https://github.com/vllm-project/vllm-ascend/pulls?q=commenter%3Awangxiyuan).
- One of the top contributors; he also actively contributes, with [50+ commits ](https://github.com/vllm-project/vllm-ascend/pulls?q=is%3Apr+is%3Aclosed+review%3Aapproved+author%3Awangxiyuan+) of good quality, and he dares to [refactor the code](https://github.com/vllm-project/vllm-ascend/pulls?q=is%3Apr+author%3Awangxiyuan+is%3Aclosed+refactor), which also shows his deep understanding of vLLM and vLLM Ascend.
- He leads the [[RFC]: Hardware pluggable](vllm-project/vllm#11162) feature, which made the vllm-ascend project a reality.
- Active community involvement across WeChat groups, Slack, and GitHub issues. Involved in [150+ issue](https://github.com/vllm-project/vllm-ascend/issues?q=is%3Aissue%20state%3Aopen%20commenter%3Awangxiyuan), helping users. He was also a speaker at the vLLM Beijing meetup, helping more users understand vLLM Ascend.
- Release manager of [v0.7.1rc1](https://github.com/vllm-project/vllm-ascend/releases/tag/v0.7.1rc1), [v0.7.3rc1](https://github.com/vllm-project/vllm-ascend/releases/tag/v0.7.3rc1), [v0.7.3rc2](https://github.com/vllm-project/vllm-ascend/releases/tag/v0.7.3rc2), [v0.8.4rc1](https://github.com/vllm-project/vllm-ascend/releases/tag/v0.8.4rc1), [v0.7.3.post1](https://github.com/vllm-project/vllm-ascend/releases/tag/v0.7.3.post1).

## @Yikun:
- Highly active code reviewer: [190+ PR reviewed](https://github.com/vllm-project/vllm-ascend/pulls?q=commenter%3AYikun), especially helping new developers onboard.
- One of the top contributors with sustained contributions: [50+ commits](https://github.com/vllm-project/vllm-ascend/pulls?q=is%3Apr+is%3Aclosed+review%3Aapproved+author%3AYikun+) since the first day of vLLM Ascend.
- High-quality contributions around the vLLM compatibility guarantee; he also maintains the [CI ](#1040) and [test Framework](#730).
- Active community involvement across local groups and GitHub issues; involved in [170+ issue](https://github.com/vllm-project/vllm-ascend/issues?q=is%3Aissue%20state%3Aopen%20commenter%3AYikun). He is also the main organizer of the vLLM Beijing Meetup and a speaker at [PyTorch Day China 2025](https://pytorchdaychina2025.sched.com/event/2401V/poster-session), helping vLLM Ascend grow.
- Release manager of [v0.8.4rc2](https://github.com/vllm-project/vllm-ascend/releases/tag/v0.8.4rc2), [v0.8.5rc1](https://github.com/vllm-project/vllm-ascend/releases/tag/v0.8.5rc1), [v0.7.3](https://github.com/vllm-project/vllm-ascend/releases/tag/v0.7.3).

## @ganyi1996ppo
- Highly active, high-quality code reviewer: [90+ PR reviewed](https://github.com/vllm-project/vllm-ascend/pulls?q=commenter%3Aganyi1996ppo). He has a deep understanding of Ascend operators and of the codebase, can always find key issues, and shows good code quality and sound judgement.
- Major, high-quality contributions: [10+ commits](https://github.com/vllm-project/vllm-ascend/pulls?q=is%3Apr+is%3Aclosed+review%3Aapproved+author%3Aganyi1996ppo).
- He is the main contributor of [Custom AscendC op support](#371), [Deepseekv3 performance optimization](#598).
- Community involvement: involved in [11+ issue and help users](https://github.com/vllm-project/vllm-ascend/issues?q=is%3Aissue%20state%3Aopen%20commenter%3Aganyi1996ppo), and shared a [custom ops topic](https://www.bilibili.com/video/BV1Z25az3EqS/?share_source=copy_web&vd_source=72ef9c665af5f2f1370abe26ce1f719f&t=1342) at the vLLM Ascend weekly meeting.

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
Preview

Signed-off-by: Yikun Jiang <yikunkero@gmail.com>

Yikun added ready-for-test start test by label for PR dense-accuracy-test labels May 31, 2025

Yikun force-pushed the pr/953 branch from 471adfd to 72eedd8 Compare May 31, 2025 15:09

Yikun added dense-accuracy-test and removed dense-accuracy-test labels May 31, 2025

Yikun force-pushed the pr/953 branch from 72eedd8 to 854edfa Compare May 31, 2025 15:12

Yikun added dense-accuracy-test and removed dense-accuracy-test labels May 31, 2025

Yikun force-pushed the pr/953 branch from 3bd8aa6 to ab834ef Compare May 31, 2025 16:53

Yikun added vl-accuracy-test and removed dense-accuracy-test labels May 31, 2025

Yikun force-pushed the pr/953 branch from ab834ef to 211588c Compare May 31, 2025 17:01

Yikun added dense-accuracy-test and removed vl-accuracy-test labels May 31, 2025

Yikun force-pushed the pr/953 branch from 211588c to 1e94479 Compare May 31, 2025 17:11

Yikun removed the dense-accuracy-test label May 31, 2025

Yikun force-pushed the pr/953 branch from 1e94479 to a5c58ce Compare May 31, 2025 17:12

Yikun added the dense-accuracy-test label May 31, 2025

Yikun force-pushed the pr/953 branch from a5c58ce to 420e5d2 Compare June 1, 2025 01:49

Yikun removed the dense-accuracy-test label Jun 1, 2025

Yikun force-pushed the pr/953 branch from 420e5d2 to d4b5671 Compare June 1, 2025 01:54

Yikun added the accuracy-test enable all accuracy test for PR label Jun 1, 2025

Yikun force-pushed the pr/953 branch from d4b5671 to 11d2ef1 Compare June 1, 2025 02:14

Yikun added accuracy-test enable all accuracy test for PR and removed accuracy-test enable all accuracy test for PR labels Jun 1, 2025

Yikun force-pushed the pr/953 branch from 11d2ef1 to 46b1eff Compare June 1, 2025 02:18

Yikun added the accuracy-test enable all accuracy test for PR label Jun 2, 2025

Yikun force-pushed the pr/953 branch from 0e32b8d to 85ea63e Compare June 3, 2025 03:06

Yikun added accuracy-test enable all accuracy test for PR and removed accuracy-test enable all accuracy test for PR labels Jun 3, 2025

zhangxinyuehfad and others added 5 commits June 3, 2025 11:08

[Bugfix] Fix accuarcy test

805933e

Signed-off-by: hfadzxy <starmoon_zhang@163.com>

fix

91f102b

Signed-off-by: Yikun Jiang <yikunkero@gmail.com>

tmp

7b784f0

Signed-off-by: Yikun Jiang <yikunkero@gmail.com>

vl -> tp4

7d77905

Signed-off-by: Yikun Jiang <yikunkero@gmail.com>

Suport V0 and V1 and remove unused mmlu

dcf0bb9

Signed-off-by: Yikun Jiang <yikunkero@gmail.com>

Yikun force-pushed the pr/953 branch from 85ea63e to dcf0bb9 Compare June 3, 2025 03:08

Yikun added accuracy-test enable all accuracy test for PR and removed accuracy-test enable all accuracy test for PR labels Jun 3, 2025

Yikun marked this pull request as ready for review June 3, 2025 03:48

Yikun added ready read for review and removed accuracy-test enable all accuracy test for PR labels Jun 3, 2025

Yikun added ready-for-test start test by label for PR and removed ready-for-test start test by label for PR labels Jun 3, 2025

wangxiyuan approved these changes Jun 3, 2025


Yikun merged commit f24375f into vllm-project:main Jun 3, 2025
17 of 21 checks passed

Yikun mentioned this pull request Jun 9, 2025

Init vLLM Ascend maintainers info #1124

Merged
