-
Notifications
You must be signed in to change notification settings - Fork 480
[CI]Update accuracy report test #1288
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
cc85eba
to
d32a860
Compare
4832fa4
to
7052731
Compare
7052731
to
ba0fb30
Compare
ba0fb30
to
580b14f
Compare
.github/workflows/accuracy_test.yaml
Outdated
type: choice | ||
options: | ||
- main | ||
- v0.9.0-dev |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
- v0.9.0-dev | |
- v0.9.1-dev |
type: choice | ||
options: | ||
- main | ||
- v0.9.0-dev |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please remove accuracy_report.yaml
- Please append accuracy_report after accuracy test
- Report only when dispatch
- Using bot
benchmarks/scripts/run_accuracy.py
Outdated
datasets=datasets) | ||
model = model_name.split("/")[1] | ||
preamble = f"""# 🎯 {model} Accuracy Test | ||
preamble = f"""# {ACCURACY_FLAG}🎯 {model} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I curious why this is not a markdown format to reduce maintainence cost
Please update hash as follow:
# 🎯 Qwen2.5-7B-Instruct
**vLLM Version**: vLLM: 0.9.1 ([b6553be](https://github.com/vllm-project/vllm/commit/b6553be1bc75f046b00046a4ad7576364d03c835)) , **vLLM Ascend**: refs/pull/1288/merge([c59e3895](https://github.com/vllm-project/vllm-ascend/commit/c59e3895e227dfc07cef71f0163b6ca8ad3649a6))
Preview:
🎯 Qwen2.5-7B-Instruct
vLLM Version: vLLM: 0.9.1 (b6553be) , vLLM Ascend: refs/pull/1288/merge(c59e3895)
benchmarks/scripts/run_accuracy.py
Outdated
f"| {n_shot:6} " | ||
f"| {metric:<6} " | ||
f"| ↑ {value:>5.4f} " | ||
f"| {value:>5.4f} " |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Add ✅ ❌ here
benchmarks/scripts/run_accuracy.py
Outdated
|
||
|
||
def main(args): | ||
global ACCURACY_FLAG |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Remove this.
parser.add_argument("--torch_npu_version", type=str, required=False) | ||
parser.add_argument("--vllm_version", type=str, required=False) | ||
parser.add_argument("--cann_version", type=str, required=False) | ||
parser.add_argument("--vllm_commit", type=lambda s: s[:7], required=False) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
only vllm_commit
and vllm_ascend_commit
are needed, other var can be generate from them
benchmarks/tests/accuracy.json
Outdated
@@ -0,0 +1,28 @@ | |||
{ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's move this to run_accuracy to keep it simple
fe8dd57
to
b0b31fc
Compare
Codecov Report✅ All modified and coverable lines are covered by tests. Additional details and impacted files@@ Coverage Diff @@
## main #1288 +/- ##
==========================================
- Coverage 27.39% 27.21% -0.19%
==========================================
Files 56 56
Lines 6191 6214 +23
==========================================
- Hits 1696 1691 -5
- Misses 4495 4523 +28
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
10af3a6
to
ae04ba2
Compare
ae04ba2
to
b0f64e8
Compare
477f4eb
to
5c5d704
Compare
Signed-off-by: hfadzxy <starmoon_zhang@163.com>
5c5d704
to
9a0fc54
Compare
- name: Get vLLM-Ascend commit hash and URL | ||
working-directory: ./vllm-ascend | ||
run: | | ||
VLLM_ASCEND_COMMIT=$(git rev-parse HEAD) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
short
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, let's merge this to see it work or not. some nits could address in separate PR.
run: | | ||
VLLM_ASCEND_COMMIT=$(git rev-parse HEAD) | ||
echo "VLLM_ASCEND_COMMIT=$VLLM_ASCEND_COMMIT" >> $GITHUB_ENV | ||
echo "VLLM_ASCEND_COMMIT_URL=https://github.com/vllm-project/vllm-ascend/commit/$VLLM_ASCEND_COMMIT" >> $GITHUB_ENV |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
remove
### What this PR does / why we need it? fix accuracy test: 1. fix accuracy report like:https://vllm-ascend--1429.org.readthedocs.build/en/1429/developer_guide/evaluation/accuracy_report/Qwen2.5-7B-Instruct-V0.html 2. fix create pr for report Signed-off-by: hfadzxy <starmoon_zhang@163.com>
### What this PR does / why we need it? Update accuracy report test 1. Add Record commit hashes and GitHub links for both vllm and vllm-ascend in accuracy reports 2. Add accuracy result verification checks to ensure output correctness 3. Creat PR via forked repository workflow ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? dense-accuracy-test: https://github.com/vllm-project/vllm-ascend/actions/runs/15745619485 create pr via forked repository workflow: https://github.com/zhangxinyuehfad/vllm-ascend/actions/runs/15747013719/job/44385134080 accuracy report pr: vllm-project#1292 Currently, the accuracy report used is old and needs to be merged into pr, retest, update new report, then close vllm-project#1292 . Signed-off-by: hfadzxy <starmoon_zhang@163.com>
…llm-project#1435) ### What this PR does / why we need it? fix accuracy test: 1. fix accuracy report like:https://vllm-ascend--1429.org.readthedocs.build/en/1429/developer_guide/evaluation/accuracy_report/Qwen2.5-7B-Instruct-V0.html 2. fix create pr for report Signed-off-by: hfadzxy <starmoon_zhang@163.com>
What this PR does / why we need it?
Update accuracy report test
Does this PR introduce any user-facing change?
How was this patch tested?
dense-accuracy-test: https://github.com/vllm-project/vllm-ascend/actions/runs/15745619485
create pr via forked repository workflow: https://github.com/zhangxinyuehfad/vllm-ascend/actions/runs/15747013719/job/44385134080
accuracy report pr: #1292
Currently, the accuracy report used is old and needs to be merged into pr, retest, update new report, then close #1292 .