Skip to content

Conversation

@rootfs
Copy link
Collaborator

@rootfs rootfs commented Sep 15, 2025

What type of PR is this?

When testing different reasoning models, the reason bench needs to adjust max token to avoid partial responses that result in wrong answers to many datasets include MMLU and GPQA

What this PR does / why we need it:

Which issue(s) this PR fixes:

Fixes #

Release Notes: Yes/No

@netlify
Copy link

netlify bot commented Sep 15, 2025

Deploy Preview for vllm-semantic-router ready!

Name Link
🔨 Latest commit 9d22297
🔍 Latest deploy log https://app.netlify.com/projects/vllm-semantic-router/deploys/68cd67e0ffc0b20008bc3388
😎 Deploy Preview https://deploy-preview-137--vllm-semantic-router.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify project configuration.

@github-actions
Copy link

github-actions bot commented Sep 15, 2025

👥 vLLM Semantic Team Notification

The following members have been identified for the changed files in this PR and have been automatically assigned:

📁 bench

Owners: @yuezhu1, @Xunzhuo
Files changed:

  • bench/vllm_semantic_router_bench/router_reason_bench_multi_dataset.py

vLLM

🎉 Thanks for your contributions!

This comment was automatically generated based on the OWNER files in the repository.

@rootfs rootfs marked this pull request as draft September 15, 2025 23:22
Signed-off-by: Huamin Chen <hchen@redhat.com>
@rootfs rootfs force-pushed the fix-max-token-bench branch from 2bc24ff to d2691a4 Compare September 19, 2025 14:25
@rootfs rootfs marked this pull request as ready for review September 19, 2025 14:25
Copy link
Collaborator

@yuezhu1 yuezhu1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@yuezhu1 yuezhu1 merged commit fac50b1 into vllm-project:main Sep 19, 2025
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants