[Misc] Rev DeepEP #27122

varun-sundar-rabindranath · 2025-10-17T20:42:32Z

Purpose

Update the DeepEP commit to DeepEP main (73b6ea4) to pick up additional hidden-size support, which is useful for gpt-oss low-latency DP/EP.

Test Plan

Run unit test test_deepep_moe.py locally

e2e:
Server: VLLM_ALL2ALL_BACKEND="deepep_high_throughput" VLLM_USE_DEEP_GEMM=1 canhazgpu run -g2 -- vllm serve Qwen/Qwen3-30B-A3B-FP8 --trust-remote-code --enable-expert-parallel --data-parallel-size 2 --no-enable-prefix-caching

Server: VLLM_ALL2ALL_BACKEND="deepep_low_latency" VLLM_USE_DEEP_GEMM=1 canhazgpu run -g2 -- vllm serve Qwen/Qwen3-30B-A3B-FP8 --trust-remote-code --enable-expert-parallel --data-parallel-size 2 --port 9010 --no-enable-prefix-caching

lm_eval: lm_eval --model local-completions --tasks gsm8k --model_args model=Qwen/Qwen3-30B-A3B-FP8,base_url=http://127.0.0.1:9010/v1/completions,num_concurrent=30,max_retries=3 --limit 100

Test Result

|Tasks|Version|     Filter     |n-shot|  Metric   |   |Value|   |Stderr|
|-----|------:|----------------|-----:|-----------|---|----:|---|-----:|
|gsm8k|      3|flexible-extract|     5|exact_match|↑  | 0.94|±  |0.0239|
|     |       |strict-match    |     5|exact_match|↑  | 0.94|±  |0.0239|

|Tasks|Version|     Filter     |n-shot|  Metric   |   |Value|   |Stderr|
|-----|------:|----------------|-----:|-----------|---|----:|---|-----:|
|gsm8k|      3|flexible-extract|     5|exact_match|↑  | 0.86|±  |0.0349|
|     |       |strict-match    |     5|exact_match|↑  | 0.93|±  |0.0256|

mergify · 2025-10-17T20:43:13Z

This pull request has merge conflicts that must be resolved before it can be
merged. Please rebase the PR, @varun-sundar-rabindranath.

https://docs.github.com/en/pull-requests/collaborating-with-pull-requests/working-with-forks/syncing-a-fork

varun-sundar-rabindranath · 2025-10-17T20:43:30Z

cc @mgoin @tlrmchlsmth @bnellnm @yewentao256 @LucasWilkinson @SageMoore

gemini-code-assist

Code Review

This pull request updates the DeepEP dependency to a newer commit, enabling support for a hidden size of 3072. The changes are straightforward, involving an update to the commit hash in the installation script and adding the new size to the supported sizes list. My review includes a performance-related suggestion to use a frozenset for the list of supported hidden sizes, which is more efficient for membership checks in a low-latency context.

gemini-code-assist · 2025-10-17T20:43:35Z

vllm/model_executor/layers/fused_moe/deepep_ll_prepare_finalize.py

    # DeepEP low-latency kernels are compiled only for certain
    # specific hidden sizes.
-    SUPPORTED_HIDDEN_SIZES = [2048, 2560, 4096, 5120, 6144, 7168]
+    SUPPORTED_HIDDEN_SIZES = [2048, 2560, 3072, 4096, 5120, 6144, 7168]


For improved performance, consider using a frozenset for SUPPORTED_HIDDEN_SIZES. Membership testing (in) is O(1) on average for sets, while it is O(n) for lists. Given that this check is in a performance-sensitive path for low-latency kernels, this optimization is beneficial. A frozenset is suitable here as it's an immutable constant.

Suggested change

-    SUPPORTED_HIDDEN_SIZES = [2048, 2560, 3072, 4096, 5120, 6144, 7168]
+    SUPPORTED_HIDDEN_SIZES = frozenset([2048, 2560, 3072, 4096, 5120, 6144, 7168])
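To make the trade-off in this suggestion concrete, here is a minimal sketch of the membership check with both container types. The helper names `is_supported_hidden_size` and `compare_lookup_cost` are hypothetical and are not taken from the vLLM source; only the list of sizes comes from the diff. With seven elements the practical difference is likely small, so it is worth measuring rather than assuming.

```python
import timeit

# Sizes copied from the diff in this PR; everything else here is illustrative.
SUPPORTED_HIDDEN_SIZES_LIST = [2048, 2560, 3072, 4096, 5120, 6144, 7168]
SUPPORTED_HIDDEN_SIZES_SET = frozenset(SUPPORTED_HIDDEN_SIZES_LIST)


def is_supported_hidden_size(hidden_size, supported=SUPPORTED_HIDDEN_SIZES_SET):
    """Hypothetical helper: True if hidden_size is in the supported collection."""
    return hidden_size in supported


def compare_lookup_cost(hidden_size=7168, number=100_000):
    """Time `number` membership checks against the list and the frozenset.

    Returns (list_seconds, frozenset_seconds). The default probes the last
    list element, which is the worst case for the linear list scan.
    """
    list_seconds = timeit.timeit(
        lambda: hidden_size in SUPPORTED_HIDDEN_SIZES_LIST, number=number
    )
    set_seconds = timeit.timeit(
        lambda: hidden_size in SUPPORTED_HIDDEN_SIZES_SET, number=number
    )
    return list_seconds, set_seconds


if __name__ == "__main__":
    list_s, set_s = compare_lookup_cost()
    print(f"list: {list_s:.4f}s  frozenset: {set_s:.4f}s")
```

Both containers give the same answer for every input; the frozenset additionally guarantees the constant cannot be mutated by accident.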

mgoin

Nice :) Please fix the conflict and we can mark this ready.

yewentao256

LGTM, thanks for the work!

Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>

varun-sundar-rabindranath requested a review from mgoin as a code owner October 17, 2025 20:42

mergify bot added the needs-rebase label Oct 17, 2025

gemini-code-assist bot reviewed Oct 17, 2025

mgoin approved these changes Oct 17, 2025

tlrmchlsmth approved these changes Oct 17, 2025

varun-sundar-rabindranath force-pushed the varun/rev-deepep branch from f034ced to 1c11c4b Compare October 17, 2025 20:56

mergify bot removed the needs-rebase label Oct 17, 2025

bnellnm approved these changes Oct 17, 2025

yewentao256 approved these changes Oct 17, 2025

zhuohan123 approved these changes Oct 17, 2025

mgoin added the ready, dependencies, and deepseek labels Oct 17, 2025

mgoin enabled auto-merge (squash) October 17, 2025 23:02

jeejeelee approved these changes Oct 18, 2025

rev deepep

0f15d4f

Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com>

auto-merge was automatically disabled October 18, 2025 03:19
Head branch was pushed to by a user without write access

varun-sundar-rabindranath force-pushed the varun/rev-deepep branch from 1c11c4b to 0f15d4f Compare October 18, 2025 03:19

DarkLight1337 merged commit 30a33b9 into vllm-project:main Oct 18, 2025
50 checks passed

lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025

[Misc] Rev DeepEP (vllm-project#27122)

829e066

Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>

adabeyta pushed a commit to adabeyta/vllm that referenced this pull request Oct 20, 2025

[Misc] Rev DeepEP (vllm-project#27122)

d62af91

Signed-off-by: Varun Sundar Rabindranath <vsundarr@redhat.com> Co-authored-by: Varun Sundar Rabindranath <vsundarr@redhat.com>
