[Refactor] Support gatingtopk operator generalization and remove row_idx #3265
base: main
Conversation
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge:
If CI fails, you can run linting and testing checks locally according to Contributing and Testing.
Code Review
This pull request refactors the MoE expert selection logic to generalize the gatingtopk operator and remove the row_idx tensor, which simplifies the codebase across numerous files. The removal of row_idx appears to be handled correctly and consistently. However, the generalization of _select_experts_with_fusion_ops in vllm_ascend/ops/moe/experts_selector.py introduces a critical issue: the new logic incorrectly handles grouped top-k, which will likely lead to incorrect behavior and potential crashes. I have provided a detailed comment with a suggested fix for this issue.
    if scoring_func == "softmax":
        norm_type = 0
        topk_group = 1
        num_expert_group = 1
    else:
        norm_type = 1
The current logic for determining topk_group and num_expert_group is incorrect. It unconditionally overwrites these parameters to 1 when scoring_func is "softmax", effectively disabling grouped top-k for softmax, which is not the intended behavior when use_grouped_topk is True. Additionally, for scoring_func="sigmoid", if use_grouped_topk is False, topk_group and num_expert_group would be None, which could lead to a crash in the torch_npu.npu_moe_gating_top_k operator.

The logic should be updated to respect the use_grouped_topk parameter for all scoring functions: when use_grouped_topk is False, topk_group and num_expert_group should be set to 1 to disable grouping; otherwise, the provided values should be used.
Suggested change (original, then replacement):

Original:
    if scoring_func == "softmax":
        norm_type = 0
        topk_group = 1
        num_expert_group = 1
    else:
        norm_type = 1

Suggested:
    if scoring_func == "softmax":
        norm_type = 0
    else:
        norm_type = 1
    if not use_grouped_topk:
        topk_group = 1
        num_expert_group = 1
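As an illustration only, here is a minimal sketch of the corrected parameter handling factored into a standalone helper. The function name _normalize_gating_params is hypothetical and not part of this PR or of vllm_ascend; it simply encodes the rule described above: norm_type follows scoring_func, and the group parameters are forced to 1 only when use_grouped_topk is False, so the fused gating operator is never handed None.

    from typing import Optional, Tuple

    def _normalize_gating_params(
        scoring_func: str,
        use_grouped_topk: bool,
        topk_group: Optional[int],
        num_expert_group: Optional[int],
    ) -> Tuple[int, Optional[int], Optional[int]]:
        # Hypothetical helper sketching the suggested logic (not the PR's actual code).
        # norm_type: 0 selects softmax scoring, 1 selects sigmoid scoring.
        norm_type = 0 if scoring_func == "softmax" else 1
        # Disable grouping only when grouped top-k is not requested, so softmax
        # with use_grouped_topk=True keeps the caller-provided group values.
        if not use_grouped_topk:
            topk_group = 1
            num_expert_group = 1
        return norm_type, topk_group, num_expert_group

    # sigmoid scoring without grouped top-k no longer yields None group sizes
    assert _normalize_gating_params("sigmoid", False, None, None) == (1, 1, 1)
    # softmax scoring with grouped top-k keeps the provided group configuration
    assert _normalize_gating_params("softmax", True, 4, 8) == (0, 4, 8)

Keeping the softmax/sigmoid choice independent of the grouping decision is the core of the suggested change.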
b990378 to c73dd8f
Signed-off-by: 1092626063 <1092626063@qq.com>
Signed-off-by: CaranLic <740821011@qq.com>
What this PR does / why we need it?
Does this PR introduce any user-facing change?
How was this patch tested?