SwiftBalancer Zero OverHead Expert Movement #1855
Conversation
@@ -37,6 +37,7 @@ def __init__(self, vllm_config):
            ascend_scheduler_config)

        self.expert_map_path = additional_config.get("expert_map_path", None)
        self.dynamic_eplb = additional_config.get("dynamic_eplb", False)
Can we use the vLLM EPLB config enable_eplb instead of adding a new config?
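For context, a minimal usage sketch of how these options might be supplied, assuming vLLM's additional_config engine argument; the key names come from the diff above, while the model name and map path are placeholders:

```python
# Hedged sketch: enabling dynamic EPLB through additional_config.
# "expert_map_path" and "dynamic_eplb" are the keys read in the diff above;
# the model name and file path are illustrative placeholders.
from vllm import LLM

llm = LLM(
    model="deepseek-ai/DeepSeek-V2-Lite",
    additional_config={
        "expert_map_path": "/path/to/expert_map.json",  # optional static placement
        "dynamic_eplb": True,  # enable dynamic expert load balancing
    },
)
```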
from abc import ABC, abstractmethod


class EplbAdaptor():
What is this abstract class used for?
It is an abstraction over SGLang/vLLM.
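To make that intent concrete, here is a minimal sketch of such an adaptor abstraction; the method names and the VllmEplbAdaptor body are illustrative assumptions, not the PR's final API:

```python
from abc import ABC, abstractmethod


class EplbAdaptor(ABC):
    """Hides the serving engine (vLLM, SGLang, ...) behind one interface."""

    @abstractmethod
    def get_expert_map(self, layer_id: int):
        """Return the current expert placement for one MoE layer."""

    @abstractmethod
    def do_update_expert_map(self, layer_id: int, new_map):
        """Apply a rebalanced placement to one MoE layer."""


class VllmEplbAdaptor(EplbAdaptor):
    """vLLM-specific implementation; an SGLang adaptor would mirror it."""

    def __init__(self, model):
        self.model = model

    def get_expert_map(self, layer_id: int):
        return self.model.get_expert_map(layer_id)

    def do_update_expert_map(self, layer_id: int, new_map):
        raise NotImplementedError  # engine-specific weight movement
```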
@@ -773,6 +775,32 @@ def load_weights(self, weights: Iterable[tuple[str,

        return loaded_params

    def get_expert_map(self, layer_id):
vLLM has the MixtureOfExperts interface; once we contribute this to vLLM, these functions should be moved there.
Also, what about the Qwen MoE model?
Qwen MoE is being tested now and will be submitted in another PR.
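A hedged sketch of what such a per-layer accessor can look like on the model side; the attribute names are assumptions:

```python
import torch


def get_expert_map(self, layer_id: int) -> torch.Tensor:
    """Return the logical->physical expert placement tensor for one layer.

    Sketch of a model method only: assumes the model keeps its MoE layers
    in self.moe_layers and each layer exposes an expert_map tensor.
    """
    return self.moe_layers[layer_id].expert_map
```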
    for name in self.expert_weight_names]
)

# def collect_topk_ids(self, dummy_run=False):
remove the commented-out code
done
class DynamicTable:
    # workload_table:
    # 3-D matrix, [layer, gpus, experts_per_gpu_per_layer] -> value: hotness of the expert at that position
use English
done
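A sketch of the workload table that comment describes, with illustrative method names; the hotness values are modeled as routed-token counters:

```python
import torch


class DynamicTable:
    """Per-slot hotness counters: [num_layers, num_gpus, experts_per_gpu]."""

    def __init__(self, num_layers: int, num_gpus: int, experts_per_gpu: int):
        # value at [layer, gpu, slot] = tokens routed to that expert slot
        self.workload_table = torch.zeros(
            (num_layers, num_gpus, experts_per_gpu), dtype=torch.int64)

    def record(self, layer: int, gpu: int, slot: int, tokens: int) -> None:
        self.workload_table[layer, gpu, slot] += tokens

    def hottest_slot(self, layer: int) -> tuple[int, int]:
        # flatten [gpu, slot] and recover the 2-D index of the max entry
        flat = int(torch.argmax(self.workload_table[layer]))
        return divmod(flat, self.workload_table.shape[2])
```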
vllm_ascend/eplb/tool/eplb_utils.py
Outdated
import torch
import random


class ExpertMapUtils():
Using a class here is meaningless; plain module-level functions would do.
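The suggestion, sketched as a module-level function instead of a stateless utility class; the round-robin body is an illustrative assumption, not the PR's actual mapping logic:

```python
import torch


def generate_expert_map(num_gpus: int, num_experts: int) -> torch.Tensor:
    """Round-robin logical experts across GPUs; -1 marks experts not hosted."""
    experts_per_gpu = num_experts // num_gpus
    expert_map = torch.full((num_gpus, num_experts), -1, dtype=torch.int64)
    for gpu in range(num_gpus):
        start = gpu * experts_per_gpu
        # local slot indices 0..experts_per_gpu-1 for this GPU's experts
        expert_map[gpu, start:start + experts_per_gpu] = torch.arange(experts_per_gpu)
    return expert_map
```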
@@ -0,0 +1,65 @@
import numpy as np
move this file to the examples folder
done
vllm_ascend/eplb/tool/eplb_utils.py
Outdated
@@ -0,0 +1,114 @@
#
remove the tool folder
done
@@ -0,0 +1,408 @@
#
The worker module has only one file; I think the module is unnecessary.
the worker module has been removed
@@ -0,0 +1,39 @@
#
vllm_ascend/eplb/__init__.py is missing
fixed
    return list(zip(send_all, recv_all, maps, log2phy_all, layer_ids))


class EplbProcess:
What will happen if the EplbProcess goes down in a worker?
EPLB will not update anymore; however, forwarding continues.
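That failure mode can be made concrete with a sketch: the balancer runs as a daemon process, and if it dies the worker simply keeps serving with the last applied expert map. Names and structure here are assumptions, not the PR's code:

```python
import multiprocessing as mp


def _eplb_loop(queue) -> None:
    while True:
        workload = queue.get()  # block until the worker sends fresh stats
        # ... compute a new expert placement and publish it back ...


class EplbProcess:
    def __init__(self) -> None:
        self.queue = mp.Queue()
        self.proc = mp.Process(target=_eplb_loop, args=(self.queue,), daemon=True)
        self.proc.start()

    def submit(self, workload) -> None:
        if self.proc.is_alive():
            self.queue.put(workload)
        # else: the balancer is down, so rebalancing stops,
        # but the worker's forward pass is unaffected
```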
Force-pushed from af52d72 to c66e4ce
Force-pushed from c66e4ce to b3417f6
Merge 'whq-v091-new' of https://github.com/raindaywhu/vllm-ascend into whq-v091-new: simplify eplb policy
add swift balancer doc
address review comments
# Conflicts: # vllm_ascend/eplb/tool/eplb_utils.py
fix commits
fix import path
fix import
… into whq-v091-new * 'whq-v091-new' of https://github.com/845473182/vllm-ascend: fix import; fix param bug; fix param bug; fix registration reference error; fix registration reference error
fix lint errors
Signed-off-by: raindaywhu <raindaywhu@163.com>
What this PR does / why we need it?
#### Dynamic Experts load balance for MoE LLM Models
Co-authored-by: wanghanqingLYT <hqwang12345@sina.com>
Co-authored-by: njuyuan <yuanjl19@smail.nju.edu.cn>
Co-authored-by: qmkakaxi <wjh1594260677@qq.com>
Co-authored-by: Skywalker-EP <173723846@qq.com>
Co-authored-by: ZhengWG <zwg0606@gmail.com>
Co-authored-by: GuoXiYuan <496444320@qq.com>
Co-authored-by: zyy-hw <zhangyuanyun@huawei.com>
Co-authored-by: ltdo111 <1061328217@qq.com>
Fix commits ci of pr #1855
### Does this PR introduce _any_ user-facing change?
### How was this patch tested?
---------
Signed-off-by: raindaywhu <raindaywhu@163.com>
Signed-off-by: wanghanqingLYT <wanghanqing3@huawei.com>
Co-authored-by: wanghanqingLYT <wanghanqing3@huawei.com>
What this PR does / why we need it?
Dynamic expert load balancing for MoE LLM models.
Does this PR introduce any user-facing change?
How was this patch tested?