[Perf] Reduce memory usage by splitting tokens in fused_experts and avoiding unused tensor#833
Closed
ApsarasX wants to merge 2 commits intovllm-project:mainfrom
Closed
[Perf] Reduce memory usage by splitting tokens in fused_experts and avoiding unused tensor#833ApsarasX wants to merge 2 commits intovllm-project:mainfrom
ApsarasX wants to merge 2 commits intovllm-project:mainfrom