add dispatch layout for a2 #103

luanyundu · 2025-09-20T10:00:23Z

add dispatch layout for a2

gemini-code-assist · 2025-09-20T10:00:55Z

Summary of Changes

Hello @luanyundu, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request integrates support for a new dispatch layout specifically designed for the Ascend910B (A2) compute unit. The core objective is to optimize the token and expert distribution mechanism for A2 machines by introducing A2-specific parameters, a dedicated kernel implementation, and dynamic dispatching logic. These changes ensure that the system can leverage the unique characteristics of the A2 hardware for improved efficiency in deep learning workloads.

Highlights

A2 Platform Support: This pull request introduces specific dispatch layout logic and configurations tailored for the Ascend910B (A2) compute unit, enabling optimized performance on this hardware.
Extended aclnnDispatchLayout Operator: The aclnnDispatchLayout operator has been extended to accept new parameters, local_ranksize and a total_data tensor, which are crucial for A2-specific data handling and distribution.
Dedicated A2 Kernel Implementation: A new kernel, MoeDispatchLayoutA2::DispatchLayoutA2, has been added. This dedicated implementation contains specialized logic for efficiently processing dispatch layouts on the A2 architecture.
Dynamic Kernel Dispatch: The dispatch_layout kernel now dynamically selects between the generic and the newly introduced A2-specific implementations based on the detected platform, ensuring the correct optimization is applied.
Configuration and Tiling Updates: CMake presets have been updated to target ascend910b, and the tiling data structures and logic have been enhanced to incorporate local_ranksize and platform-specific checks for A2.

Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature	Command	Description

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩

gemini-code-assist

Code Review

This pull request adds support for a new hardware architecture, "a2" (Ascend 910B), for the dispatch_layout operation. This is a significant change, introducing a new device kernel and modifying code across the C++ stack. My review focuses on improving code clarity, maintainability, and reducing potential errors. Key suggestions include replacing magic numbers with named constants, refactoring duplicated configuration code, correcting a copy-paste error, centralizing constants, and simplifying complex pointer arithmetic in the new kernel to enhance readability and correctness.

csrc/deepep/ops/op_kernel/dispatch_layout_a2.h

csrc/deepep/deep_ep.cpp

csrc/deepep/ops/op_host/dispatch_layout_tiling.cc

csrc/deepep/ops/op_kernel/dispatch_layout.cpp

csrc/deepep/deep_ep.cpp

csrc/deepep/deep_ep.hpp

csrc/deepep/deep_ep.cpp

…pu-old into a2_layour * 'a2_layour' of https://github.com/luanyundu/sgl-kernel-npu-old: integrate A2 and A3 interface

csrc/deepep/deep_ep.cpp

add dispatch layout for a2

85d9980

gemini-code-assist bot reviewed Sep 20, 2025

View reviewed changes

luanyundu force-pushed the a2_layour branch 7 times, most recently from 56c4873 to f251b43 Compare September 23, 2025 04:09

integrate A2 and A3 interface

11f8090

luanyundu force-pushed the a2_layour branch from f251b43 to 11f8090 Compare September 23, 2025 04:13

Yael-X reviewed Sep 24, 2025

View reviewed changes

csrc/deepep/deep_ep.cpp Outdated Show resolved Hide resolved

csrc/deepep/deep_ep.cpp Outdated Show resolved Hide resolved

csrc/deepep/deep_ep.cpp Outdated Show resolved Hide resolved

csrc/deepep/deep_ep.hpp Outdated Show resolved Hide resolved

zuje123 reviewed Sep 26, 2025

View reviewed changes

csrc/deepep/deep_ep.cpp Outdated Show resolved Hide resolved

linzihan and others added 5 commits September 28, 2025 16:19

integrate A2 and A3 interface

4a20325

Merge branch 'a2_layour' of https://github.com/luanyundu/sgl-kernel-n…

981bb22

…pu-old into a2_layour * 'a2_layour' of https://github.com/luanyundu/sgl-kernel-npu-old: integrate A2 and A3 interface

Merge branch 'main' into a2_layour

8c683a5

fix lint problems

c138e74

fix lint problems

2d499c4

luanyundu force-pushed the a2_layour branch 2 times, most recently from 4d197d6 to ff4ab5e Compare September 29, 2025 01:29

Yael-X reviewed Oct 9, 2025

View reviewed changes

csrc/deepep/deep_ep.cpp Outdated Show resolved Hide resolved

fix lint problems

c156c5d

luanyundu force-pushed the a2_layour branch 2 times, most recently from 761f4a1 to 464669e Compare October 11, 2025 07:47

Merge branch 'main' into a2_layour

87e2650

luanyundu force-pushed the a2_layour branch from 464669e to 87e2650 Compare October 11, 2025 08:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

add dispatch layout for a2 #103

add dispatch layout for a2 #103

Uh oh!

luanyundu commented Sep 20, 2025

Uh oh!

gemini-code-assist bot commented Sep 20, 2025

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

add dispatch layout for a2 #103

Are you sure you want to change the base?

add dispatch layout for a2 #103

Uh oh!

Conversation

luanyundu commented Sep 20, 2025

Uh oh!

gemini-code-assist bot commented Sep 20, 2025

Summary of Changes

Highlights

Footnotes

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants