Skip to content

[Performance]: Custom AscendC Kernel of Multi-Step Prepare Input #814

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 11 commits into
base: main
Choose a base branch
from

Conversation

wonderful199082
Copy link

@wonderful199082 wonderful199082 commented May 12, 2025

What this PR does / why we need it?

Does this PR introduce any user-facing change?

no user-facing change

How was this patch tested?

we add Unit Test file and offline inference file to test the custom ascendc kernel. See test/ops/test_multi_step.py and examples/offline_multi_step.py

@wonderful199082 wonderful199082 changed the title docs: revise the comments of kv_cache shape Custom AscendC Kernel of Multi-Step Prepare Input May 13, 2025
@wonderful199082 wonderful199082 changed the title Custom AscendC Kernel of Multi-Step Prepare Input [Performance]: Custom AscendC Kernel of Multi-Step Prepare Input May 13, 2025
@wonderful199082 wonderful199082 force-pushed the main branch 4 times, most recently from 2cac2cd to 82c0e2c Compare May 13, 2025 10:11
Signed-off-by: wan_danfeng <wonderful199082@126.com>
Signed-off-by: wan_danfeng <wonderful199082@126.com>
Signed-off-by: wan_danfeng <wonderful199082@126.com>
Signed-off-by: wan_danfeng <wonderful199082@126.com>
Signed-off-by: wan_danfeng <wonderful199082@126.com>
Signed-off-by: wan_danfeng <wonderful199082@126.com>
Signed-off-by: wan_danfeng <wonderful199082@126.com>
Signed-off-by: wan_danfeng <wonderful199082@126.com>
Signed-off-by: wan_danfeng <wonderful199082@126.com>
@wangxiyuan
Copy link
Collaborator

Add CopyIn to CODESPELL_IGNORE_WORDS to fix codespell issue. https://github.com/search?q=repo%3Avllm-project%2Fvllm-ascend%20CODESPELL_IGNORE_WORDS&type=code

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants