Skip to content

[Feat][Graph] Support FULL_DECODE_ONLY mode for GQA/MHA models #9328

[Feat][Graph] Support FULL_DECODE_ONLY mode for GQA/MHA models

[Feat][Graph] Support FULL_DECODE_ONLY mode for GQA/MHA models #9328

Re-run triggered August 15, 2025 09:15
Status Failure
Total duration 1h 5m 15s
Artifacts

vllm_ascend_test.yaml

on: pull_request
Matrix: singlecard e2e test
Matrix: unit test
Matrix: multicard e2e test
Fit to window
Zoom out
Zoom in

Annotations

3 errors
multicard e2e test (linux-aarch64-a2-2, main)
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
multicard e2e test (linux-aarch64-a2-2, main)
Process completed with exit code 1.
multicard e2e test (linux-aarch64-a2-2, main)
Error: failed to run script step: command terminated with non-zero exit code: Error executing in Docker Container: 1