[Feat][Graph] Support FULL_DECODE_ONLY
mode for GQA/MHA models
#9328
vllm_ascend_test.yaml
on: pull_request
Matrix: singlecard e2e test
Matrix: unit test
Matrix: multicard e2e test
Annotations
3 errors
multicard e2e test (linux-aarch64-a2-2, main)
Executing the custom container implementation failed. Please contact your self hosted runner administrator.
|
multicard e2e test (linux-aarch64-a2-2, main)
Process completed with exit code 1.
|
multicard e2e test (linux-aarch64-a2-2, main)
Error: failed to run script step: command terminated with non-zero exit code: Error executing in Docker Container: 1
|