[Perf] Add new npu_fused_infer_attention_score op to improve performance in splitfuse cases and resolve long-seq mask problems #12887

Triggered via pull request on September 20, 2025 04:16
Status: Cancelled
Total duration: 28m 2s
Artifacts

vllm_ascend_test.yaml

on: pull_request
Matrix: singlecard e2e test - light
Matrix: unit test
Matrix: multicard e2e test - light
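
The run comes from the vllm_ascend_test.yaml workflow, which fans out into the three job matrices listed above on every pull_request. Below is a minimal sketch of how such a matrix layout could be expressed; the runner label linux-aarch64-a2-2 and vllm version v0.10.2 are taken from the annotations later in this page, while the job names, test paths, checkout action, and the unit-test runner are hypothetical placeholders, not the contents of the actual workflow file.

```yaml
# Hypothetical sketch of the matrix layout; not the actual vllm_ascend_test.yaml.
name: vllm_ascend_test

on:
  pull_request:

jobs:
  unit-test:
    strategy:
      matrix:
        vllm_version: [v0.10.2]        # version seen in the run's annotations
    runs-on: ubuntu-latest             # assumed runner for unit tests
    steps:
      - uses: actions/checkout@v4
      - run: pytest tests/ut           # assumed test path

  singlecard-e2e-light:
    strategy:
      matrix:
        vllm_version: [v0.10.2]
    runs-on: linux-aarch64-a2          # assumed single-card self-hosted label
    steps:
      - uses: actions/checkout@v4
      - run: pytest tests/e2e/singlecard   # assumed test path

  multicard-e2e-light:
    strategy:
      matrix:
        runner: [linux-aarch64-a2-2]   # self-hosted label seen in the annotations
        vllm_version: [v0.10.2]
    runs-on: ${{ matrix.runner }}
    steps:
      - uses: actions/checkout@v4
      - run: pytest tests/e2e/multicard    # assumed test path
```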

Annotations

4 errors

unit test (v0.10.2): Process completed with exit code 1.
multicard e2e test - light (linux-aarch64-a2-2, v0.10.2): Executing the custom container implementation failed. Please contact your self hosted runner administrator.
multicard e2e test - light (linux-aarch64-a2-2, v0.10.2): Canceling since a higher priority waiting request for test-refs/pull/2962/merge exists
test: Canceling since a higher priority waiting request for test-refs/pull/2962/merge exists