[Perf] Add new npu_fused_infer_attention_score op to improve performance in splitfuse cases and resolve long-seq mask problems #4025

Triggered via pull request September 20, 2025 08:30
Status Cancelled
Total duration 8m 58s
Artifacts 1

image_a3_openeuler.yml

on: pull_request
vllm-ascend image build
8m 52s

Annotations

1 error and 1 warning
image / openEuler / a3
Canceling since a higher priority waiting request for image / openEuler / a3-refs/pull/2962/merge exists
vllm-ascend image build
The command [sudo apt-get remove -y azure-cli google-chrome-stable firefox powershell mono-devel libgl1-mesa-dri --fix-missing] failed to complete successfully. Proceeding...

Artifacts

Produced during runtime
Name: vllm-project~vllm-ascend~M9OP35.dockerbuild
Size: 84 KB
Digest: sha256:55f9c42a6670838ef4867e95ae661b0c1c93fdb9e7a72fb70f331561b5fb85f2