[Feature] Reduce host memory usage for attention mask generation #12877

Triggered via pull request September 20, 2025 00:09
Status Failure
Total duration 36m 30s

vllm_ascend_test.yaml

on: pull_request
Matrix: singlecard e2e test - light
Matrix: unit test
Matrix: multicard e2e test - light

Annotations

1 error
unit test (v0.10.2)
Process completed with exit code 1.