test-full

[bugfix][torchair] fix wasted NPU memory buffer allocation for quantized deepseek with unquantized MTP layer #616

Triggered via pull request September 20, 2025 14:38

labeled #3068

linfeng-yuan:fix_torchair_redundant_process_group

Status Cancelled

Total duration 9s

Artifacts –

vllm_ascend_test_full.yaml

on: pull_request

Matrix: multicard e2e test - full

Matrix: singlecard e2e test - full

Annotations

1 error

Canceling since a higher priority waiting request for test-full-refs/pull/3068/merge exists