
[bugfix][torchair] fix wasted NPU memory buffer allocation for quantized deepseek with unquantized MTP layer #1266

Triggered via pull request on September 20, 2025 at 14:38
Status: Cancelled
Total duration: 1s

nightly_benchmarks.yaml

on: pull_request
Matrix: test
Waiting for pending jobs

Annotations

1 error
Benchmarks / Performance
Canceling since a higher priority waiting request for static-8-01-cards exists
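
The cancellation message above is what GitHub Actions reports when a run is still pending in a concurrency group and a newer run queues for the same group. The fragment below is only a sketch of how such a group could be declared. It is not the actual contents of nightly_benchmarks.yaml: the job layout, the runner label, and the step are placeholders, and only the group name static-8-01-cards is taken from the annotation.

```yaml
# Hypothetical sketch; not the real nightly_benchmarks.yaml.
name: Benchmarks

on: pull_request

jobs:
  test:
    name: Performance
    runs-on: ubuntu-latest        # placeholder; the real runner label is unknown
    concurrency:
      # Only one run in this group executes at a time, and at most one
      # more may wait. A newly queued run replaces the waiting one, which
      # is then reported as cancelled.
      group: static-8-01-cards
      # false: a run that is already executing is left to finish.
      cancel-in-progress: false
    steps:
      - run: echo "benchmark steps would go here"   # placeholder step
```

Under this assumption, a run cancelled after 1s never started any benchmark work. It was displaced while waiting, so re-running it once the group is free should be enough.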