[bugfix][torchair] fix wasted NPU memory buffer allocation for quantized deepseek with unquantized MTP layer #1266
nightly_benchmarks.yaml
on: pull_request
Matrix: test
Waiting for pending jobs
Annotations
1 error
Benchmarks / Performance
Canceling since a higher priority waiting request for static-8-01-cards exists
|