[Fix] fix resources limit error when apply speculative decoding and aclgraph #6534
Artifacts
Produced during runtime
Name | Size | Digest | |
---|---|---|---|
vllm-ascend-ubuntu-24.04-arm-py3.11-wheel
|
491 KB |
sha256:60c37b63d4ba9367cd35fdfe40ee109fc8ff9ce71f9328335db97768cbb5f566
|
|
vllm-ascend-ubuntu-24.04-py3.11-wheel
|
500 KB |
sha256:7554da733e78c4cdee1729553af00f04d204766a144f33c3c039d16bf7a87534
|
|