[3/N][Feat][Graph] Support all-to-all
and quantized models with ACL Graph
#6426
Artifacts
Produced during runtime
Name | Size | Digest | |
---|---|---|---|
vllm-ascend-ubuntu-24.04-arm-py3.11-wheel
|
490 KB |
sha256:d6bd005341171319af952db81d0ba29e247704d7e83391dddb841a9e3b0a4ca8
|
|
vllm-ascend-ubuntu-24.04-py3.11-wheel
|
499 KB |
sha256:ef4aee41db91fafb116bd62212b9fbecca3666b32d45b28db3f5672491a62a08
|
|