[3/N][Feat][Graph] Support all-to-all
and quantized models with ACL Graph
#6425
Artifacts
Produced during runtime
Name | Size | Digest | |
---|---|---|---|
vllm-ascend-ubuntu-24.04-arm-py3.11-wheel
|
490 KB |
sha256:f290aac25d636eae710d5018ffea982c68783156d86cdd3fff492c3e046211da
|
|
vllm-ascend-ubuntu-24.04-py3.11-wheel
|
499 KB |
sha256:ae4b396a92b646c5ff9a0212212051669667e14485232946c8ba2eadfa06567c
|
|