[3/N][Feat][Graph] Support all-to-all
and quantized models with ACL Graph
#6555
Artifacts
Produced during runtime
Name | Size | Digest | |
---|---|---|---|
vllm-ascend-ubuntu-24.04-arm-py3.11-wheel
|
493 KB |
sha256:a787bb4a92c03619928d23d2493da06c9647445e61789acd3543830f6ba25bd2
|
|
vllm-ascend-ubuntu-24.04-py3.11-wheel
|
502 KB |
sha256:d9ebee4e375349fccee03e9cba365e56e5142593d773bd2b0ea8a35348100dac
|
|