[3/N][Feat][Graph] Support all-to-all and quantized models with ACL Graph
#6553
Artifacts
Produced during runtime
Name | Size | Digest
---|---|---
vllm-ascend-ubuntu-24.04-arm-py3.11-wheel | 493 KB | sha256:06fa31331fc279c511c38b376e5ecf60263eebc5c320db2d3688dec6294481ed
vllm-ascend-ubuntu-24.04-py3.11-wheel | 502 KB | sha256:d9c47cd713bb97a79b0e00a256d75f3f22ab09fd2a6b05c7f77843c7da25f7eb