Actions: vllm-project/vllm-ascend
830 workflow runs
FULL_DECODE_ONLY mode for GQA/MHA models
ascend test / full #740: Pull request #2128 synchronize by yiz-liu