Actions: vllm-project/vllm-ascend
Actions
9,223 workflow run results
9,223 workflow run results
W8A8_DYNAMIC
quantized MoE layers
ascend test
#9341:
Pull request #2275
synchronize
by
zhoux77899
FULL_DECODE_ONLY
mode for GQA/MHA models
ascend test
#9328:
Pull request #2128
synchronize
by
yiz-liu
FULL_DECODE_ONLY
mode for GQA/MHA models
ascend test
#9326:
Pull request #2128
synchronize
by
yiz-liu