Actions: vllm-project/vllm-ascend
Actions
9,223 workflow run results
9,223 workflow run results
W8A8_DYNAMIC
quantized MoE layers
ascend test
#9363:
Pull request #2275
synchronize
by
zhoux77899
W8A8_DYNAMIC
quantized MoE layers
ascend test
#9357:
Pull request #2275
synchronize
by
zhoux77899
FULL_DECODE_ONLY
mode for GQA/MHA models
ascend test
#9354:
Pull request #2128
synchronize
by
yiz-liu