Skip to content

[Feat][Graph] Support FULL_DECODE_ONLY mode for GQA/MHA models #5901

[Feat][Graph] Support FULL_DECODE_ONLY mode for GQA/MHA models

[Feat][Graph] Support FULL_DECODE_ONLY mode for GQA/MHA models #5901

Triggered via pull request September 19, 2025 06:54
@yiz-liuyiz-liu
synchronize #2128
Status Success
Total duration 13s
Artifacts

format_pr_body.yaml

on: pull_request_target
update vLLM version
10s
update vLLM version
Fit to window
Zoom out
Zoom in

Annotations

1 warning
update vLLM version
The `python-version` input is not set. The version of Python currently in `PATH` will be used.