[Feat][Graph] Support FULL_DECODE_ONLY mode for GQA/MHA models #5901
Job | Run time |
---|---|
10s | |
10s |