[Feat][Graph] Support FULL_DECODE_ONLY
mode for GQA/MHA models
#9682
Job | Run time |
---|---|
4s | |
4s |
FULL_DECODE_ONLY
mode for GQA/MHA models
#9682
Job | Run time |
---|---|
4s | |
4s |