[Feat][Graph] Support FULL_DECODE_ONLY
mode for GQA/MHA models
#9572
Job | Run time |
---|---|
5s | |
5s |
FULL_DECODE_ONLY
mode for GQA/MHA models
#9572
Job | Run time |
---|---|
5s | |
5s |