[Feat][Graph] Support FULL_DECODE_ONLY mode for GQA/MHA models
#9328
| Job | Run time |
|---|---|
| | 5s |
| | 5m 8s |
| | 9m 31s |
| | 43m 56s |
| | 0s |
| | 5s |
| | 5m 8s |
| | 44m 57s |
| | 9m 31s |
| | 19m 11s |
| Total | 2h 17m 32s |