[Feat][Graph] Support FULL_DECODE_ONLY
mode for GQA/MHA models
#9354
Job | Run time |
---|---|
4s | |
4m 59s | |
10m 21s | |
42m 42s | |
0s | |
58m 6s |
FULL_DECODE_ONLY
mode for GQA/MHA models
#9354
Job | Run time |
---|---|
4s | |
4m 59s | |
10m 21s | |
42m 42s | |
0s | |
58m 6s |