You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[V0.9.1] torchair_graph bugfix when chunked_prefill is true (#1748)
### What this PR does / why we need it?
when torchair_graph and chunked_prefill are both true, save the
decode kv_cache.
### Does this PR introduce _any_ user-facing change?
### How was this patch tested?
---------
Signed-off-by: fems14 <1804143737@qq.com>
Signed-off-by: SlightwindSec <slightwindsec@gmail.com>
Co-authored-by: SlightwindSec <slightwindsec@gmail.com>
0 commit comments