You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Summary:
X-link: facebookresearch/FBGEMM#1879
Pull Request resolved: #4848
The current KV padding only suppported full prefill case (D78967317). This diff adds partial prefill support as well. Coverage added in the tests.
WIP: upstreaming this. ( D78967317 and this diff)
Reviewed By: sryap
Differential Revision: D82080682
fbshipit-source-id: 7a6c7a0d3c32245e5c13864b1f0cfe37d8d254c4
Copy file name to clipboardExpand all lines: fbgemm_gpu/experimental/gen_ai/src/attention/cuda/cutlass_blackwell_fmha/kernel/sm100_fmha_fwd_kernel_tma_warpspecialized.hpp
0 commit comments