Skip to content

v0.2.4

Latest
Compare
Choose a tag to compare
@jiazhihao jiazhihao released this 29 Mar 02:05
· 22 commits to main since this release
68ff606

What's Changed

Fingerprint

Grace Hopper Support

  • Grace Hopper: let users assign tasks to different warp groups by @xinhaoc in #165
  • Set num_warp_groups and pipeline_stages with default value in generate_cuda_program() by @xinhaoc in #179
  • Fix MMA Threadlayout issue by @xinhaoc in #197
  • Hopper: Add bf16 and fix some corner cases by @xinhaoc in #198

QWen2.5 Demo

New operators

Triton backend

Others

New Contributors

Full Changelog: v0.2.3...v0.2.4