Skip to content

v0.3.0

Choose a tag to compare

@tharapalanivel tharapalanivel released this 10 Jun 16:01
· 204 commits to main since this release
7467f68

Highlights

  1. AIU support: new example added for model conversion for AIU (see examples/AIU_CONVERSION folder) and new add-ons for fms
  2. triton kernel for specialized matmul HW simulation and verification
  3. microscaling format support by integrating functionalities from microsoft mx package (see examples/MX for more details)
  4. other upgrades and improvements:
    • qmodel_prep tracing speed improvement, e.g., for Llama3-70B the time has been reduced from ~20min to ~2min now
    • Upgrade base dependencies to torch 2.5, python 3.12 and migrated from auto_gptq to gptqmodel

What's Changed

New Contributors

Full Changelog: v0.2.0...v0.3.0