Theorem411
Contributor

Description of changes:
Redo demo_hopper/ with a complete redesign of the PTQ/calibration workflow. The examples in demo_hopper/ (e.g. gqa.py) demonstrate how the mugraphs are changed to include scaling factors.

Add a kernel-level rowwise scaling operator to the transpiler runtime.
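
For readers unfamiliar with rowwise scaling, here is a minimal reference sketch in plain Python of what such an operator computes and how per-row scaling factors could come out of an absmax calibration pass. It is not the code in this PR: the names calibrate_row_scales, quantize_rowwise and scale_rows, the qmax=127 range, and the absmax rule are all assumptions made for illustration.

```python
from typing import List

Matrix = List[List[float]]


def calibrate_row_scales(x: Matrix, qmax: float = 127.0) -> List[float]:
    """Hypothetical calibration: one scale per row, absmax(row) / qmax."""
    scales = []
    for row in x:
        absmax = max((abs(v) for v in row), default=0.0)
        # All-zero (or empty) rows get a scale of 1.0 to avoid dividing by zero.
        scales.append(absmax / qmax if absmax > 0.0 else 1.0)
    return scales


def quantize_rowwise(x: Matrix, scales: List[float], qmax: float = 127.0) -> Matrix:
    """Divide each row by its scale, round, and clamp to [-qmax, qmax]."""
    return [
        [max(-qmax, min(qmax, round(v / s))) for v in row]
        for row, s in zip(x, scales)
    ]


def scale_rows(x: Matrix, scales: List[float]) -> Matrix:
    """Rowwise scaling: out[i][j] = x[i][j] * scales[i]."""
    return [[v * s for v in row] for row, s in zip(x, scales)]


if __name__ == "__main__":
    x = [[1.0, -2.0, 0.5], [0.0, 0.0, 0.0], [10.0, 5.0, -10.0]]
    scales = calibrate_row_scales(x)
    q = quantize_rowwise(x, scales)
    # Applying the rowwise scale to the quantized values approximates x.
    print(scale_rows(q, scales))
```

In this sketch the scaling step is a separate pass so that it can follow any upstream computation on the quantized values; each recovered element differs from the original by at most half of its row's scale.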

Related Issues:

Linked Issues:

  • Issue #

Issues closed by this PR:

  • Closes #
