Optimized compression for FP8 modes #3748
src/nncf/quantization/algorithms/weight_compression/fp8_conversion.py
bd421c7 to d5cc3f8
Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.
Pull request overview
Copilot reviewed 16 out of 16 changed files in this pull request and generated 1 comment.
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
…avelyevv/nncf into ns/f8e4m3-optimized-compression
Changes
Added optimized OpenVINO weights compression for fp8e4m3 data type.
optimum-cli export openvino time:
Reason for changes
UX improvement.
Tests
Extended existing tests.
https://github.com/openvinotoolkit/nncf/actions/runs/19767009608
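
For readers unfamiliar with the fp8e4m3 data type named under Changes, here is a minimal pure-Python sketch of what compressing a group of weights to that format can look like. It is not taken from this PR and does not use NNCF or OpenVINO APIs. It assumes the OCP E4M3 variant (1 sign bit, 4 exponent bits, 3 mantissa bits, largest finite value 448) and a per-group scale that maps the largest absolute weight to 448. The names `round_to_e4m3` and `compress_group` are hypothetical; the actual implementation may differ.

```python
import math

# Assumed format parameters (OCP E4M3 variant): 3 mantissa bits,
# smallest normal exponent -6, largest finite value 448.
E4M3_MAX = 448.0
MIN_NORMAL_EXP = -6
MANTISSA_BITS = 3


def round_to_e4m3(x: float) -> float:
    """Round x to the nearest value on the E4M3 grid, saturating at +/-448."""
    if x == 0.0 or math.isnan(x):
        return x
    sign = -1.0 if x < 0 else 1.0
    mag = min(abs(x), E4M3_MAX)
    # frexp gives mag = m * 2**e with 0.5 <= m < 1, so floor(log2(mag)) = e - 1.
    # Clamping the exponent at -6 makes small inputs land on the subnormal grid.
    exp = max(math.frexp(mag)[1] - 1, MIN_NORMAL_EXP)
    step = 2.0 ** (exp - MANTISSA_BITS)  # spacing of representable values here
    # Python's round() rounds halves to even, like IEEE round-to-nearest-even.
    return sign * min(round(mag / step) * step, E4M3_MAX)


def compress_group(weights):
    """Hypothetical helper: scale a group so max |w| maps to 448, then round."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / E4M3_MAX if max_abs > 0 else 1.0
    return [round_to_e4m3(w / scale) for w in weights], scale


codes, scale = compress_group([0.5, -1.0, 0.25])
restored = [c * scale for c in codes]  # dequantized approximation of the input
```

A per-element Python loop like this is only meant to show the arithmetic; an optimized path would presumably run the same scaling and rounding as vectorized operations.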