Properly handle incomplete calibration for static quantization#2460

Open
wpietka wants to merge 6 commits into wpietkax/add-gemma-quantization-test from wpietkax/fix-incomplete-static-quantization
Conversation


@wpietka wpietka commented Apr 30, 2026

Type of Change

Bug fix

Description

Currently, when the calibration function for static quantization receives an incomplete input sample that does not activate all model layers during calibration, certain layers are effectively broken because their scales are set to 0.

This change detects whether quantization failed for a given layer and restores the original call() methods and weights, so the layer behaves exactly as if it had not been quantized at all.
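The recovery idea can be sketched as follows. This is an illustrative stand-in, not the actual neural_compressor implementation: the class, attribute names, and the zero-scale check are all hypothetical, but they show the pattern of keeping a handle to the original float path and restoring it when calibration never reached a layer.

```python
# Hypothetical sketch: if calibration never activates a layer, its observed
# scale stays 0.0, and quantizing with that scale would zero the outputs.
# Detect that case and restore the original call() and weights instead.

class QuantizedDense:
    """Minimal stand-in for a statically quantized layer (illustrative only)."""

    def __init__(self, weights):
        self.weights = weights        # original float weights
        self._orig_call = self.call   # keep a handle to the float call path
        self.scale = 0.0              # populated by the calibration observer

    def call(self, x):
        # Original (float) forward pass.
        return [w * x for w in self.weights]

    def calibrate(self, x):
        # A real observer would track max(|activation|); 0.0 means
        # "this layer was never activated during calibration".
        self.scale = max(self.scale, abs(x))

    def quantize(self):
        if self.scale == 0.0:
            # Calibration never reached this layer: recover the original
            # call() and keep the float weights untouched.
            self.call = self._orig_call
            return False
        # Otherwise replace weights and call() with the quantized versions.
        self.weights = [round(w / self.scale) for w in self.weights]
        self.call = lambda x: [w * self.scale * x for w in self.weights]
        return True
```

A layer whose `quantize()` returns `False` then behaves exactly as before quantization, which is the expected behavior this PR describes.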

Expected Behavior & Potential Risk

The output of the quantized model will be correct even if quantization fails for certain layers.

How has this PR been tested?

pytest test/jax/test_gemma3_model.py::test_static_quantization_with_incomplete_calibration

Dependency Change?

No dependencies changed

@wpietka wpietka force-pushed the wpietkax/add-gemma-quantization-test branch 2 times, most recently from 3385be8 to b07f4ff Compare May 4, 2026 13:40
@wpietka wpietka force-pushed the wpietkax/fix-incomplete-static-quantization branch 2 times, most recently from 36d5100 to 942cdf3 Compare May 5, 2026 08:20
@wpietka wpietka marked this pull request as ready for review May 5, 2026 08:28

@anko-intel anko-intel left a comment


I would like to avoid adding so many load_own_variables_preprocess calls.
Let's discuss it offline. We can probably postpone the post_quantization_cleanup actions into load_own_variables.

Comment thread test/jax/test_gemma3_model.py Outdated
Comment thread neural_compressor/jax/quantization/layers_static.py
@wpietka wpietka force-pushed the wpietkax/fix-incomplete-static-quantization branch 2 times, most recently from e1b1040 to 709bc31 Compare May 7, 2026 09:14
Signed-off-by: Wojciech Piętka <wojciechx.pietka@intel.com>
Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

@anko-intel anko-intel left a comment


It looks very good, thanks.

Comment thread neural_compressor/jax/quantization/layers_dynamic.py Outdated
wpietka and others added 5 commits May 8, 2026 04:09
Signed-off-by: Wojciech Piętka <wojciechx.pietka@intel.com>
Signed-off-by: Wojciech Piętka <wojciechx.pietka@intel.com>
Signed-off-by: Wojciech Piętka <wojciechx.pietka@intel.com>
Signed-off-by: Wojciech Piętka <wojciechx.pietka@intel.com>