[Bugfix] Fix test fused quant layernorm tests #27865

ElizaWszola · 2025-10-31T07:56:37Z

Fix optional vs. bool type issue in test_fused_quant_layernorm.py. This allows int8 tests to run correctly.

TODO: fix unit test failures

Signed-off-by: ElizaWszola <ewszola@redhat.com>

Signed-off-by: yewentao256 <zhyanwentao@126.com>

yewentao256

After we fix the scale_ub case in unit test, we now face three known issues.

precision error for int8
precision error for fp8
illegal memory access without setting scale_ub

And I fix all of them in my commits

reserve core]$ pytest test_fused_quant_layernorm.py -x
============= test session starts ==============
platform linux -- Python 3.12.11, pytest-8.4.2, pluggy-1.6.0
rootdir: /home/yewentao256/vllm-source
configfile: pyproject.toml
plugins: anyio-4.11.0
collected 576 items                            

test_fused_quant_layernorm.py .......... [  1%]
........................................ [  8%]
........................................ [ 15%]
........................................ [ 22%]
........................................ [ 29%]
........................................ [ 36%]
........................................ [ 43%]
........................................ [ 50%]
........................................ [ 57%]
........................................ [ 64%]
........................................ [ 71%]
........................................ [ 78%]
........................................ [ 85%]
........................................ [ 92%]
........................................ [ 98%]
......                                   [100%]

= 576 passed, 2 warnings in 160.03s (0:02:40) ==

Signed-off-by: ElizaWszola <ewszola@redhat.com>

…or int8 Signed-off-by: ElizaWszola <ewszola@redhat.com>

Signed-off-by: ElizaWszola <ewszola@redhat.com>

ElizaWszola and others added 4 commits October 31, 2025 03:53

Fix test fused quant layernorm tests

512e00e

Signed-off-by: ElizaWszola <ewszola@redhat.com>

Merge branch 'main' into fix-fused-quant-layernorm-tests

f885487

add fallback

a618d91

Signed-off-by: yewentao256 <zhyanwentao@126.com>

fix IMA issue

f3d4a9e

Signed-off-by: yewentao256 <zhyanwentao@126.com>

yewentao256 marked this pull request as ready for review November 3, 2025 16:45

yewentao256 requested review from WoosukKwon, mgoin, tlrmchlsmth and yewentao256 as code owners November 3, 2025 16:45

yewentao256 added the ready ONLY add when PR is ready to merge/full CI is needed label Nov 3, 2025

yewentao256 reviewed Nov 3, 2025

View reviewed changes

Comment about fp8 precision

a25bf16

Signed-off-by: ElizaWszola <ewszola@redhat.com>

ElizaWszola added a commit to neuralmagic/vllm that referenced this pull request Nov 4, 2025

Apply quant layer norm fixes from vllm-project#27865, inv scale fix f…

b3a55fd

…or int8 Signed-off-by: ElizaWszola <ewszola@redhat.com>

Lower tol on int8 scales

152e69a

Signed-off-by: ElizaWszola <ewszola@redhat.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bugfix] Fix test fused quant layernorm tests #27865

[Bugfix] Fix test fused quant layernorm tests #27865

ElizaWszola commented Oct 31, 2025 •

edited by github-actions bot

Loading

Uh oh!

yewentao256 left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

[Bugfix] Fix test fused quant layernorm tests #27865

Are you sure you want to change the base?

[Bugfix] Fix test fused quant layernorm tests #27865

Conversation

ElizaWszola commented Oct 31, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yewentao256 left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

ElizaWszola commented Oct 31, 2025 •

edited by github-actions bot

Loading