[webgpu] Fix `GatherBlockQuantized` on Intel ADL/TGL platforms #26526

daijh · 2025-11-07T06:46:37Z

Description

The GatherBlockQuantized operation was using incorrect data_indices during execution on Intel Alder Lake (ADL) and Tiger Lake (TGL) platforms.

This change sets the proper data_indices, resolving correctness issues encountered with the Phi-4-mini model on these architectures.

Motivation and Context

See above.

daijh · 2025-11-07T06:48:42Z

The first commit is only the early draft to demonstrate the fixing.
To be refined in following commits.

daijh · 2025-11-10T01:33:09Z

@guschmue @fs-eire PTAL.

fs-eire · 2025-11-10T23:10:54Z

Thanks for the fix! LGTM

onnxruntime/contrib_ops/webgpu/quantization/gather_block_quantized.cc

fs-eire · 2025-11-12T00:40:41Z

/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2025-11-12T00:40:58Z

Azure Pipelines successfully started running 4 pipeline(s).

guschmue · 2025-11-12T04:11:45Z

tested on tgl with qwen3-0.6b and int4 embeddings - works!

daijh · 2025-11-12T05:32:31Z

Not sure why CI failed. Probably kick off a retry.

guschmue · 2025-11-13T17:15:05Z

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline

azure-pipelines · 2025-11-13T17:15:25Z

Azure Pipelines successfully started running 4 pipeline(s).

guschmue · 2025-11-13T17:16:41Z

/azp run web_Release / build_onnxruntime_web,web_Debug / build_onnxruntime_web,Test Linux TensorRT x64 Release,Test Linux CUDA x64 Release

azure-pipelines · 2025-11-13T17:16:48Z

No pipelines are associated with this pull request.

[webgpu] Fix GatherBlockQuantized on Intel ADL/TGL platforms

ba2423c

Update

d6a6058

fs-eire previously approved these changes Nov 10, 2025

View reviewed changes

fs-eire reviewed Nov 10, 2025

View reviewed changes

onnxruntime/contrib_ops/webgpu/quantization/gather_block_quantized.cc Outdated Show resolved Hide resolved

Get gather axis dim from niforms

72439a4

daijh dismissed fs-eire’s stale review via 72439a4 November 11, 2025 03:10

guschmue added the ep:WebGPU ort-web webgpu provider label Nov 11, 2025

fs-eire approved these changes Nov 12, 2025

View reviewed changes

guschmue approved these changes Nov 12, 2025

View reviewed changes

guschmue closed this Nov 13, 2025

guschmue reopened this Nov 13, 2025

guschmue merged commit d6219b6 into microsoft:main Nov 13, 2025
157 of 184 checks passed

daijh deleted the fix-gather-int4 branch November 18, 2025 11:06

[webgpu] Fix GatherBlockQuantized on Intel ADL/TGL platforms #26526

[webgpu] Fix GatherBlockQuantized on Intel ADL/TGL platforms #26526

Uh oh!

Conversation

daijh commented Nov 7, 2025

Description

Motivation and Context

Uh oh!

daijh commented Nov 7, 2025

Uh oh!

daijh commented Nov 10, 2025

Uh oh!

fs-eire commented Nov 10, 2025

Uh oh!

Uh oh!

fs-eire commented Nov 12, 2025

Uh oh!

azure-pipelines bot commented Nov 12, 2025

Uh oh!

guschmue commented Nov 12, 2025

Uh oh!

daijh commented Nov 12, 2025

Uh oh!

guschmue commented Nov 13, 2025

Uh oh!

azure-pipelines bot commented Nov 13, 2025

Uh oh!

guschmue commented Nov 13, 2025

Uh oh!

azure-pipelines bot commented Nov 13, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

[webgpu] Fix `GatherBlockQuantized` on Intel ADL/TGL platforms #26526

[webgpu] Fix `GatherBlockQuantized` on Intel ADL/TGL platforms #26526