
Conversation

@daijh
Contributor

@daijh daijh commented Nov 7, 2025

Description

The GatherBlockQuantized operation was using incorrect data_indices during execution on Intel Alder Lake (ADL) and Tiger Lake (TGL) platforms.

This change sets the proper data_indices, resolving correctness issues encountered with the Phi-4-mini model on these architectures.
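For readers unfamiliar with the operation, the following is a minimal sketch of what a block-quantized gather has to compute, and why a wrong element index corrupts the output. It is not the onnxruntime WebGPU implementation and does not reproduce this PR's fix. The function names, the packing order (two uint4 values per byte, low nibble first), the 2-D layout with gather on axis 0, and the default zero point of 8 are all assumptions made for illustration.

```python
# Hypothetical reference sketch, NOT the onnxruntime implementation.
# Assumptions: 2-D uint4 data packed two values per byte (low nibble first),
# gather along axis 0, quantization blocks along the last axis,
# and a single zero point (8 by default) shared by all blocks.

def unpack_uint4(packed, flat_index):
    """Return the 4-bit value stored at flat element index `flat_index`."""
    byte = packed[flat_index // 2]
    if flat_index % 2:
        return (byte >> 4) & 0x0F  # odd element: high nibble
    return byte & 0x0F             # even element: low nibble


def gather_block_quantized(packed, cols, indices, scales, block_size, zero_point=8):
    """Gather rows of a packed uint4 matrix and dequantize them block-wise."""
    blocks_per_row = (cols + block_size - 1) // block_size
    out = []
    for row in indices:
        dequantized_row = []
        for col in range(cols):
            # Element index into the logical (unpacked) data tensor.
            # If this index is wrong, the wrong nibble is read and the
            # dequantized values are silently incorrect.
            data_index = row * cols + col
            scale = scales[row * blocks_per_row + col // block_size]
            q = unpack_uint4(packed, data_index)
            dequantized_row.append((q - zero_point) * scale)
        out.append(dequantized_row)
    return out
```

For example, with a 2x4 matrix whose rows hold the quantized values `[8, 9, 10, 11]` and `[0, 1, 2, 15]`, calling `gather_block_quantized([0x98, 0xBA, 0x10, 0xF2], 4, [1, 0], [1.0, 0.5, 2.0, 1.0], 2)` returns row 1 followed by row 0, each dequantized with its own per-block scale.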

Motivation and Context

See above.

@daijh
Contributor Author

daijh commented Nov 7, 2025

The first commit is only an early draft to demonstrate the fix.
It will be refined in follow-up commits.

@daijh
Contributor Author

daijh commented Nov 10, 2025

@guschmue @fs-eire PTAL.

fs-eire previously approved these changes Nov 10, 2025
@fs-eire
Contributor

fs-eire commented Nov 10, 2025

Thanks for the fix! LGTM

@guschmue guschmue added the ep:WebGPU ort-web webgpu provider label Nov 11, 2025
@fs-eire
Contributor

fs-eire commented Nov 12, 2025

/azp run Linux QNN CI Pipeline, Win_TRT_Minimal_CUDA_Test_CI, Windows ARM64 QNN CI Pipeline, Windows GPU Doc Gen CI Pipeline

@azure-pipelines

Azure Pipelines successfully started running 4 pipeline(s).

@guschmue
Contributor

Tested on TGL with qwen3-0.6b and int4 embeddings: works!

@daijh
Contributor Author

daijh commented Nov 12, 2025

Not sure why CI failed. It probably just needs a retry.

@guschmue guschmue closed this Nov 13, 2025
@guschmue guschmue reopened this Nov 13, 2025
@guschmue
Contributor

/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline

@azure-pipelines

Azure Pipelines successfully started running 4 pipeline(s).

@guschmue
Contributor

/azp run web_Release / build_onnxruntime_web,web_Debug / build_onnxruntime_web,Test Linux TensorRT x64 Release,Test Linux CUDA x64 Release

@azure-pipelines

No pipelines are associated with this pull request.

@guschmue guschmue merged commit d6219b6 into microsoft:main Nov 13, 2025
157 of 184 checks passed
@daijh daijh deleted the fix-gather-int4 branch November 18, 2025 11:06

Labels

ep:WebGPU ort-web webgpu provider


3 participants