Skip to content

Pull requests: intel/neural-compressor

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add Gemma3 quantization test
#2459 opened Apr 30, 2026 by wpietka Contributor Draft
[JAX] Enable dot_product_attention usage in MultiHeadAttention
#2458 opened Apr 29, 2026 by anko-intel Contributor Loading…
Add batch size and gradient accumulation parameters to quantization s…
#2456 opened Apr 28, 2026 by xin3he Contributor Loading…
Fix activation scale inf issue for const_weight and const_scale
#2448 opened Apr 15, 2026 by qgao007 Contributor Loading…
Bump the pip group across 7 directories with 1 update dependencies Pull requests that update a dependency file python Pull requests that update Python code
#2445 opened Apr 8, 2026 by dependabot Bot Loading…
Cherry pick v1.24.0
#2439 opened Mar 31, 2026 by xin3he Contributor Loading…
Add mark_step() for the sliced FusedSDPA
#2395 opened Jan 23, 2026 by yangulei Loading…
Split fp8_fused_sdpa into two phases
#2346 opened Nov 26, 2025 by czhu15 Loading…
[WIP] Add mlperf example for whisper
#2343 opened Nov 25, 2025 by lkk12014402 Contributor Loading…
[Draft] Enable slicing of fp8 FusedSDPA for APC
#2340 opened Nov 20, 2025 by yangulei Loading…
[WIP] Add mlperf examples
#2338 opened Nov 18, 2025 by lkk12014402 Contributor Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.