System Info
transformers.js: 3.7.5
Environment/Platform
- Website/web-app
- Browser extension
- Server-side (e.g., Node.js, Deno, Bun)
- Desktop app (e.g., Electron)
- Other (e.g., VSCode extension)
Description
We fine-tuned unsloth/gemma-3-4b-it and merged the LoRA adapters into the base model in bf16 to prepare for ONNX export.
HF sources
- Base: https://huggingface.co/unsloth/gemma-3-4b-it
- Our merged model: https://huggingface.co/azizbekabdullaev/gemma-3-4b-it
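For reference, the merge step looked roughly like this. This is a sketch assuming a standard PEFT LoRA adapter; the adapter path and output folder name are illustrative, and the model class is our best guess for the multimodal 4B checkpoint:

```python
# Sketch of the LoRA merge in bf16 (adapter path is a placeholder).
import torch
from transformers import AutoTokenizer, Gemma3ForConditionalGeneration
from peft import PeftModel

base = Gemma3ForConditionalGeneration.from_pretrained(
    "unsloth/gemma-3-4b-it", torch_dtype=torch.bfloat16
)
model = PeftModel.from_pretrained(base, "path/to/lora-adapter")
merged = model.merge_and_unload()  # fold the LoRA weights into the base model

merged.save_pretrained("Gemma3-4B-SFT-merged-bf16")
AutoTokenizer.from_pretrained("unsloth/gemma-3-4b-it").save_pretrained(
    "Gemma3-4B-SFT-merged-bf16"
)
```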
What we tried
- Transformers.js conversion script:
  `python -m scripts.convert --model_id azizbekabdullaev/gemma-3-4b-it --quantize`
  Result: conversion fails during the ONNX export for Gemma-3.
- onnxruntime-genai model builder:
  `python -m onnxruntime_genai.models.builder -i "C:\models\Gemma3-4B-SFT-merged-bf16" -o "C:\models\gemma3-4B-webgpu" -e webgpu -p int4`
  This produced a WebGPU int4 (Q4) package, but it does not run in Transformers.js (see the layout check after this list).
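As far as we can tell, Transformers.js wants a Hugging Face-style folder with the ONNX weights under an `onnx/` subfolder (e.g. `onnx/model_q4.onnx` for `dtype: "q4"`) next to `config.json` and `tokenizer.json`, whereas the GenAI builder emits a flat `model.onnx` plus `genai_config.json`. A minimal sketch of how to check the builder output against that layout (the expected file list is our assumption, not a documented contract):

```python
# Sketch: check whether a local folder matches the layout Transformers.js
# appears to expect. File names below are assumptions based on the hosted
# ONNX repos, not a documented spec.
from pathlib import Path

repo = Path(r"C:\models\gemma3-4B-webgpu")
expected = ["config.json", "tokenizer.json", "onnx/model_q4.onnx"]
missing = [name for name in expected if not (repo / name).exists()]
print("Missing for Transformers.js:", missing or "none")
```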
Expected
A path to convert Gemma-3-4B-IT to a Transformers.js-compatible ONNX package for WebGPU text-generation.
Reproduction
```bash
# Try official conversion
python -m scripts.convert --model_id azizbekabdullaev/gemma-3-4b-it --quantize
```
Then attempt to load in JS:
```js
import { pipeline } from "@huggingface/transformers";

const pipe = await pipeline("text-generation", "./models/gemma-3-4b-it", {
  device: "webgpu",
  dtype: "q4",
});
```
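(Aside: we assume loading a local folder in the browser also needs `env.allowLocalModels = true`, with relative paths resolved against `env.localModelPath`; happy to be corrected if that is part of the problem.)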