[MODEL] support qwen3.5 series w/o vision by JJJYmmm · Pull Request #869 · ml-explore/mlx-lm

JJJYmmm · 2026-02-10T17:25:42Z

This PR adds model support for the upcoming Qwen3.5 models, including both dense and MoE variants.

It's a refined version of #861 by @johnmai-dev.

Reference HF implementation - huggingface/transformers#43830

Co-authored-by: johnmai-dev <johnmai-dev@users.noreply.github.com>

johnmai-dev · 2026-02-10T17:48:55Z

Do we need to add support for qwen3_5_text and qwen3_5_moe_text?

https://github.com/huggingface/transformers/blob/42791a34fdeae197f60f11ace3807c81f44b0729/src/transformers/models/auto/modeling_auto.py#L356-L357

JJJYmmm · 2026-02-10T18:12:12Z

IMO it's not needed: we'll only release the VLM checkpoints, so following previous VLMs (e.g. qwen3vlmoe) should be OK.

awni · 2026-02-10T21:32:36Z

mlx_lm/models/qwen3_5.py

+            if any(k.endswith(sfx) for sfx in norm_keys):
+                if v.ndim == 1:
+                    weights[k] = v + 1.0


I think this is a bug. The sanitize function is called every time an MLX model is loaded, so if you convert the model (which calls sanitize) and then run it (which calls sanitize again), you will add 1.0 to these values twice.

Instead, we should only apply this offset once. An easy way to do that is a condition that tells you whether the model has already been sanitized (for example, whether the "mtp" layer is in the weights, or something similar).
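For illustration, a minimal sketch of the kind of guard suggested here. It is not the code that was merged: plain Python lists stand in for `mx.array`, the key names are made up, and the `"mtp."` marker prefix is an assumption about which keys exist only in the original checkpoint.

```python
# Hypothetical stand-ins: lists of floats instead of mx.array, and made-up
# key names. The actual sanitize in mlx_lm/models/qwen3_5.py may differ.
NORM_SUFFIXES = (".input_layernorm.weight", ".post_attention_layernorm.weight")
MARKER_PREFIX = "mtp."  # assumed: keys present only before the first sanitize


def sanitize(weights):
    """Add 1.0 to norm weights only if the checkpoint looks unsanitized."""
    needs_shift = any(k.startswith(MARKER_PREFIX) for k in weights)
    out = {}
    for k, v in weights.items():
        if k.startswith(MARKER_PREFIX):
            continue  # dropping the marker turns later calls into no-ops
        if needs_shift and k.endswith(NORM_SUFFIXES):
            v = [x + 1.0 for x in v]
        out[k] = v
    return out


raw = {
    "mtp.fc.weight": [0.5],
    "model.layers.0.input_layernorm.weight": [0.0, 0.25],
}
once = sanitize(raw)    # what convert would do
twice = sanitize(once)  # what a later load would do
```

With this guard the second call leaves the norm weights unchanged, because the marker keys were removed by the first call.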

JJJYmmm

Got it, updated the sanitize logic and added a test 🫡

awni

Looks great!

awni · 2026-02-11T15:11:01Z

@JJJYmmm should we go ahead and merge this? Have you tested it on an actual model yet?

JJJYmmm · 2026-02-11T15:24:38Z

I’ve tested it on preview ckpts, so it’s fine to merge now. I’ll check if it still works when the official version drops. 🤗

JJJYmmm and others added 2 commits February 11, 2026 01:11

support text-only qwen3.5 series

63fef0e

Co-authored-by: johnmai-dev <johnmai-dev@users.noreply.github.com>

add test

ae8cf11

johnmai-dev mentioned this pull request Feb 10, 2026

Adding Support for Qwen3.5 #861

Closed

1 task

johnmai-dev mentioned this pull request Feb 10, 2026

Adding Support for Qwen3.5 ml-explore/mlx-swift-lm#97

Draft

4 tasks

awni reviewed Feb 10, 2026

awni mentioned this pull request Feb 10, 2026

Add Qwen3.5 MoE #862

Closed

JJJYmmm added 2 commits February 11, 2026 12:01

fix sanitize and add test

3b54f0d

make it more readable

3f20611

awni approved these changes Feb 11, 2026

fix lint

2dc22b6

awni merged commit 0fd3126 into ml-explore:main Feb 12, 2026
2 checks passed

BrewTestBot mentioned this pull request Feb 12, 2026

mlx-lm 0.30.7 Homebrew/homebrew-core#267270

Merged

1 task

awni merged 5 commits into ml-explore:main from JJJYmmm:add_qwen3_5
