Pull requests: ml-explore/mlx-lm
#870 Fix dynamic_quant for MoE and VL models (opened Feb 10, 2026 by Taderich73; 3 tasks done)
#849 server: add usage.prompt_tokens_details.cached_tokens to json response (opened Feb 6, 2026 by percontation)
#848 refactor: use time.perf_counter() for duration measurements (opened Feb 6, 2026 by m92y)
#841 feat: enhance chat CLI with readline history, line editing, and distributed support (opened Feb 3, 2026 by Vlor999)
#774 Move more activation functions to the activations module (opened Jan 19, 2026 by Goekdeniz-Guelmez)
#710 feat: allow callers to cancel stream generation via callback check, and ensure prompt cache consistency (opened Dec 30, 2025 by zhutao100)
#688 Add tokenizer_config support to LoRA YAML configuration (opened Dec 17, 2025 by breitburg)
#677 Add server logging and stop generation on client disconnect (opened Dec 15, 2025 by otarkhan)
#595 Add optional accuracy reporting while training and evaluating models (opened Nov 7, 2025 by jyork03)
#589 Fix: mlx-server for chunked requests (to support one-api, curl) (opened Nov 3, 2025 by yiakwy-xpu-ml-framework-team)
#582 Normalize scheduler/warmup step-like arguments by grad_accumulation_steps; warmup-only configuration, sane argument defaults and tests (opened Nov 1, 2025 by jyork03)