Pull requests: huggingface/transformers

Pull requests list

Fix flash attention crash with 3D position_ids (Qwen3.5)
#44911 opened Mar 21, 2026 by ouroborosscr
2 of 5 tasks
Fix: Update optimization.py
#44909 opened Mar 21, 2026 by anshuS1310
Remove unnecessary expand_as in get_placeholder_mask across VLMs
#44907 opened Mar 21, 2026 by syncdoth
7 tasks done
Support SizeDict import in get_size_dict
#44903 opened Mar 21, 2026 by yonigozlan
[docs] continuous batching
#44896 opened Mar 20, 2026 by stevhliu
Add static FP8 expert support
#44895 opened Mar 20, 2026 by SunMarc
add StaticLayer.crop() to match DynamicLayer API
#44893 opened Mar 20, 2026 by ai-man-codes
2 of 5 tasks
[Trainer] add MoERouterHealthCallback Callback
#44891 opened Mar 20, 2026 by kashif
5 tasks
Add big angry code agent warnings!
#44890 opened Mar 20, 2026 by Rocketknight1
[DeepSpeed] Fix evaluate()/predict() before train()
#44889 opened Mar 20, 2026 by roycho96
2 of 5 tasks
Remove explicit cuda stream in nemotron_h
#44888 opened Mar 20, 2026 by Cyrilvallez
Allow arbitrary template kwargs in processors
#44881 opened Mar 20, 2026 by zucchini-nlp
incorrect model list update
#44880 opened Mar 20, 2026 by itazap
refactor: unify QA calls
#44879 opened Mar 20, 2026 by tarekziade