-
Notifications
You must be signed in to change notification settings - Fork 3.1k
Pull requests: sgl-project/sglang
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Auto Sync] Update scheduler.py (20251017)
high priority
run-ci
#11738
opened Oct 17, 2025 by
zhyncs
Loading…
Revert "Set csgmv as default lora backend. (#11488)"
high priority
run-ci
#11735
opened Oct 16, 2025 by
zhyncs
Loading…
4 tasks
chore: bump sgl-kernel version to 0.3.16.post3
run-ci
#11733
opened Oct 16, 2025 by
sglang-bot
Loading…
add tuned fuse moe kernel for qwen3 235b fp8 on h200
#11730
opened Oct 16, 2025 by
pdasgup
Loading…
4 tasks
Support mrope triton kernel and add unit test
run-ci
#11722
opened Oct 16, 2025 by
yuan-luo
Loading…
4 tasks
[WIP][sgl-kernel] support flashmla libtorch
high priority
run-ci
#11717
opened Oct 16, 2025 by
FlamingoPg
Loading…
1 of 4 tasks
feat(example/fastapi): support --startup-timeout using Qwen3-Next-80B-A3B-Instruct as example
#11710
opened Oct 16, 2025 by
Kindyaa
Loading…
4 tasks
[Fix] fix type issue of env flag value MODELOPT_MAX_TOKENS_PER_EXPERT
run-ci
#11709
opened Oct 16, 2025 by
zejunchen-zejun
Loading…
Support running FP4 Deepseek on SM120.
run-ci
#11708
opened Oct 16, 2025 by
weireweire
Loading…
2 of 4 tasks
[sgl-kernel] enhance sgl-kernel import logic for sm8x
run-ci
#11707
opened Oct 16, 2025 by
FlamingoPg
Loading…
1 of 4 tasks
[quantization][MoE] fix the check for A PR may be merged without a full CI check
run-ci
tp_size
/ moe_ep_size
/ moe_intermediate_size
/ weight_block_size_n
express-lane
#11702
opened Oct 16, 2025 by
kevin85421
Loading…
1 of 4 tasks
[Test] support llm-compressor: w8a8_fp8_block, wNa16
#11701
opened Oct 16, 2025 by
Wangzheee
Loading…
4 tasks
[router] Add Configurable L0 and L1 Tokenizer Caching
enhancement
New feature or request
router
router-benchmark
run-ci
#11688
opened Oct 16, 2025 by
slin1237
Loading…
2 of 4 tasks
[Lint] Add
python/sglang
to ruff F401 checks and remove unused imports in files
run-ci
#11685
opened Oct 15, 2025 by
CatherineSue
Loading…
1 of 4 tasks
wip: Remove redundant fill_(0) in dp_scatter
run-ci
#11683
opened Oct 15, 2025 by
ch-wan
Loading…
4 tasks
[Bug fix] fix Qwen3-VL dense model launch failure caused by rotary-embedding
#11675
opened Oct 15, 2025 by
coco-alen
Loading…
4 tasks
Previous Next
ProTip!
Mix and match filters to narrow down what you’re looking for.