-
-
Notifications
You must be signed in to change notification settings - Fork 9.3k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[Core][BugFix] Fix thread safety issue in RequestOutputCollector
ready
ONLY add when PR is ready to merge/full CI is needed
v1
#22576
opened Aug 9, 2025 by
22quinn
Loading…
3 of 4 tasks
[Misc] Replace flaky image urls in pixtral test
multi-modality
Related to multi-modality (#4194)
#22574
opened Aug 9, 2025 by
Isotr0py
Loading…
1 of 4 tasks
optimize: improve scheduler policy lookup performance
v1
#22573
opened Aug 9, 2025 by
skyloevil
Loading…
[Core] Use individual MM items in P0/P1 cache and model runner
multi-modality
Related to multi-modality (#4194)
tpu
Related to Google TPUs
v1
#22570
opened Aug 9, 2025 by
DarkLight1337
Loading…
1 of 4 tasks
[Misc] code clean duplicate set_current_vllm_config in _set_vllm_config
#22566
opened Aug 9, 2025 by
andyxning
Loading…
4 tasks
Frontend: Adding LM Format Enforcer support to V1 engine
ci/build
structured-output
v1
#22564
opened Aug 9, 2025 by
noamgat
Loading…
4 tasks
[BugFix] EAGLE Load Bias From Config
llama
Related to Llama models
speculative-decoding
#22558
opened Aug 9, 2025 by
MMuzzammil1
Loading…
[gpt-oss] Add test for response API + harmony (but skipped)
#22554
opened Aug 9, 2025 by
heheda12345
Loading…
3 of 4 tasks
[Misc] fail fast when exception is raised in in_the_same_node_as
#22553
opened Aug 8, 2025 by
andyxning
Loading…
4 tasks
[Bugfix][V1] Fix Finished Request Handling in Async Scheduling
bug
Something isn't working
v1
#22543
opened Aug 8, 2025 by
leo-cf-tian
Loading…
New moe quant config
ci/build
deepseek
Related to DeepSeek models
documentation
Improvements or additions to documentation
needs-rebase
rocm
Related to AMD ROCm
Fix(benchmarks): allow multiple mm contents in OpenAI Chat Completion Benchmarks
performance
Performance-related issues
#22534
opened Aug 8, 2025 by
h-brenoskuk
Loading…
Improve fast_topk function with type hints and documentation
#22530
opened Aug 8, 2025 by
skyloevil
Loading…
Remove redundant row_indices unsqueeze operation in MiniCPMO
#22528
opened Aug 8, 2025 by
skyloevil
Loading…
Quantization: support FP4 quantized models on AMD CDNA2/CDNA3 GPUs
ci/build
rocm
Related to AMD ROCm
#22527
opened Aug 8, 2025 by
fengli1702
Loading…
4 tasks done
[Fix] fix offline env use local mode path
#22526
opened Aug 8, 2025 by
lengrongfu
Loading…
1 of 4 tasks
[WIP] [Bench] Add Triton NVFP4 GEMM
performance
Performance-related issues
#22523
opened Aug 8, 2025 by
phuhung273
•
Draft
2 of 4 tasks
[ROCm][AITER] Support AITER Rope ops in RotaryEmbedding Module.
rocm
Related to AMD ROCm
#22521
opened Aug 8, 2025 by
vllmellm
Loading…
3 of 4 tasks
[WIP][Model] Add Ernie4.5 VL Model Support
documentation
Improvements or additions to documentation
new-model
Requests to new models
#22514
opened Aug 8, 2025 by
CSWYF3634076
•
Draft
Previous Next
ProTip!
Adding no:label will show everything without a label.