-
Notifications
You must be signed in to change notification settings - Fork 247
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[0.9.1][BugFix] Fix the failure to recognize the actual type of quantization
module:ops
#1721
opened Jul 10, 2025 by
rjg-lyh
Loading…
[AscendScheduler][Bugfix] Remove num_draft_tokens while allocating slots
documentation
Improvements or additions to documentation
#1718
opened Jul 10, 2025 by
MengqingCao
Loading…
[BUGFIX] [v0.9.1-dev] Obtain the NPU ID of non-consecutive NPU cards
#1717
opened Jul 10, 2025 by
yangqinghao-cmss
Loading…
[v0.9.1]add rot_pos_emb()/get_window_index()/_process_image_input() to qwen2.5_vl_without_padding
#1705
opened Jul 9, 2025 by
zheliuyu
Loading…
[V0.9.1] Replace FA interface with FA_V2 to optimize perf in SelfAttention
#1701
opened Jul 9, 2025 by
rjg-lyh
Loading…
feat: Qwen3-dense model support dual-batch overlap(dbo)
#1699
opened Jul 9, 2025 by
ZhaoJiangJiang
Loading…
[WIP] dynamic eplb
merge-conflicts
module:core
module:ops
module:quantization
#1697
opened Jul 9, 2025 by
wanghanqingLYT
Loading…
support fa3 quant for v0.9.1-dev
module:quantization
module:tests
#1695
opened Jul 9, 2025 by
22dimensions
Loading…
fix: use dist.reduce_scatter_tensor to avoid memory leak
merge-conflicts
module:ops
#1688
opened Jul 9, 2025 by
NeverRaR
Loading…
[WIP][Prefill Performance] Parallel Strategy Optimizations (VRAM-for-Speed Tradeoff)
merge-conflicts
module:ops
module:quantization
#1687
opened Jul 9, 2025 by
SlightwindSec
Loading…
[V0 Deprecation] Remove V0 prompt adapter
merge-conflicts
#1683
opened Jul 9, 2025 by
shen-shanshan
Loading…
[Dist][EP] Remove ETP/EP maintained in vllm-ascend
documentation
Improvements or additions to documentation
merge-conflicts
module:core
module:ops
module:quantization
module:tests
#1681
opened Jul 9, 2025 by
MengqingCao
Loading…
[0.9.1][WIP][Feat] Restore paged attention kernel in Full Graph for performence
module:tests
#1677
opened Jul 8, 2025 by
yiz-liu
Loading…
update 091 eplb
merge-conflicts
module:core
module:ops
module:quantization
module:tests
#1676
opened Jul 8, 2025 by
shiyuan680
Loading…
[feat]add shared expert feature
merge-conflicts
module:core
module:ops
module:quantization
#1668
opened Jul 8, 2025 by
sadatama
Loading…
Upstream 091 eplb dynamic
ci/build
documentation
Improvements or additions to documentation
merge-conflicts
module:core
module:ops
module:quantization
module:tests
#1665
opened Jul 8, 2025 by
shiyuan680
Loading…
[Test] Add unittests for multi stream
module:tests
#1662
opened Jul 8, 2025 by
SunnyLee151064
Loading…
[Refactor] Refactor torchair
merge-conflicts
module:core
module:ops
module:quantization
module:tests
#1661
opened Jul 8, 2025 by
wangxiyuan
Loading…
[Draft][WIP][Feature]cpu offload connector
merge-conflicts
#1659
opened Jul 7, 2025 by
lidenghui1110
Loading…
New 091
ci/build
documentation
Improvements or additions to documentation
merge-conflicts
module:core
module:ops
module:quantization
module:tests
#1658
opened Jul 7, 2025 by
shiyuan680
Loading…
[CI/Build] Upgrade CANN to 8.2.RC1.alpha003
accuracy-test
enable all accuracy test for PR
documentation
Improvements or additions to documentation
merge-conflicts
ready-for-test
start test by label for PR
#1653
opened Jul 7, 2025 by
MengqingCao
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.