-
Notifications
You must be signed in to change notification settings - Fork 662
Pull requests: vllm-project/vllm-ascend
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[E2E] Fix and optimize e2e test.
ready
read for review
ready-for-test
start test by label for PR
#5091
opened Dec 16, 2025 by
menogrey
Loading…
[Pangu][MoE] Remove PanguProMoEV1 related code
merge-conflicts
#5088
opened Dec 16, 2025 by
Pr0Wh1teGivee
Loading…
[WIP] [Doc]Add the user_guide doc file regarding fine-grained TP.
#5084
opened Dec 16, 2025 by
zzhx1
Loading…
[Refactor] 4/N Distinguish the branches based on the applicable scenarios of pagedAttention and fusedInferAttention.
module:tests
#5081
opened Dec 16, 2025 by
weijinqian0
Loading…
[Graph][Fusion]Add new pattern for AddRmsnormQuant with SP.
ready
read for review
ready-for-test
start test by label for PR
#5077
opened Dec 16, 2025 by
Angazenn
Loading…
[Doc] Add new contributors and relative scripts.
documentation
Improvements or additions to documentation
module:tools
#5070
opened Dec 16, 2025 by
menogrey
Loading…
Add a Mooncake installation tutorial for kv pool and update Mooncake installation tutorial
documentation
Improvements or additions to documentation
#5069
opened Dec 16, 2025 by
liziyu179
Loading…
[feat] apply flashcomm1 on multimodal_model
module:core
module:ops
#5064
opened Dec 16, 2025 by
hwhaokun
Loading…
[UT]add the UT of pcp and dcp in the attention_cp file
module:tests
#5054
opened Dec 16, 2025 by
pichangping
Loading…
Upgrade vllm commit hash to 1216
ci/build
documentation
Improvements or additions to documentation
ready
read for review
ready-for-test
start test by label for PR
#5053
opened Dec 16, 2025 by
Toneymiller
Loading…
[Bugfix]fix dsv3.1 FIA err in async_scheduling with mtp
ready
read for review
ready-for-test
start test by label for PR
#5046
opened Dec 15, 2025 by
hust17yixuan
Loading…
Add the requirement of arctic-inference which speculative decoding with suffix_decode
merge-conflicts
#5045
opened Dec 15, 2025 by
frankie-ys
Loading…
[Feature]Use DispatchGmmCombineDecode operator to replace MC2(Optional)
merge-conflicts
module:core
module:ops
module:quantization
#5040
opened Dec 15, 2025 by
wangqiankun13
Loading…
[UT]Ut for function cumsum_group_list in main (ref #5025)
module:ops
module:tests
#5036
opened Dec 15, 2025 by
Clorist33
Loading…
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.