Skip to content

Pull requests: vllm-project/vllm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat(benchmarks): support HF model names in multi-turn benchmark performance Performance-related issues
#27850 opened Oct 31, 2025 by ai-jz Loading…
[WIP][CI/Build] Fix AMD structured outputs tests OOM rocm Related to AMD ROCm structured-output v1
#27845 opened Oct 30, 2025 by zhewenl Loading…
[Bugfix] Flashinfer block size for hybrid ssm models ready ONLY add when PR is ready to merge/full CI is needed v1
#27843 opened Oct 30, 2025 by heheda12345 Draft
5 tasks
[CI] Add batch invariant test to ci ci/build ready ONLY add when PR is ready to merge/full CI is needed v1
#27842 opened Oct 30, 2025 by yewentao256 Loading…
[Feat] Drop-in Torch CUDA Profiler documentation Improvements or additions to documentation frontend v1
#27841 opened Oct 30, 2025 by benchislett Loading…
[Attention] Remove max cudagraph size limit of 992 ready ONLY add when PR is ready to merge/full CI is needed v1
#27840 opened Oct 30, 2025 by 22quinn Draft
5 tasks
Batch invariance doc documentation Improvements or additions to documentation
#27839 opened Oct 30, 2025 by bwasti Loading…
[Spec Decode] Fix EAGLE + DP bug speculative-decoding v1
#27837 opened Oct 30, 2025 by MatthewBonanni Loading…
3 of 5 tasks
[WIP] enable_autorun_on_ready_for_eval ci/build
#27836 opened Oct 30, 2025 by hl475 Draft
5 tasks
[CI/Build] Set test case to run two different containers on the same host ci/build ready ONLY add when PR is ready to merge/full CI is needed
#27835 opened Oct 30, 2025 by amdfaa Loading…
5 tasks
[Kimi-Linear] Correct prefixes and add compatibility to AWQ quants
#27834 opened Oct 30, 2025 by toncao Loading…
4 tasks
[Test] Adjust abort sleep time to reduce AsyncLLM test flake ready ONLY add when PR is ready to merge/full CI is needed v1
#27827 opened Oct 30, 2025 by njhill Loading…
[Cleanup] Remove no-longer-used SpeculativeConfig.enable_chunked_prefill frontend ready ONLY add when PR is ready to merge/full CI is needed
#27826 opened Oct 30, 2025 by njhill Loading…
Docs update tpu install instructions documentation Improvements or additions to documentation tpu Related to Google TPUs
#27824 opened Oct 30, 2025 by RobMulla Loading…
4 of 5 tasks
Simplify vLLM deployment on AWS with new Ansible playbooks and step-by-step instructions & video guide documentation Improvements or additions to documentation
#27820 opened Oct 30, 2025 by rlopez133 Loading…
2 tasks
Adding SplitK in fused_moe_lora kernel
#27818 opened Oct 30, 2025 by yugong333 Loading…
5 tasks
[Refactor] FP8 Linear Ops
#27814 opened Oct 30, 2025 by vllmellm Draft
5 tasks
ProTip! Mix and match filters to narrow down what you’re looking for.