- 
          
- 
                Notifications
    You must be signed in to change notification settings 
- Fork 10.9k
Pull requests: vllm-project/vllm
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
      feat(benchmarks): support HF model names in multi-turn benchmark
        
              
                performance
  Performance-related issues 
        
      
    
      
  
        
          #27850
            opened Oct 31, 2025  by
            ai-jz
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [NIXL][XPU] Pin NIXL version to 0.7.0
        
              
                kv-connector
        
      
    
      
  
        
          #27849
            opened Oct 31, 2025  by
            zhenwei-intel
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [WIP] Add general correctness check script for vLLM endpoints
      
    
        
          #27848
            opened Oct 31, 2025  by
            jasonlizhengjian
            
        
        
            
    •
    
      Draft
    
  
        
          
   
        
      
    
      
        
      
      
  
    5 tasks
  
      [Bugfix] Skip gs:// model paths for speculator detection
      
    
      
  
        
          #27846
            opened Oct 30, 2025  by
            pwschuurman
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [WIP][CI/Build] Fix AMD structured outputs tests OOM
        
              
                rocm
  Related to AMD ROCm 
              
                structured-output
              
                v1
        
      
    
        
          #27845
            opened Oct 30, 2025  by
            zhewenl
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [Bugfix] Flashinfer block size for hybrid ssm models
        
              
                ready
  ONLY add when PR is ready to merge/full CI is needed 
              
                v1
        
      
    
      
  
        
          #27843
            opened Oct 30, 2025  by
            heheda12345
            
        
        
            
    •
    
      Draft
    
  
        
          
   
        
      
    
      
        
      
      
  
    5 tasks
  
      [CI] Add batch invariant test to ci
        
              
                ci/build
              
                ready
  ONLY add when PR is ready to merge/full CI is needed 
              
                v1
        
      
    
      
  
        
          #27842
            opened Oct 30, 2025  by
            yewentao256
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [Feat] Drop-in Torch CUDA Profiler
        
              
                documentation
  Improvements or additions to documentation 
              
                frontend
              
                v1
        
      
    
      
  
        
          #27841
            opened Oct 30, 2025  by
            benchislett
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [Attention] Remove max cudagraph size limit of 992
        
              
                ready
  ONLY add when PR is ready to merge/full CI is needed 
              
                v1
        
      
    
    
      Batch invariance doc
        
              
                documentation
  Improvements or additions to documentation 
        
      
    
      
  
        
          #27839
            opened Oct 30, 2025  by
            bwasti
            
        
        
            
    
  
    Loading…
 
        
          
        
      
    
      [Spec Decode] Fix EAGLE + DP bug
        
              
                speculative-decoding
              
                v1
        
      
    
      
  
        
          #27837
            opened Oct 30, 2025  by
            MatthewBonanni
            
        
        
            
    
  
    Loading…
 
        
          
   
        
      
    
      
        
      
      
  
    3 of 5 tasks
  
      [CI/Build] Set test case to run two different containers on the same host
        
              
                ci/build
              
                ready
  ONLY add when PR is ready to merge/full CI is needed 
        
      
    
      
  
        
          #27835
            opened Oct 30, 2025  by
            amdfaa
            
        
        
            
    
  
    Loading…
 
        
          
   
        
      
    
      
        
      
      
  
    5 tasks
  
      [Kimi-Linear] Correct prefixes and add compatibility to AWQ quants
      
    
      
  
        
          #27834
            opened Oct 30, 2025  by
            toncao
            
        
        
            
    
  
    Loading…
 
        
          
   
        
      
    
      
        
      
      
  
    4 tasks
  
      [Compile] Avoid compiling the same module definition many times
      
    
        
          #27833
            opened Oct 30, 2025  by
            Lucaskabela
            
        
        
            
    •
    
      Draft
    
  
        
          
   
        
      
    
      
        
      
      
  
    3 of 5 tasks
  
      [Test] Adjust abort sleep time to reduce AsyncLLM test flake
        
              
                ready
  ONLY add when PR is ready to merge/full CI is needed 
              
                v1
        
      
    
      
  
        
          #27827
            opened Oct 30, 2025  by
            njhill
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [Cleanup] Remove no-longer-used ONLY add when PR is ready to merge/full CI is needed 
        
      
    
      
  SpeculativeConfig.enable_chunked_prefill
        
              
                frontend
              
                ready
  
        
          #27826
            opened Oct 30, 2025  by
            njhill
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      Docs update tpu install instructions
        
              
                documentation
  Improvements or additions to documentation 
              
                tpu
  Related to Google TPUs 
        
      
    
      
  
        
          #27824
            opened Oct 30, 2025  by
            RobMulla
            
        
        
            
    
  
    Loading…
 
        
          
   
        
      
    
      
        
      
      
  
    4 of 5 tasks
  
      Simplify vLLM deployment on AWS with new Ansible playbooks and step-by-step instructions & video guide
        
              
                documentation
  Improvements or additions to documentation 
        
      
    
      
  
        
          #27820
            opened Oct 30, 2025  by
            rlopez133
            
        
        
            
    
  
    Loading…
 
        
          
   
        
      
    
      
        
      
      
  
    2 tasks
  
      [MLA] Separate Quant from unified_mla_attn op
        
              
                v1
        
      
    
        
          #27817
            opened Oct 30, 2025  by
            pavanimajety
            
        
        
            
    •
    
      Draft
    
  
        
          
   
        
      
    
      
        
      
      
  
    5 tasks
  
      [Misc] Refactor Attention kv transfer methods into decorator
      
    
      
  
        
          #27816
            opened Oct 30, 2025  by
            NickLucche
            
        
        
            
    
  
    Loading…
 
        
        
      
    
      [BugFix] fix: skip check unstreamed tool arg tokens when tool call name is present
        
              
                frontend
        
      
    
      
  
        
          #27806
            opened Oct 30, 2025  by
            llsj14
            
        
        
            
    
  
    Loading…
 
        
          
   
        
      
    
      
        
      
      
  
    5 tasks
  
Previous Next
  
  
  ProTip!
  Mix and match filters to narrow down what you’re looking for.