Skip to content

Pull requests: ModelTC/LightLLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix grouped_topk tl.sort when numel=1
#1101 opened Nov 7, 2025 by SangChengC Loading…
moe triton kernel use tma.
#1100 opened Nov 7, 2025 by hiworldwzj Loading…
add flashinfer-trtllm-ragged-prefill-attn
#1099 opened Nov 6, 2025 by SangChengC Loading…
feat: disk cache v1.0
#1098 opened Nov 5, 2025 by blueswhen Loading…
[model] Support Qwen3next
#1097 opened Nov 5, 2025 by sufubao Loading…
Add qwen3 vl
#1095 opened Nov 4, 2025 by SangChengC Loading…
Awq support
#1084 opened Oct 21, 2025 by shihaobai Loading…
fix: MTP in chunked prefill mode
#1079 opened Oct 14, 2025 by sufubao Loading…
support interns1
#1060 opened Sep 18, 2025 by xhx1022 Loading…
enable fa3 and fused_shared_experts by default
#1053 opened Sep 15, 2025 by sufubao Loading…
fix tl.where and set the default loader worker number
#1052 opened Sep 11, 2025 by sufubao Loading…
feat: Implementing Past Future Scheduler
#1048 opened Sep 8, 2025 by WuSiYu Loading…
[support] vit and llm disaggregation
#1014 opened Aug 20, 2025 by SangChengC Loading…
add fa3_mtp
#1005 opened Aug 11, 2025 by WANDY666 Loading…
Support Qwen models' dp>1 in PD
#999 opened Aug 5, 2025 by zhhangBian Loading…
add rmsnorm-add fusion kernel
#996 opened Aug 4, 2025 by theNiemand Loading…
Asynchicache
#977 opened Jul 21, 2025 by jinbiaoyu Loading…
Fp8 deepseek
#975 opened Jul 17, 2025 by blueswhen Loading…
cuda graph pool with LRU
#964 opened Jul 8, 2025 by STwangyingrui Loading…
Add fake balance for EP mode
#962 opened Jul 8, 2025 by STwangyingrui Loading…
Multimodal improve
#951 opened Jul 1, 2025 by shihaobai Loading…
feat: Support decode chunk PD serving mode
#944 opened Jun 25, 2025 by zhhangBian Loading…
Support disk radix cache
#837 opened Apr 18, 2025 by jayfeather9 Loading…
ProTip! Add no:assignee to see everything that’s not assigned.