Skip to content

Pull requests: alibaba/rtp-llm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: add memory connector
#365 opened Nov 14, 2025 by li-xiao-qing Loading…
Feature: support rocm hipgraph in py mode
#363 opened Nov 14, 2025 by muse-coder Loading…
chore: rm extra norm in py model
#362 opened Nov 14, 2025 by Vinkle-hzt Loading…
feat: sync some legacy updates
#360 opened Nov 13, 2025 by sunmiaozju Loading…
feat: optimize default setting
#358 opened Nov 13, 2025 by Bruce-Lee-LY Loading…
fix - jinja template not support break
#357 opened Nov 13, 2025 by jianglan89 Loading…
Feature/rmsnorm fuse quant and ut update
#355 opened Nov 12, 2025 by missximon Loading…
Feature/refactor fused moe
#352 opened Nov 12, 2025 by alibaba-miji Loading…
AMD sampling with random seed
#351 opened Nov 12, 2025 by CrimsonDump Loading…
refactor: multimodal process
#349 opened Nov 11, 2025 by ySingularity Loading…
fix - add group topk in pymodel for deepseek
#347 opened Nov 10, 2025 by Nancheng-11 Loading…
Develop/xuanche/kvcache refactor 5
#346 opened Nov 10, 2025 by xinfei-shi Loading…
fix: fix cpp api server not started
#340 opened Nov 7, 2025 by zhangjianning-zjn Loading…
Amd/speculative sampling and mtp
#335 opened Nov 7, 2025 by amd-yilizhao Loading…
feat:aiter new version #1328 update
#332 opened Nov 6, 2025 by zhiqchen-amd Loading…
refactor: optimize fp8_per_block_linear performance
#331 opened Nov 6, 2025 by yykzjh Loading…
feat: add atex cuda kernels
#319 opened Nov 4, 2025 by ZhangZhiPku Loading…
feature - add cuda version in whl name
#317 opened Nov 3, 2025 by jianglan89 Loading…
[wip]Develop/embedding grpc server
#315 opened Nov 3, 2025 by wanglining97 Loading…
refactor: refactor cutlass groupgemm fp8
#307 opened Oct 31, 2025 by MMadhatter Loading…
feature: new mtp framework
#305 opened Oct 31, 2025 by Vinkle-hzt Loading…
3 tasks done
ProTip! Updated in the last three days: updated:>2025-11-13.