-
-
Notifications
You must be signed in to change notification settings - Fork 11.1k
Open
Labels
feature requestNew feature or requestNew feature or request
Description
- [AsyncScheduling] Make async overlap work with logprobs #27615
- [BugFix] Handle unscheduled requests properly when async scheduling #27756
- [Core] Async scheduling + structured outputs compatibility #26866
- [BugFix] Fix mixed penalties batch with async scheduling #27910
- [AsyncScheduling] Don't schedule past request max_tokens #27922
- [KV offload] Offloading connector async scheduling support #27648
- [PerfFix] Avoid separate thread for MP executor shm spin #28012 - perf fix for regression in
#26866 - [Core] Rework handling of async scheduling config #28250 - ready for review/merge
- [WIP] Enable async scheduling by default #27614 - testing
- [Core] Async Scheduling X Spec Decoding Compatibility #24799 - under review
- Explore Async Scheduling + Pipeline Parallel
Zerohertz, xuanyu-mistral, cjackal, Ronald1995, hnt2601 and 7 moretjtanaa
Metadata
Metadata
Assignees
Labels
feature requestNew feature or requestNew feature or request