You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Signed-off-by: bbartels <benjamin@bartels.dev>
[gpt-oss] Add IncompleteDetails to ResponsesRepsonse (#24561)
Signed-off-by: Andrew Xia <axia@meta.com>
[gpt-oss][1a] create_responses stream outputs BaseModel type, api server is SSE still (#24759)
Signed-off-by: Andrew Xia <axia@meta.com>
[Performance] Remove redundant clone() calls in cutlass_mla (#24891)
[Bug] Fix Cutlass Scaled MM Compilation Error (#24887)
Signed-off-by: yewentao256 <zhyanwentao@126.com>
[ci] fix wheel names for arm wheels (#24898)
Signed-off-by: simon-mo <simon.mo@hey.com>
[Tests] fix initialization of kv hash in tests (#24273)
Signed-off-by: Mickael Seznec <mickael@mistral.ai>
[Compile] Fix noop_elimination pass and add tests for noop_elimination (#24880)
Signed-off-by: zjy0516 <riverclouds.zhu@qq.com>
Propagate entire tokens to connector for resumed preemptions
Signed-off-by: Qier Li <kevin44036@gmail.com>
Fix pre-commit
Signed-off-by: Qier Li <kevin44036@gmail.com>
Rename field and nullify empty lists
Signed-off-by: Qier Li <kevin44036@gmail.com>
Update vllm/v1/core/sched/scheduler.py
Co-authored-by: Nick Hill <nhill@redhat.com>
Signed-off-by: Qier Li <kevin44036@gmail.com>
Add unit test for preemption resumption
Signed-off-by: Qier Li <kevin44036@gmail.com>
0 commit comments