You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[V0.11.0][Core] Restore scheduling logic under default configuration (vllm-project#3967)
This PR reverts the changes introduced in PR vllm-project#2894 Initially, due to
performance issues with the older version of the chunked prefill ops,
the default behavior was to use the Ascend scheduler to disable the
chunked prefill feature. However, with the improvements in the
performance of the new chunked prefill ops, this interception strategy
has been removed. This change also aligns with the community's default
configuration behavior.
No.
CI passed with new added/existing test.
- vLLM version: v0.11.0
- vLLM main:
vllm-project/vllm@83f478b
Signed-off-by: rjg-lyh <1318825571@qq.com>
0 commit comments