Commit f0dac90

allow deepseek models to enable chunked prefill on NPUs
Signed-off-by: rjg-lyh <1318825571@qq.com>
1 parent 418ed3d commit f0dac90

1 file changed: +3 -0 lines changed

vllm_ascend/platform.py

Lines changed: 3 additions & 0 deletions
@@ -204,6 +204,9 @@ def check_and_update_config(cls, vllm_config: VllmConfig) -> None:
                 "ascend_scheduler_config", None) is not None:
             additional_scheduler_config = additional_config.get(
                 "ascend_scheduler_config")
+            if vllm_config.scheduler_config.enable_chunked_prefill:
+                additional_scheduler_config[
+                    "enable_chunked_prefill"] = True
             from vllm_ascend.core.schedule_config import \
                 AscendSchedulerConfig
             ascend_scheduler_config = AscendSchedulerConfig.initialize_from_config(
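
The effect of the three added lines can be sketched in isolation. The snippet below is a minimal illustration and not the vllm_ascend implementation: `SchedulerConfigStub` and `propagate_chunked_prefill` are hypothetical names, and plain dicts stand in for the real `VllmConfig` and `additional_config` objects. It assumes only what the diff shows: when `enable_chunked_prefill` is set on the scheduler config, the same key is forced to `True` in the `ascend_scheduler_config` dict before that dict is passed on.

```python
from dataclasses import dataclass
from typing import Any, Dict, Optional


@dataclass
class SchedulerConfigStub:
    """Hypothetical stand-in for vllm_config.scheduler_config."""
    enable_chunked_prefill: bool = False


def propagate_chunked_prefill(
    scheduler_config: SchedulerConfigStub,
    additional_config: Optional[Dict[str, Any]],
) -> Optional[Dict[str, Any]]:
    """Copy the chunked-prefill flag into the Ascend scheduler dict.

    Returns the (mutated) Ascend scheduler dict, or None when no
    "ascend_scheduler_config" entry is present.
    """
    if additional_config is None:
        return None
    ascend_cfg = additional_config.get("ascend_scheduler_config")
    if ascend_cfg is None:
        return None
    # Mirrors the added lines: the flag is only ever switched on here.
    # If the upstream flag is False, any existing value is left untouched.
    if scheduler_config.enable_chunked_prefill:
        ascend_cfg["enable_chunked_prefill"] = True
    return ascend_cfg


if __name__ == "__main__":
    extra = {"ascend_scheduler_config": {}}
    result = propagate_chunked_prefill(SchedulerConfigStub(True), extra)
    print(result)
```

Note that the dict is updated in place, as in the diff, so a caller holding a reference to `additional_config` sees the new key as well.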
