
Commit b721f6c

Disable prefix caching only if chunked prefill is explicitly disabled
Signed-off-by: simondanielsson <simon.danielsson99@hotmail.com>
1 parent cedaaa0 commit b721f6c

1 file changed: +3 -1 lines changed

vllm/config/__init__.py

Lines changed: 3 additions & 1 deletion
@@ -432,7 +432,9 @@ def __post_init__(self):
                     "try setting 'VLLM_WORKER_MULTIPROC_METHOD' "
                     "to 'spawn'.")
 
-        if (not self.scheduler_config.chunked_prefill_enabled
+        # Disable prefix caching only if chunked prefill is explicitly disabled
+        # (and not merely unset)
+        if (self.scheduler_config.chunked_prefill_enabled is False
                 or disable_chunked_prefill_reasons):
             for reason in disable_chunked_prefill_reasons:
                 logger.info(reason)
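
The difference between the old and new condition only shows up when the flag is unset. The sketch below assumes, based on the "(and not merely unset)" comment in the diff, that `chunked_prefill_enabled` can be `None` as well as `True` or `False`. The helper names `old_check` and `new_check` are hypothetical and exist only for this illustration; they are not part of vLLM.

```python
from typing import Optional


def old_check(chunked_prefill_enabled: Optional[bool],
              reasons: list[str]) -> bool:
    # Pre-change condition: `not None` is True, so an unset flag
    # is treated the same as an explicit False.
    return bool(not chunked_prefill_enabled or reasons)


def new_check(chunked_prefill_enabled: Optional[bool],
              reasons: list[str]) -> bool:
    # Post-change condition: the identity test matches only an explicit
    # False; None (assumed here to mean "unset") falls through.
    return bool(chunked_prefill_enabled is False or reasons)


if __name__ == "__main__":
    # Compare both conditions for each flag state, with no disable reasons.
    for value in (True, False, None):
        print(f"{value!s:>5}  old={old_check(value, [])}  "
              f"new={new_check(value, [])}")
```

With an empty reasons list, the two helpers agree for `True` and `False` and differ only for `None`. A non-empty reasons list still triggers the branch in both versions.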
