[Bugfix][Core]Fix block table out-of-range issue in priority scheduling #26661

quanliu1991 · 2025-10-12T14:38:30Z

Purpose

This PR fixes a block table out-of-range error that occurs when using priority scheduling

When a low-priority request that has already been added to scheduled_running_reqs is preempted,
the corresponding entries in req_to_new_blocks and num_scheduled_tokens are not updated synchronously.

As a result, when the same request is later re-scheduled and added back to scheduled_running_reqs,
the block table allocation does not start from index 0, leading to an index out-of-range error.

This PR ensures that:

When a request is preempted, related entries in req_to_new_blocks and num_scheduled_tokens are cleared or resynchronized.

Subsequent scheduling will always allocate new block IDs starting from 0.

Fixes #23316

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing test command.
The test results, such as pasting the results comparison before and after, or e2e results
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the Google Doc.

gemini-code-assist

Code Review

这个 PR 旨在修复一个抢占调度中的 bug，即当一个已调度的请求被抢占时，相关的资源没有被正确回收。您正确地识别出需要从 req_to_new_blocks 和 num_scheduled_tokens 中移除被抢占的请求。然而，在归还 token_budget 时存在一个错误：代码错误地使用了当前请求的 ID (request.request_id) 而不是被抢占请求的 ID (preempted_req.request_id)。这会导致 KeyError，因为当前请求尚未被调度。我提供了一个修复建议。请查看具体的审查评论。

gemini-code-assist · 2025-10-12T14:39:24Z

vllm/v1/core/sched/scheduler.py

+                        token_budget += num_scheduled_tokens[request.request_id]
+                        req_to_new_blocks.pop(preempted_req.request_id)
+                        num_scheduled_tokens.pop(preempted_req.request_id)


在归还 token_budget 时存在一个错误。代码使用了 request.request_id 来查找已调度的 token 数量，但这里的 request 是指当前正在调度并触发抢占的请求，而不是被抢占的 preempted_req。这会导致 KeyError，因为当前的 request 还没有被添加到 num_scheduled_tokens 字典中。您应该使用 preempted_req.request_id。

Suggested change

token_budget += num_scheduled_tokens[request.request_id]

req_to_new_blocks.pop(preempted_req.request_id)

num_scheduled_tokens.pop(preempted_req.request_id)

token_budget += num_scheduled_tokens[preempted_req.request_id]

req_to_new_blocks.pop(preempted_req.request_id)

num_scheduled_tokens.pop(preempted_req.request_id)

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

chatgpt-codex-connector · 2025-10-12T14:40:01Z

vllm/v1/core/sched/scheduler.py

                    if preempted_req in scheduled_running_reqs:
                        scheduled_running_reqs.remove(preempted_req)
+                        token_budget += num_scheduled_tokens[request.request_id]
+                        req_to_new_blocks.pop(preempted_req.request_id)


Refund tokens for preempted request instead of current request

When a previously scheduled request is preempted, the code attempts to refund its token budget with token_budget += num_scheduled_tokens[request.request_id]. Here request is the request currently being scheduled and has not been inserted into num_scheduled_tokens, so the dictionary lookup will raise a KeyError the first time a scheduled low‑priority request is preempted. Even if the key existed by coincidence, the refunded amount would be for the wrong request and the scheduler’s token accounting would be corrupted. This line should reference preempted_req.request_id instead.

Useful? React with 👍 / 👎.

…uled_tokens Signed-off-by: quanliu <18646313696@163.com>

heheda12345

LGTM!

…ng (vllm-project#26661) Signed-off-by: quanliu <18646313696@163.com> Signed-off-by: 1994 <1994@users.noreply.github.com>

…ng (vllm-project#26661) Signed-off-by: quanliu <18646313696@163.com> Signed-off-by: Dhruvil Bhatt <bhattdbh@amazon.com>

…ng (vllm-project#26661) Signed-off-by: quanliu <18646313696@163.com> Signed-off-by: bbartels <benjamin@bartels.dev>

…ng (vllm-project#26661) Signed-off-by: quanliu <18646313696@163.com>

…ng (vllm-project#26661) Signed-off-by: quanliu <18646313696@163.com> Signed-off-by: xuebwang-amd <xuebwang@amd.com>

…ng (vllm-project#26661) Signed-off-by: quanliu <18646313696@163.com> Signed-off-by: 0xrushi <6279035+0xrushi@users.noreply.github.com>

quanliu1991 requested review from ApostaC, WoosukKwon, alexm-redhat, comaniac, heheda12345, njhill, robertgshaw2-redhat and ywang96 as code owners October 12, 2025 14:38

mergify bot added the v1 label Oct 12, 2025

gemini-code-assist bot reviewed Oct 12, 2025

View reviewed changes

chatgpt-codex-connector bot reviewed Oct 12, 2025

View reviewed changes

quanliu1991 changed the title ~~[Bugfix][Core]remove preempted_req in req_to_new_blocks and num_sched…~~ [Bugfix][Core]Fix block table out-of-range issue in priority scheduling Oct 12, 2025

[Bugfix][Core]remove preempted_req in req_to_new_blocks and num_sched…

c8c7d56

…uled_tokens Signed-off-by: quanliu <18646313696@163.com>

quanliu1991 force-pushed the main branch from e2f9314 to c8c7d56 Compare October 12, 2025 14:48

heheda12345 approved these changes Oct 12, 2025

View reviewed changes

heheda12345 enabled auto-merge (squash) October 12, 2025 23:33

github-actions bot added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 12, 2025

heheda12345 merged commit 41f3884 into vllm-project:main Oct 13, 2025
46 checks passed

1994 pushed a commit to 1994/vllm that referenced this pull request Oct 14, 2025

[Bugfix][Core]Fix block table out-of-range issue in priority scheduli…

a6cec82

…ng (vllm-project#26661) Signed-off-by: quanliu <18646313696@163.com> Signed-off-by: 1994 <1994@users.noreply.github.com>

bbartels pushed a commit to bbartels/vllm that referenced this pull request Oct 16, 2025

[Bugfix][Core]Fix block table out-of-range issue in priority scheduli…

fbe3b85

…ng (vllm-project#26661) Signed-off-by: quanliu <18646313696@163.com> Signed-off-by: bbartels <benjamin@bartels.dev>

lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request Oct 20, 2025

[Bugfix][Core]Fix block table out-of-range issue in priority scheduli…

8407129

…ng (vllm-project#26661) Signed-off-by: quanliu <18646313696@163.com>

alhridoy pushed a commit to alhridoy/vllm that referenced this pull request Oct 24, 2025

[Bugfix][Core]Fix block table out-of-range issue in priority scheduli…

28f2bd4

…ng (vllm-project#26661) Signed-off-by: quanliu <18646313696@163.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Bugfix][Core]Fix block table out-of-range issue in priority scheduling #26661

[Bugfix][Core]Fix block table out-of-range issue in priority scheduling #26661

Uh oh!

quanliu1991 commented Oct 12, 2025 •

edited by github-actions bot

Loading

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

gemini-code-assist bot Oct 12, 2025

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

chatgpt-codex-connector bot Oct 12, 2025

Uh oh!

quanliu1991 Oct 12, 2025

Uh oh!

heheda12345 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

[Bugfix][Core]Fix block table out-of-range issue in priority scheduling #26661

[Bugfix][Core]Fix block table out-of-range issue in priority scheduling #26661

Uh oh!

Conversation

quanliu1991 commented Oct 12, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist bot Oct 12, 2025

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector bot Oct 12, 2025

Choose a reason for hiding this comment

Uh oh!

quanliu1991 Oct 12, 2025

Choose a reason for hiding this comment

Uh oh!

heheda12345 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

quanliu1991 commented Oct 12, 2025 •

edited by github-actions bot

Loading