Skip to content

Commit

Permalink
[Bugfix] Free cross attention block table for preempted-for-recompute…
Browse files Browse the repository at this point in the history
… sequence group. (vllm-project#10013)

Signed-off-by: Kathy Yu <feiyangyu@google.com>
  • Loading branch information
kathyyu-google authored and abmfy committed Jan 24, 2025
1 parent 3712641 commit 9071af8
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions vllm/core/scheduler.py
Original file line number Diff line number Diff line change
Expand Up @@ -1579,6 +1579,7 @@ def _preempt_by_recompute(
seq.status = SequenceStatus.WAITING
self.free_seq(seq)
seq.reset_state_for_recompute()
self._free_seq_group_cross_attn_blocks(seq_group)

def _preempt_by_swap(
self,
Expand Down

0 comments on commit 9071af8

Please sign in to comment.