Skip to content

Commit ad8a800

Browse files
nopperltdoublep
andcommitted
Remove superfluous slicing of state_indices_tensor
Co-authored-by: Thomas Parnell <tom.parnell@gmail.com> Signed-off-by: nopperl <54780682+nopperl@users.noreply.github.com>
1 parent e10d3cc commit ad8a800

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm/model_executor/models/plamo2.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -332,7 +332,7 @@ def forward_cuda(
332332
dim=0)
333333
# Split along batch dimension
334334
state_indices_tensor_d, state_indices_tensor_p = torch.split(
335-
state_indices_tensor[:num_actual_tokens],
335+
state_indices_tensor,
336336
[num_decodes, num_prefills],
337337
dim=0,
338338
)

0 commit comments

Comments
 (0)