Skip to content

Commit 356dc3b

Browse files
committed
little fix
Signed-off-by: shen-shanshan <467638484@qq.com>
1 parent a26c097 commit 356dc3b

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

vllm_ascend/worker/model_runner_v1.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1965,7 +1965,7 @@ def execute_model(
19651965
logits_dtype = logits.dtype
19661966
logits = logits.to("cpu").float()
19671967
apply_grammar_bitmask(scheduler_output, self.input_batch,
1968-
logits, self.device)
1968+
logits, torch.device("cpu"))
19691969
logits = logits.to(self.device).to(logits_dtype)
19701970

19711971
# Sample the next token and get logprobs if needed.

0 commit comments

Comments
 (0)