[Bugfix][TPU] Fix pad slot id (vllm-project#5977)
WoosukKwon authored and prashantgupta24 committed Jul 1, 2024
1 parent 28773bf commit 9ea7506
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion vllm/worker/tpu_model_runner.py
@@ -19,7 +19,7 @@

logger = init_logger(__name__)

-_PAD_SLOT_ID = 0  # FIXME(woosuk)
+_PAD_SLOT_ID = -1  # NOTE(woosuk): In PyTorch XLA, index -1 is ignored.
# FIXME(woosuk): Temporarily disabled top-p sampling since it's too slow.
_ENABLE_TOP_P = False
# FIXME(woosuk): A temporary hack to support `n > 1`.
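
To illustrate why the pad value matters, here is a minimal pure-Python sketch, not the vLLM implementation. It assumes a slot mapping that is right-padded to a fixed length and a cache modeled as a plain list. The names `pad_slot_mapping` and `write_kv` are hypothetical, and the skip of the pad id is written out explicitly to emulate the behavior the commit note attributes to PyTorch XLA.

```python
from typing import List, Optional

# Hypothetical sentinel mirroring the new value of _PAD_SLOT_ID in the diff.
PAD_SLOT_ID = -1


def pad_slot_mapping(slots: List[int], padded_len: int, pad_id: int) -> List[int]:
    """Right-pad a per-token slot mapping to a fixed length with pad_id."""
    if len(slots) > padded_len:
        raise ValueError("padded_len is shorter than the slot list")
    return slots + [pad_id] * (padded_len - len(slots))


def write_kv(
    cache: List[Optional[str]],
    slot_mapping: List[int],
    values: List[str],
    ignored_id: int = PAD_SLOT_ID,
) -> None:
    """Write values[i] into cache[slot_mapping[i]], skipping ignored slots.

    The skip is explicit here; it only emulates an index that the
    backend ignores and does not call PyTorch XLA.
    """
    for slot, value in zip(slot_mapping, values):
        if slot == ignored_id:
            continue  # padding token: nothing is written
        cache[slot] = value


if __name__ == "__main__":
    values = ["a", "b", "pad", "pad"]  # two real tokens, two padding tokens

    # Pad id 0 is also a valid slot, so padding clobbers the entry stored there.
    cache = ["keep", None, None, None]
    write_kv(cache, pad_slot_mapping([2, 3], 4, pad_id=0), values)
    print(cache)

    # Pad id -1 is skipped, so slot 0 keeps its earlier contents.
    cache = ["keep", None, None, None]
    write_kv(cache, pad_slot_mapping([2, 3], 4, pad_id=PAD_SLOT_ID), values)
    print(cache)
```

In this sketch, padding with 0 overwrites whatever already lives in slot 0, while padding with -1 leaves the cache untouched for the padded positions.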
