fix long dtype in topk sampling #15049
Conversation
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs do not trigger a full CI run by default; only a small subset of checks runs automatically. Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can add the 🚀 label.
```diff
  # Get with the logprob of the prompt or sampled token.
- token_ids = token_ids.unsqueeze(-1)
+ token_ids = token_ids.unsqueeze(-1).to(torch.long)
```
nit: .long()
Just curious: How is it different from .to(torch.long)?
It's the same; my reason is that .long() is shorter. :-)
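For what it's worth, a quick check (my own snippet, not from the PR) confirming the two spellings are interchangeable:

```python
import torch

# Illustrative snippet, not from the PR: .long() is shorthand
# for .to(torch.long); both produce an int64 tensor.
token_ids = torch.tensor([1, 2, 3], dtype=torch.int32)

assert token_ids.long().dtype == torch.int64
assert token_ids.to(torch.long).dtype == torch.int64
assert torch.equal(token_ids.long(), token_ids.to(torch.long))
```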
Some context on why it has to be a long tensor: https://discuss.pytorch.org/t/why-does-the-indices-tensor-have-to-be-long-dtype/139675
houseroad left a comment:
Looks good to me. Wondering if we can add some unit tests?
houseroad left a comment:
Since it's a simple fix, I am okay with landing the fix first, then having a follow-up PR to add the test.
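A minimal regression test along those lines could look like the sketch below. The helper gather_token_logprobs is an assumption for illustration, not the actual vLLM code path; it just mirrors the patched line:

```python
import torch


def gather_token_logprobs(logprobs: torch.Tensor,
                          token_ids: torch.Tensor) -> torch.Tensor:
    # Mirrors the patched line: cast indices to long before gather,
    # since torch.gather requires int64 indices.
    token_ids = token_ids.unsqueeze(-1).to(torch.long)
    return logprobs.gather(-1, token_ids).squeeze(-1)


def test_gather_accepts_int32_token_ids():
    logprobs = torch.randn(4, 32000).log_softmax(dim=-1)
    # Token ids arriving as int32, as in the reported failure.
    token_ids = torch.randint(0, 32000, (4,), dtype=torch.int32)
    out = gather_token_logprobs(logprobs, token_ids)
    expected = logprobs[torch.arange(4), token_ids.long()]
    assert torch.equal(out, expected)
```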
Signed-off-by: Louis Ulmer <ulmerlouis@gmail.com>
Signed-off-by: Mu Huai <tianbowen.tbw@antgroup.com>
Fix the long dtype in top-k sampling, since torch.gather requires the index tensor to be of long dtype (int64).
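As a quick illustration of that constraint (my own sketch, not part of the PR):

```python
import torch

logprobs = torch.randn(2, 8).log_softmax(dim=-1)
idx = torch.tensor([[3], [5]], dtype=torch.int32)

# gather() rejects non-int64 indices with a RuntimeError
# (roughly: "gather(): Expected dtype int64 for index").
try:
    logprobs.gather(-1, idx)
except RuntimeError as err:
    print(err)

# Casting the indices to long, as the fix does, makes it work.
print(logprobs.gather(-1, idx.to(torch.long)))
```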