Fix some speculative decode tests with tl.dot #17371

huydhn · 2025-04-29T09:36:20Z

I'm seeing these failures from my other PR #16859, but they don't seem to be related to PyTorch 2.7.0 release. They seem to come from #13305 with the issue on Triton triton-lang/triton#2266. I have seen similar a PR from PyTorch about this pytorch/pytorch#147765.

The PR attempts to fix the failed tests accordingly.

Signed-off-by: Huy Do <huydhn@gmail.com>

github-actions · 2025-04-29T09:36:34Z

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

huydhn · 2025-04-30T02:45:47Z

This fixes some spec decode failures but not all of them. Specifically, https://github.com/vllm-project/vllm/blob/main/.buildkite/test-pipeline.yaml#L283 has been fixed, but there are more failures from the next line VLLM_ATTENTION_BACKEND=FLASH_ATTN pytest -v -s spec_decode --ignore=spec_decode/e2e/test_multistep_correctness.py --ignore=spec_decode/e2e/test_mtp_correctness.py. I have been debugging them and the root cause points to #17084. cc @WoosukKwon

Signed-off-by: Huy Do <huydhn@gmail.com>

Signed-off-by: Huy Do <huydhn@gmail.com> Signed-off-by: Mu Huai <tianbowen.tbw@antgroup.com>

Signed-off-by: Huy Do <huydhn@gmail.com> Signed-off-by: Yuqi Zhang <yuqizhang@google.com>

huydhn added 2 commits April 29, 2025 02:30

Fix some spec decode tests with tl.dot

8016494

Signed-off-by: Huy Do <huydhn@gmail.com>

Another

8b47659

Signed-off-by: Huy Do <huydhn@gmail.com>

huydhn requested review from LiuXiaoxuanPKU and njhill as code owners April 29, 2025 09:36

mergify bot added the speculative-decoding label Apr 29, 2025

huydhn mentioned this pull request Apr 29, 2025

Update PyTorch to 2.7.0 #16859

Merged

mgoin approved these changes Apr 29, 2025

View reviewed changes

mgoin added the ready ONLY add when PR is ready to merge/full CI is needed label Apr 29, 2025

vllm-bot merged commit 88fcf00 into vllm-project:main Apr 30, 2025
41 of 43 checks passed

huydhn mentioned this pull request Apr 30, 2025

Fix more broken speculative decode tests #17450

Merged

radeksm pushed a commit to radeksm/vllm that referenced this pull request May 2, 2025

Fix some speculative decode tests with tl.dot (vllm-project#17371)

8743c0f

Signed-off-by: Huy Do <huydhn@gmail.com>

RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025

Fix some speculative decode tests with tl.dot (vllm-project#17371)

149bc42

Signed-off-by: Huy Do <huydhn@gmail.com> Signed-off-by: Mu Huai <tianbowen.tbw@antgroup.com>

zzzyq pushed a commit to zzzyq/vllm that referenced this pull request May 24, 2025

Fix some speculative decode tests with tl.dot (vllm-project#17371)

7290532

Signed-off-by: Huy Do <huydhn@gmail.com> Signed-off-by: Yuqi Zhang <yuqizhang@google.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Fix some speculative decode tests with tl.dot #17371

Fix some speculative decode tests with tl.dot #17371

Uh oh!

huydhn commented Apr 29, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Apr 29, 2025

Uh oh!

Uh oh!

huydhn commented Apr 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Uh oh!

Fix some speculative decode tests with tl.dot #17371

Fix some speculative decode tests with tl.dot #17371

Uh oh!

Conversation

huydhn commented Apr 29, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Apr 29, 2025

Uh oh!

Uh oh!

huydhn commented Apr 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

huydhn commented Apr 29, 2025 •

edited by github-actions bot

Loading