Conversation

@huydhn (Contributor) commented Apr 30, 2025

A follow-up PR to fix some more speculative decode tests from #17084. There are 2 fixes:

The latter doesn't have `include_gpu_probs_tensor` set to `True`, which causes a bunch of failures with `pytest -v spec_decode/e2e/test_mlp_correctness.py`. @WoosukKwon Please let me know if the fix makes sense to you. This feels like a quick patch over the underlying setup from #17084, but it kind of works.
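For context, a flag like this controls whether the sampler retains its probability tensors so that a speculative-decode verifier can later compare draft and target probabilities. The toy sketch below is not vLLM's actual Sampler API; `ToySampler`, `sampled_probs`, and `verify_draft` are invented names, and plain Python lists stand in for GPU tensors, purely to illustrate why verification fails when the flag is off:

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class ToySampler:
    # Hypothetical stand-in for a sampler flag of the same name.
    include_gpu_probs_tensor: bool = False
    sampled_probs: Optional[list] = None  # retained only if the flag is set

    def sample(self, logits: list) -> int:
        # Greedy "sampling": normalize logits and pick the argmax.
        total = sum(logits)
        probs = [x / total for x in logits]
        if self.include_gpu_probs_tensor:
            self.sampled_probs = probs  # keep probs around for the verifier
        return max(range(len(logits)), key=lambda i: logits[i])

def verify_draft(sampler: ToySampler, draft_token: int) -> bool:
    # A verifier needs the target model's probs to accept/reject draft tokens.
    if sampler.sampled_probs is None:
        raise ValueError("probs tensor was not retained; "
                         "set include_gpu_probs_tensor=True")
    return sampler.sampled_probs[draft_token] > 0.0

sampler = ToySampler(include_gpu_probs_tensor=True)
token = sampler.sample([1.0, 3.0, 2.0])
print(token)                      # 1 (index of the largest logit)
print(verify_draft(sampler, token))
```

With the flag left at `False`, `verify_draft` raises, mirroring (very loosely) the kind of breakage the test suite surfaced.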

The failures come from vllm-project#17084

Signed-off-by: Huy Do <huydhn@gmail.com>
@github-actions commented

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs do not trigger a full CI run by default. Instead, they only run the fastcheck CI, which runs a small and essential subset of tests to catch errors quickly. You can run additional CI tests on top of those by going to your fastcheck build in the Buildkite UI (linked in the PR checks section) and unblocking them. If you do not have permission to unblock, ping simon-mo or khluu to add you to our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either add the ready label to the PR or enable auto-merge.

🚀

Signed-off-by: Huy Do <huydhn@gmail.com>
@huydhn huydhn marked this pull request as ready for review April 30, 2025 07:15
@WoosukKwon (Collaborator) left a comment

LGTM. @huydhn Thanks for fixing this!

@WoosukKwon added the ready label (ONLY add when PR is ready to merge/full CI is needed) Apr 30, 2025
Signed-off-by: Huy Do <huydhn@gmail.com>
@WoosukKwon WoosukKwon enabled auto-merge (squash) May 1, 2025 09:04
@DarkLight1337 (Member) commented

Nice, now we just have these to worry about:

FAILED spec_decode/e2e/test_eagle_correctness.py::test_eagle_e2e_greedy_correctness_with_preemption[1-4-128-test_llm_kwargs0-baseline_llm_kwargs0-per_test_common_llm_kwargs0-common_llm_kwargs0] - ValueError: 0 is not in list
FAILED spec_decode/e2e/test_medusa_correctness.py::test_medusa_e2e_greedy_correctness_with_preemption[-1-1-4-128-test_llm_kwargs0-baseline_llm_kwargs0-per_test_common_llm_kwargs0-common_llm_kwargs0] - ValueError: 0 is not in list
FAILED spec_decode/test_memory_usage.py::test_memory_usage_no_spec - TypeError: EngineArgs.__init__() got an unexpected keyword argument 'speculative_model'

@vllm-bot vllm-bot merged commit b74d888 into vllm-project:main May 1, 2025
43 of 46 checks passed
@huydhn (Contributor, Author) commented May 1, 2025

> Nice, now we just have these to worry about:
>
> FAILED spec_decode/e2e/test_eagle_correctness.py::test_eagle_e2e_greedy_correctness_with_preemption[1-4-128-test_llm_kwargs0-baseline_llm_kwargs0-per_test_common_llm_kwargs0-common_llm_kwargs0] - ValueError: 0 is not in list
> FAILED spec_decode/e2e/test_medusa_correctness.py::test_medusa_e2e_greedy_correctness_with_preemption[-1-1-4-128-test_llm_kwargs0-baseline_llm_kwargs0-per_test_common_llm_kwargs0-common_llm_kwargs0] - ValueError: 0 is not in list
> FAILED spec_decode/test_memory_usage.py::test_memory_usage_no_spec - TypeError: EngineArgs.__init__() got an unexpected keyword argument 'speculative_model'

Oh darn, I think more failures are sneaking in. They weren't there before I rebased.

radeksm pushed a commit to radeksm/vllm that referenced this pull request May 2, 2025
RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Mu Huai <tianbowen.tbw@antgroup.com>
zzzyq pushed a commit to zzzyq/vllm that referenced this pull request May 24, 2025
Signed-off-by: Huy Do <huydhn@gmail.com>
Signed-off-by: Yuqi Zhang <yuqizhang@google.com>

Labels

ready (ONLY add when PR is ready to merge/full CI is needed), speculative-decoding
