Skip to content

Conversation

@benchislett
Copy link
Collaborator

@benchislett benchislett commented Oct 14, 2025

Purpose

Patch for speculative_config.enforce_eager not being respected in the draft model dummy run.

This flag was introduced for DSV3.2 MTP which currently must run in eager mode, so it is not a common crash.

Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
@benchislett benchislett added the bug Something isn't working label Oct 14, 2025
@mergify mergify bot added the v1 label Oct 14, 2025
Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request correctly fixes a bug where speculative_config.enforce_eager was not being respected during the dummy run for the draft model. The change adds the necessary condition to disable CUDA graphs for the drafter's dummy run when eager mode is enforced. The implementation is correct and addresses the issue effectively. I have no further comments.

@benchislett benchislett added the ready ONLY add when PR is ready to merge/full CI is needed label Oct 14, 2025
@benchislett benchislett enabled auto-merge (squash) October 15, 2025 18:47
@benchislett benchislett disabled auto-merge October 15, 2025 19:20
@benchislett
Copy link
Collaborator Author

disabling auto-merge, forgot I still need a stamp

@pavanimajety pavanimajety merged commit 1974880 into vllm-project:main Oct 21, 2025
46 checks passed
@benchislett benchislett deleted the bugfix-eagle-with-eager branch October 21, 2025 23:00
baonudesifeizhai pushed a commit to baonudesifeizhai/vllm that referenced this pull request Oct 21, 2025
sstamenk pushed a commit to sstamenk/vllm that referenced this pull request Oct 23, 2025
…oject#26821)

Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
Signed-off-by: sstamenk <strahinja.stamenkovic@amd.com>
usberkeley pushed a commit to usberkeley/vllm that referenced this pull request Oct 23, 2025
albertoperdomo2 pushed a commit to albertoperdomo2/vllm that referenced this pull request Oct 23, 2025
…oject#26821)

Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
Signed-off-by: Alberto Perdomo <aperdomo@redhat.com>
0xrushi pushed a commit to 0xrushi/vllm that referenced this pull request Oct 26, 2025
…oject#26821)

Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
Signed-off-by: 0xrushi <6279035+0xrushi@users.noreply.github.com>
0xrushi pushed a commit to 0xrushi/vllm that referenced this pull request Oct 26, 2025
…oject#26821)

Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
Signed-off-by: 0xrushi <6279035+0xrushi@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

bug Something isn't working ready ONLY add when PR is ready to merge/full CI is needed v1

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants