Skip to content

Conversation

@chaunceyjiang
Copy link
Collaborator

@chaunceyjiang chaunceyjiang commented Apr 17, 2025

FIX #16738

Fix the issue where llm.generate cannot be called repeatedly after setting GuidedDecodingParams

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
@github-actions
Copy link

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

@chaunceyjiang
Copy link
Collaborator Author

/cc @DarkLight1337 @russellb PTAL.

@russellb
Copy link
Member

Thanks for the PR! I'm taking a closer look now.

@dgslqh
Copy link

dgslqh commented Apr 17, 2025

I meet the same problem, thanks a lot!

Closes vllm-project#16738

Signed-off-by: Russell Bryant <rbryant@redhat.com>
@russellb
Copy link
Member

@chaunceyjiang can you take a look at the commit I added? It's an alternate solution that I prefer, but want to make sure you're OK with it.

@chaunceyjiang
Copy link
Collaborator Author

@chaunceyjiang can you take a look at the commit I added? It's an alternate solution that I prefer, but want to make sure you're OK with it.

I think your solution is even better. @russellb

@russellb
Copy link
Member

Thanks for taking a look! Since I made changes, I asked for someone else to take a look for approval.

Copy link
Member

@mgoin mgoin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, however I would like to see a test case specifically for this

@russellb
Copy link
Member

LGTM, however I would like to see a test case specifically for this

good point - I can do that pretty quick I think

Signed-off-by: Russell Bryant <rbryant@redhat.com>
@russellb russellb self-requested a review as a code owner April 17, 2025 15:30
@russellb
Copy link
Member

LGTM, however I would like to see a test case specifically for this

good point - I can do that pretty quick I think

added test coverage by slightly extending an existing test.

@chaunceyjiang
Copy link
Collaborator Author

/cc @russellb @DarkLight1337 I think this PR can be merged.

@DarkLight1337 DarkLight1337 added the ready ONLY add when PR is ready to merge/full CI is needed label Apr 22, 2025
@DarkLight1337 DarkLight1337 enabled auto-merge (squash) April 22, 2025 06:02
@DarkLight1337 DarkLight1337 merged commit acba33a into vllm-project:main Apr 22, 2025
60 checks passed
@chaunceyjiang chaunceyjiang deleted the guided_decoding branch April 22, 2025 13:48
frieda-huang pushed a commit to frieda-huang/vllm that referenced this pull request Apr 23, 2025
… after setting GuidedDecodingParams (vllm-project#16767)

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
Signed-off-by: Russell Bryant <rbryant@redhat.com>
Co-authored-by: Russell Bryant <rbryant@redhat.com>
Signed-off-by: Frieda (Jingying) Huang <jingyingfhuang@gmail.com>
@kaimatzu
Copy link

When is this getting released?

@DarkLight1337
Copy link
Member

The next release will be around the end of this month. If you can't wait, you can install the nightly vLLM package.

https://docs.vllm.ai/en/latest/getting_started/installation/gpu.html#pre-built-wheels

@jgoriasilva
Copy link

Thanks! I was having the same problem and this solves it, I tested it with the nightly package.

jikunshang pushed a commit to jikunshang/vllm that referenced this pull request Apr 29, 2025
… after setting GuidedDecodingParams (vllm-project#16767)

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
Signed-off-by: Russell Bryant <rbryant@redhat.com>
Co-authored-by: Russell Bryant <rbryant@redhat.com>
lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Apr 29, 2025
… after setting GuidedDecodingParams (vllm-project#16767)

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
Signed-off-by: Russell Bryant <rbryant@redhat.com>
Co-authored-by: Russell Bryant <rbryant@redhat.com>
adobrzyn pushed a commit to HabanaAI/vllm-fork that referenced this pull request Apr 30, 2025
… after setting GuidedDecodingParams (vllm-project#16767)

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
Signed-off-by: Russell Bryant <rbryant@redhat.com>
Co-authored-by: Russell Bryant <rbryant@redhat.com>
Signed-off-by: Agata Dobrzyniewicz <adobrzyniewicz@habana.ai>
RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request May 12, 2025
… after setting GuidedDecodingParams (vllm-project#16767)

Signed-off-by: chaunceyjiang <chaunceyjiang@gmail.com>
Signed-off-by: Russell Bryant <rbryant@redhat.com>
Co-authored-by: Russell Bryant <rbryant@redhat.com>
Signed-off-by: Mu Huai <tianbowen.tbw@antgroup.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ready ONLY add when PR is ready to merge/full CI is needed v1

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Bug]: GuidedDecodingParams choice - Request-level structured output backend must match engine-level backend

7 participants