@hmellor hmellor commented Apr 22, 2025

Fixes #17002

…uided_decoding_backend_options`

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
@github-actions

👋 Hi! Thank you for contributing to the vLLM project.

💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels.

Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can either: Add ready label to the PR or enable auto-merge.

🚀

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
hmellor added 2 commits April 24, 2025 10:29
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
@hmellor
Member Author

hmellor commented Apr 28, 2025

This should merge after #17303 to leverage the improved conversion from dataclass configs to argparse arguments
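For readers unfamiliar with the idea, converting a dataclass config into argparse arguments can be sketched roughly as follows. This is a minimal illustration only: `DecodingConfig`, its fields, and `add_dataclass_args` are hypothetical names and do not reflect the actual implementation in #17303.

```python
import argparse
from dataclasses import dataclass, fields
from typing import Literal, get_args, get_origin


@dataclass
class DecodingConfig:
    """Hypothetical config; field names are illustrative, not vLLM's."""
    backend: Literal["auto", "xgrammar", "outlines"] = "auto"
    disable_any_whitespace: bool = False
    max_retries: int = 3


def add_dataclass_args(parser, config_cls):
    """Add one CLI flag per dataclass field, derived from its type hint."""
    for f in fields(config_cls):
        flag = "--" + f.name.replace("_", "-")
        if get_origin(f.type) is Literal:
            # A Literal[...] hint becomes a fixed set of choices.
            parser.add_argument(flag, choices=get_args(f.type),
                                default=f.default)
        elif f.type is bool:
            # A bool field becomes an on/off switch.
            parser.add_argument(flag, action="store_true",
                                default=f.default)
        else:
            # Anything else uses the annotated type as the converter.
            parser.add_argument(flag, type=f.type, default=f.default)
    return parser
```

With this sketch, `add_dataclass_args(argparse.ArgumentParser(), DecodingConfig)` would accept `--backend xgrammar --disable-any-whitespace`, and the parsed namespace can be passed back into `DecodingConfig(**vars(ns))`.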

hmellor added 4 commits April 28, 2025 14:28
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
hmellor added 3 commits April 28, 2025 20:28
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
@hmellor
Member Author

hmellor commented Apr 28, 2025

I've fixed all the issues locally, but I've hit an issue in the entrypoints test that requires a decision.

In my local branch (which fixes the errors you see in CI) I see the following error:

$ pytest tests/entrypoints/llm/test_guided_generate.py -vs -k test_guided_number_range_json_completion
...
ValueError: Request-level structured output backend selection is
no longer supported. The request specified 'outlines', but vLLM
was initialised with 'auto'. This error can be resolved by removing
backend selection from the request.

This error is supposed to be raised in this situation, so I'm not sure what to do about it.
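To make the behaviour concrete, a check of this kind could look something like the sketch below. The function name `validate_request_backend` and its exact condition are assumptions made for illustration; only the error text is taken from the output above, and the real check in vLLM may differ.

```python
from typing import Optional


def validate_request_backend(request_backend: Optional[str],
                             engine_backend: str) -> None:
    """Hypothetical check: reject a per-request backend that differs
    from the backend the engine was initialised with."""
    if request_backend is None or request_backend == engine_backend:
        return
    raise ValueError(
        "Request-level structured output backend selection is no longer "
        f"supported. The request specified '{request_backend}', but vLLM "
        f"was initialised with '{engine_backend}'. This error can be "
        "resolved by removing backend selection from the request.")
```

Under this assumption, a request asking for `outlines` against an engine started with `auto` raises, while a request that specifies no backend passes through.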

hmellor added 2 commits April 29, 2025 09:10
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
@russellb
Member

@hmellor that file only works with V0. CI runs it with VLLM_USE_V1=0. It would be better to edit the file to force it to do that on its own, but if it passes for you with V0, you're good to go.
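One generic way to force an environment variable for a block of code, independent of how CI is configured, is sketched below using only the standard library. `read_use_v1` is a hypothetical stand-in for whatever reads the flag, and `run_with_v0` is an illustrative helper, not something from the vLLM test suite.

```python
import os
from unittest import mock


def read_use_v1() -> bool:
    """Hypothetical stand-in for code that reads the VLLM_USE_V1 flag."""
    return os.environ.get("VLLM_USE_V1", "1") == "1"


def run_with_v0(fn):
    """Call fn() with VLLM_USE_V1=0, restoring the environment afterwards."""
    with mock.patch.dict(os.environ, {"VLLM_USE_V1": "0"}):
        return fn()
```

The override only covers code that runs inside the `with` block, so anything that reads the variable would need to be constructed there.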

@hmellor
Member Author

hmellor commented Apr 29, 2025

I've found an issue with unions of literals that I'm going to open a separate PR for. Then I'll update this test to only run for V0. After that, all the tests should pass and we should be good to merge!

hmellor added 5 commits April 29, 2025 15:11
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
@hmellor
Member Author

hmellor commented Apr 29, 2025

Here is the union of literals PR I mentioned: #17391

hmellor added 4 commits April 29, 2025 15:46
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
…kend-options

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
@hmellor
Member Author

hmellor commented Apr 29, 2025

It looks like I can't use the monkeypatch fixture to instantiate the LLM with VLLM_USE_V1=0.

I'm going to leave it as is, because I'd rather use the llm fixture and have the tests run faster.

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
@russellb russellb merged commit a6977db into vllm-project:main Apr 29, 2025
48 checks passed
@hmellor hmellor deleted the split-guided-decoding-backend-options branch April 29, 2025 19:10
lk-chen pushed a commit to lk-chen/vllm that referenced this pull request Apr 29, 2025
…roject#17008)

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
f"`guided_backend=xgrammar:disable-any-whitespace` "
f"to prevent runaway generation of whitespaces.")
logger.info_once(msg)
logger.info_once(
Collaborator

This is causing a number-of-arguments mismatch for me: https://github.com/vllm-project/vllm/blob/main/vllm/logger.py#L75

Member Author

Sorry about that, fix in #17416
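As an illustration of how this kind of mismatch can arise, here is a sketch of a log-once helper that accepts only a single message string. It is an assumption about the general shape of such a helper, not a copy of `vllm/logger.py`; `_info_once` and `info_once` are illustrative names.

```python
import logging
from functools import lru_cache


@lru_cache
def _info_once(logger: logging.Logger, msg: str) -> None:
    # lru_cache makes repeated calls with the same (logger, msg) a no-op.
    logger.info(msg)


def info_once(logger: logging.Logger, msg: str) -> None:
    """Log msg at INFO level at most once per distinct message."""
    _info_once(logger, msg)
```

Unlike `logging.Logger.info`, this signature has no `*args`, so a call such as `info_once(logger, "value: %s", x)` raises a `TypeError` instead of formatting the message lazily. Callers would need to format the string first.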

Labels

documentation, ready, structured-output, v1

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

[Bug]: Guided Decoding Backend options with the OpenAI server recently broken