-
-
Couldn't load subscription status.
- Fork 10.8k
Simplify (and fix) passing of guided decoding backend options #17008
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Simplify (and fix) passing of guided decoding backend options #17008
Conversation
…uided_decoding_backend_options` Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
|
👋 Hi! Thank you for contributing to the vLLM project. 💬 Join our developer Slack at https://slack.vllm.ai to discuss your PR in #pr-reviews, coordinate on features in #feat- channels, or join special interest groups in #sig- channels. Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging. To run CI, PR reviewers can either: Add 🚀 |
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
|
This should merge after #17303 to leverage the improved conversion from dataclass configs to argparse arguments |
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
|
I've fixed all the issues locally, but encounter an issue in the entrypoints test which requires a decision. In my local branch (which fixes the errors you see in CI) I see the following error: $ pytest tests/entrypoints/llm/test_guided_generate.py -vs -k test_guided_number_range_json_completion
...
ValueError: Request-level structured output backend selection is
no longer supported. The request specified 'outlines', but vLLM
was initialised with 'auto'. This error can be resolved by removing
backend selection from the request.This error is supposed to throw when we do this so I'm not sure what to do about it. |
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
|
@hmellor that file only works with V0. CI runs it with |
|
I've found an issue with unions of literals that I'm going to make a separate pr for. Then I'll update this test to only run for v0. Then all the tests should pass and we should be good to merge! |
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
|
Here is the union of literals PR I mentioned #17391 |
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
…kend-options Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
|
Looks like I can't use the I'm going to leave it as is because I'd rather the use the |
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
…roject#17008) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
| f"`guided_backend=xgrammar:disable-any-whitespace` " | ||
| f"to prevent runaway generation of whitespaces.") | ||
| logger.info_once(msg) | ||
| logger.info_once( |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is causing number of arguments mismatch for me https://github.com/vllm-project/vllm/blob/main/vllm/logger.py#L75
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Sorry about that, fix in #17416
…roject#17008) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
…roject#17008) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: Mu Huai <tianbowen.tbw@antgroup.com>
…roject#17008) Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com> Signed-off-by: Yuqi Zhang <yuqizhang@google.com>
Fixes #17002