[v0.9.1][Bugfix] Fix guided decoding invalid backend #2645

shen-shanshan · 2025-08-30T02:29:48Z

What this PR does / why we need it?

Run:

pytest -sv tests/singlecard/test_guided_decoding.py

The run fails with the following errors:

FAILED tests/singlecard/test_guided_decoding.py::test_guided_json_completion[guidance:disable-any-whitespace] - NotImplementedError: VLLM_USE_V1=1 is not supported with --guided-decoding-backend=guidance:disable-any-whitespace.
FAILED tests/singlecard/test_guided_decoding.py::test_guided_regex[guidance:disable-any-whitespace] - NotImplementedError: VLLM_USE_V1=1 is not supported with --guided-decoding-backend=guidance:disable-any-whitespace.

Does this PR introduce any user-facing change?

How was this patch tested?

Run:

pytest -sv tests/singlecard/test_guided_decoding.py

Output:

=========================================================================== test session starts ===========================================================================
platform linux -- Python 3.10.18, pytest-8.4.1, pluggy-1.6.0
rootdir: /home/sss/github/vllm-v0.9.1/vllm-ascend
configfile: pytest.ini
plugins: anyio-4.10.0, mock-3.14.1
collected 8 items                                                                                                                                                         

tests/singlecard/test_guided_decoding.py ss.ss..s                                                                                                                   [100%]

============================================================================ warnings summary =============================================================================
<frozen importlib._bootstrap>:241
  <frozen importlib._bootstrap>:241: DeprecationWarning: builtin type SwigPyPacked has no __module__ attribute

<frozen importlib._bootstrap>:241
  <frozen importlib._bootstrap>:241: DeprecationWarning: builtin type SwigPyObject has no __module__ attribute

../../../miniconda3/envs/vllm-v0.9.1/lib/python3.10/site-packages/torch_npu/dynamo/torchair/__init__.py:8
  /home/sss/miniconda3/envs/vllm-v0.9.1/lib/python3.10/site-packages/torch_npu/dynamo/torchair/__init__.py:8: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html
    import pkg_resources

-- Docs: https://docs.pytest.org/en/stable/how-to/capture-warnings.html
========================================================== 3 passed, 5 skipped, 3 warnings in 193.21s (0:03:13) ===========================================================

Signed-off-by: shen-shanshan <467638484@qq.com>

gemini-code-assist

Code Review

This pull request addresses a bug in the guided decoding tests where an invalid backend, guidance:disable-any-whitespace, was causing failures when running with VLLM_USE_V1=1. The fix correctly replaces the invalid backend with guidance, which resolves the NotImplementedError and allows the tests to pass as expected. The change is minimal, targeted, and effectively resolves the issue.
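As an illustration of the fix summarized above, the sketch below shows a test parameter list before and after the option suffix is dropped. The list names and the `base_backend` helper are hypothetical and are not taken from the PR diff; only the two `guidance` strings appear in this thread, and the real test file may list other backends.

```python
# Hypothetical parameter lists; the real test file may contain other backends.
BACKENDS_BEFORE_FIX = ["guidance:disable-any-whitespace"]
BACKENDS_AFTER_FIX = ["guidance"]


def base_backend(name: str) -> str:
    """Return the backend name without any ':option' suffix (hypothetical helper)."""
    return name.split(":", 1)[0]


# Dropping the suffix from the old list yields the new list.
assert [base_backend(b) for b in BACKENDS_BEFORE_FIX] == BACKENDS_AFTER_FIX
```

A name without a suffix, such as `outlines`, passes through `base_backend` unchanged.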

Yikun · 2025-08-30T02:47:37Z

So this is just a test fix? Why did it not fail in earlier CI runs?

shen-shanshan · 2025-08-30T02:52:58Z

So this is just a test fix? Why did it not fail in earlier CI runs?

I am also confused about this. 😂 How did CI pass in the earlier v0.9.1 PRs?

MengqingCao · 2025-09-01T02:14:39Z

tests/singlecard/test_guided_decoding.py

            f"{guided_decoding_backend} will fall back to outlines, skip it")
+    if guided_decoding_backend == "outlines":
+        pytest.skip(
+            f"{guided_decoding_backend} will take up too much time for json "


How long does this case take? If the time is reasonable, I think we'd better keep it running, since there is actually not much CI on v0.9.1-dev.

shen-shanshan · Sep 1, 2025

Maybe 10+ minutes...

MengqingCao · Sep 1, 2025

Okay, then let's keep it skipped.
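The skip decision agreed on in this thread could be sketched roughly as below. This is a minimal illustration under stated assumptions, not the code from the PR: `skip_reason` is a hypothetical helper, and the message text is invented for the example because the original message is truncated in the review snippet.

```python
from typing import Optional


def skip_reason(guided_decoding_backend: str) -> Optional[str]:
    """Return a skip message for backends the JSON case should not run, else None.

    Hypothetical helper: only the `outlines` check comes from the review snippet.
    """
    if guided_decoding_backend == "outlines":
        # The thread estimates this case at 10+ minutes, so it stays skipped.
        return f"{guided_decoding_backend} takes too long for the JSON case, skip it"
    return None
```

In a pytest test, a non-`None` result would be passed to `pytest.skip(...)` at the top of the test body, before any model is loaded.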

Fix guided decoding invalid backend

9d2b838

Signed-off-by: shen-shanshan <467638484@qq.com>

shen-shanshan mentioned this pull request Aug 30, 2025

[Release]: Release checklist for v0.9.1 #2585

Closed

39 tasks

gemini-code-assist bot reviewed Aug 30, 2025

View reviewed changes

github-actions bot added the module:tests label Aug 30, 2025

shen-shanshan added 2 commits August 30, 2025 08:14

update

e4a17ad

Signed-off-by: shen-shanshan <467638484@qq.com>

fix lint

5640f25

Signed-off-by: shen-shanshan <467638484@qq.com>

MengqingCao reviewed Sep 1, 2025

View reviewed changes

wangxiyuan merged commit 234a5a4 into vllm-project:v0.9.1-dev Sep 1, 2025
8 checks passed

shen-shanshan mentioned this pull request Sep 1, 2025

[Feature]: Add Support for Guided Decoding (Structured Output) #177

Closed

20 tasks


4 participants
