[V1] Structured Outputs + Thinking compatibility #16577
Merged: simon-mo merged 52 commits into vllm-project:main from aarnphm:feat/support-thinking-struct-outputs on May 14, 2025
Commits
7f9b174 chore: migrate tokenizer init to manager only (aarnphm)
023807d chore: init reasoning_parser on manager (aarnphm)
92527f6 feat: support parsing thinking tokens (aarnphm)
e50ea40 chore: add a check to make sure that the reasoning token is not being… (aarnphm)
7542582 chore: update docs (aarnphm)
fa6da3f chore: move reasoning_ended to so_request (aarnphm)
061ee09 chore: reduce diff (aarnphm)
5eecdbb chore: move up checker logics (aarnphm)
873b08b chore: update correct function imports (aarnphm)
218ad9c chore: remove incorrect function (aarnphm)
1ec8928 fix: make sure to reset the bitmask before update (aarnphm)
9b6f4e8 chore: make sure non reasoning case works (aarnphm)
63eecbf fix: remove unused check (aarnphm)
ea1487f merge: branch 'main' of github.com:vllm-project/vllm into feat/suppor… (aarnphm)
910ee0c chore: fix pre-comimt (aarnphm)
e220eac merge: branch 'main' of github.com:vllm-project/vllm into feat/suppor… (aarnphm)
327a0d0 revert: bad merge and remove inlines (aarnphm)
6d26942 fix: make sure to initialize DecodingConfig by default, and fix types (aarnphm)
7635f17 merge: with upstream and add compatibility with thinking cases (aarnphm)
5a708aa merge: branch 'main' of github.com:vllm-project/vllm into feat/suppor… (aarnphm)
fcfef12 --wip-- (aarnphm)
1e828bd chore: move logic to manager (aarnphm)
ce1fddc chore: update notes (aarnphm)
c211110 fix: make sure works with both thinking, spec and struct matrixes (aarnphm)
97d1d4e merge: branch 'main' of github.com:vllm-project/vllm into feat/suppor… (aarnphm)
b89662a chore: cleanup logics (aarnphm)
8627691 merge: branch 'main' of github.com:vllm-project/vllm into feat/suppor… (aarnphm)
27817a0 merge: branch 'main' of github.com:vllm-project/vllm into feat/suppor… (aarnphm)
591da8e fix: update to newer logics (aarnphm)
c41e80c merge: branch 'main' of github.com:vllm-project/vllm into feat/suppor… (aarnphm)
5cf804d merge: branch 'main' of github.com:vllm-project/vllm into feat/suppor… (aarnphm)
a807bee chore: revert whitespace changes (aarnphm)
6c2b9df fix(tests): ignore runaway properties (aarnphm)
fb92d8a fix: broken tests (aarnphm)
174e7e8 Update tests/v1/entrypoints/llm/test_struct_output_generate.py (aarnphm)
42671cf revert: update noqa changes (aarnphm)
9c364d0 chore: add a notes about bitmask reset (aarnphm)
ffd3fa1 fix: initialize default decoding_config (aarnphm)
b64f5f5 merge: branch 'main' of github.com:vllm-project/vllm into feat/suppor… (aarnphm)
ddc9c47 merge: branch 'main' of github.com:vllm-project/vllm into feat/suppor… (aarnphm)
edd235b chore(test): use deepseek_r1 parser for qwen3 (aarnphm)
3cbbd8c chore: separate out reasoning tests (aarnphm)
a559b72 fix: reasoning tests to parse it (aarnphm)
1f3c369 chore: replicate duplicate thinking budget (aarnphm)
d5574be revert: remove duplications (aarnphm)
59f2aa7 chore: reorder test logs (aarnphm)
ded3890 chore: keep main change to reduce diff (aarnphm)
0fb92a5 fix: use deepseek_r1 parser for tests (aarnphm)
7ace2cb chore: use a slightly larger models for smarter cot (aarnphm)
1816b3b fix: support for qwen3 prompts (aarnphm)
91058ba merge: branch 'main' of github.com:vllm-project/vllm into feat/suppor… (aarnphm)
d96fa45 chore: make it more clear (aarnphm)