Conversation

@benchislett benchislett (Collaborator) commented Nov 3, 2025

Purpose

GPT-OSS can escape structured outputs by emitting tokens between <|channel|>final and <|message|>, for example: <|start|>assistant<|channel|>final <|constrain|>JSON<|message|>content....

This PR updates the parser to look for <|start|>assistant<|channel|>final and then the suffix <|message|>. This doesn't cover all cases, such as tool calling (see #23120), but it does work around a few common issues.
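
As a rough illustration of the matching idea only (the helper names, parameters, and the max_gap window below are assumptions for this sketch, not the actual vLLM implementation):

# Hypothetical sketch: detect the end of reasoning by locating the prefix
# token sequence, then looking for the suffix within a bounded window after it.
def contains_subsequence(haystack: list[int], needle: list[int]) -> int:
    # Return the index just past the first occurrence of needle, or -1.
    for i in range(len(haystack) - len(needle) + 1):
        if haystack[i : i + len(needle)] == needle:
            return i + len(needle)
    return -1

def is_reasoning_end(
    token_ids: list[int],
    prefix_ids: list[int],   # e.g. tokens for "<|channel|>final"
    suffix_ids: list[int],   # e.g. tokens for "<|message|>"
    max_gap: int = 16,       # tolerate a few tokens such as " <|constrain|>JSON"
) -> bool:
    end_of_prefix = contains_subsequence(token_ids, prefix_ids)
    if end_of_prefix == -1:
        return False
    # Only search a short window after the prefix for the suffix.
    window = token_ids[end_of_prefix : end_of_prefix + max_gap + len(suffix_ids)]
    return contains_subsequence(window, suffix_ids) != -1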

Test Plan

Added a unit test for GptOssReasoningParser to target is_reasoning_end specifically. Future work can extend this to cover more cases as needed.

Test Result

Tests now passing.

I also ran the benchmark script:

python3 ../benchmarks/benchmark_serving_structured_output.py --backend openai-chat --model openai/gpt-oss-120b --dataset xgrammar_bench --structured-output-ratio 1.0 --request-rate 100 --num-prompts 100 --port 8046 --endpoint /openai/v1/chat/completions --output-len 900 --save-results

and saw 98% accuracy before this change (the same as with --structured-output-ratio set to 0.0; at least one confirmed failure was due to this feature gap) and 100% accuracy after this change.

Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>

@gemini-code-assist gemini-code-assist bot left a comment

Code Review

This pull request addresses a bug in the GptOss reasoning parser, which previously failed to identify the end of a reasoning block when extra tokens appeared between the final channel and the message tag. The fix searches for a prefix and a suffix separately within a bounded token distance, which resolves the issue and improves parsing accuracy. The new unit test suite for is_reasoning_end is a valuable addition that validates the logic across several scenarios.

I have one suggestion to enhance the performance of the is_reasoning_end method, which could become a performance bottleneck with long input sequences.
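
For illustration only (this is not the bot's actual suggestion, and it reuses the hypothetical contains_subsequence helper from the sketch above): one way to keep the check cheap is to bound the scan to the most recently generated tokens, since the end-of-reasoning markers can only appear near the tail of the sequence.

# Illustrative sketch only: do bounded work per call instead of re-scanning
# the whole token sequence on every decoding step.
def is_reasoning_end_bounded(
    token_ids: list[int],
    prefix_ids: list[int],
    suffix_ids: list[int],
    lookback: int = 64,   # assumed window size covering the trailing markers
) -> bool:
    tail = token_ids[-lookback:]
    end_of_prefix = contains_subsequence(tail, prefix_ids)
    if end_of_prefix == -1:
        return False
    return contains_subsequence(tail[end_of_prefix:], suffix_ids) != -1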

@benchislett benchislett added the bug (Something isn't working) label Nov 3, 2025
# The model can output some special tokens between "final" and "<|message|>"
# So we need to look for both sequences to determine the end of reasoning.
self.reasoning_end_token_ids_prefix = self.model_tokenizer.encode(
    "<|start|>assistant<|channel|>final"
)

Contributor

The model will only output <|start|>assistant if it decides to emit a reasoning message; in some situations it may jump straight to the final message. So it may be more reasonable to match just <|channel|>final instead, as that covers both situations.

To cover tool-calling scenarios: as soon as you see to= in the header, the message is a tool call and no longer reasoning.

Both situations are still covered by the <|message|> ending suffix, though.
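
For illustration, a hedged sketch of the header rule described above (the classify_header helper and the example strings are hypothetical, not code from this PR):

# Hypothetical sketch of the rule above: once the header preceding
# "<|message|>" is known, classify what the message body will be.
def classify_header(header: str) -> str:
    if "to=" in header:
        return "tool_call"   # a recipient in the header means a tool call, not reasoning
    if "<|channel|>final" in header:
        return "final"       # the final answer follows the next <|message|>
    return "reasoning"       # e.g. the analysis channel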


@benchislett benchislett (Collaborator, Author) Nov 4, 2025

Updated to cover the first case. I think it would be easiest for me to leave any tool-calling updates to follow-up work, as I have no context on how we handle them in vLLM. I'm also not entirely sure tool calling even uses this code pathway anymore since the work using structured tags.

Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
@github-project-automation github-project-automation bot moved this from To Triage to Ready in gpt-oss Issues & Enhancements Nov 5, 2025

@alecsolder alecsolder (Contributor) left a comment

Looks good to me. I will have a follow-up PR ready soon that adjusts this for tool calling as well.

@yeqcharlotte yeqcharlotte enabled auto-merge (squash) November 6, 2025 07:47
@github-actions github-actions bot added the ready (ONLY add when PR is ready to merge/full CI is needed) label Nov 6, 2025
@yeqcharlotte yeqcharlotte merged commit 1890321 into vllm-project:main Nov 7, 2025
45 checks passed
xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request Nov 13, 2025
…28000)

Signed-off-by: Benjamin Chislett <bchislett@nvidia.com>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>

Labels

bug (Something isn't working), gpt-oss (Related to GPT-OSS models), ready (ONLY add when PR is ready to merge/full CI is needed)

Projects

Status: Done

4 participants