Add Olmo 3 reasoning parser #26054

Merged

DarkLight1337 merged 5 commits into vllm-project:main from soldni:main

Oct 4, 2025

Contributor

soldni commented Oct 1, 2025 • edited by github-actions bot

Purpose

This PR adds support for parsing reasoning traces from the Olmo 3 family of models. (Support for the models themselves was already merged in #24534.)

Olmo models use the strings <think> and </think> to bracket their reasoning traces; however, unlike in the Qwen or DeepSeek families, these strings are not special tokens in the vocabulary, so a new parser that matches them in the decoded text is required.
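
To illustrate why text-level matching is needed, here is a minimal sketch of splitting a complete (non-streaming) output on the two marker strings. It is not the code added in this PR: the helper name split_reasoning, the constants, and the handling of a missing opening or closing tag are all assumptions made for the example.

```python
from typing import Optional, Tuple

# Marker strings from the PR description. They are matched as plain text
# because they are not single special tokens in the vocabulary.
THINK_START = "<think>"
THINK_END = "</think>"


def split_reasoning(text: str) -> Tuple[Optional[str], Optional[str]]:
    """Hypothetical helper: return (reasoning, content) for a finished output.

    Assumptions of this sketch (not confirmed by the PR):
    - a missing opening tag is tolerated if a closing tag is present;
    - an opening tag without a closing tag means unfinished reasoning;
    - text before the opening tag is discarded.
    """
    start = text.find(THINK_START)
    end = text.find(THINK_END)

    if end != -1:
        # Reasoning begins after the opening tag when it precedes the closing
        # tag, otherwise at the start of the text.
        begin = start + len(THINK_START) if 0 <= start < end else 0
        reasoning = text[begin:end]
        content = text[end + len(THINK_END):]
        return reasoning or None, content or None

    if start != -1:
        # Opening tag only: everything after it is treated as reasoning.
        return text[start + len(THINK_START):] or None, None

    # No tags at all: the whole output is regular content.
    return None, text or None
```

For example, split_reasoning("<think>a</think>b") returns ("a", "b"), while an output with no tags is returned unchanged as content.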

Test Plan

This PR includes tests to verify that the parser behaves as expected. Existing tests should not be impacted.

Test Result

Run tests:

pytest tests/reasoning/test_olmo3_reasoning_parser.py

Test output:

============================= test session starts ==============================
platform linux -- Python 3.12.11, pytest-8.4.2, pluggy-1.6.0
rootdir: /home/ec2-user/vllm
configfile: pyproject.toml
plugins: anyio-4.11.0, forked-1.6.0, cov-7.0.0, shard-0.1.2, asyncio-1.2.0, timeout-2.4.0, rerunfailures-16.0.1, mock-3.15.1, buildkite-test-collector-0.1.9, subtests-0.14.2, hypothesis-6.140.2, schemathesis-4.1.4, hydra-core-1.3.2, typeguard-4.4.4
asyncio: mode=Mode.STRICT, debug=False, asyncio_default_fixture_loop_scope=None, asyncio_default_test_loop_scope=function
collected 14 items
Running 14 items in this shard

tests/reasoning/test_olmo3_reasoning_parser.py ..............            [100%]

============================== 14 passed in 3.50s ==============================

Essential Elements of an Effective PR Description Checklist

The purpose of the PR, such as "Fix some issue (link existing issues this PR will resolve)".
The test plan, such as providing the test command.
The test results, such as pasting a comparison of results before and after, or e2e results.
(Optional) The necessary documentation update, such as updating supported_models.md and examples for a new model.
(Optional) Release notes update. If your change is user facing, please update the release notes draft in the [Google Doc](https://docs.google.com/document/d/1YyVqrgX4gHTtrstbq8oWUImOyPCKSGnJ7xtTpmXzlRs/edit?tab=t.0).


          Add Olmo 3 reasoning parser

f5e4e6a

Signed-off-by: Luca Soldaini <luca@soldaini.net>

soldni requested review from aarnphm and chaunceyjiang as code owners

October 1, 2025 22:38

gemini-code-assist bot reviewed

Contributor

gemini-code-assist bot left a comment

Code Review

This pull request introduces a reasoning parser for Olmo 3 models. The implementation for non-streaming mode appears correct, and the test suite is a good start. However, the streaming implementation has a critical correctness issue that can lead to data loss when a stream ends with a partial reasoning tag. I've also identified some apparently unreachable code in the buffer processing logic. My review includes detailed comments on these issues.
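
The data-loss concern can be pictured with a small sketch. Because the tags arrive as ordinary text, a streamed chunk can end in the middle of a tag, so a parser has to hold back any suffix that might be the start of a tag and release it if the stream ends first. The code below is not the PR's implementation: the class StreamingThinkSplitter, its feed and flush methods, and the choice to toggle state on every tag are hypothetical and exist only to show the buffering idea.

```python
THINK_START = "<think>"
THINK_END = "</think>"


def partial_tag_len(buffer: str, tag: str) -> int:
    """Length of the longest proper prefix of `tag` that ends `buffer`."""
    for size in range(min(len(buffer), len(tag) - 1), 0, -1):
        if buffer.endswith(tag[:size]):
            return size
    return 0


class StreamingThinkSplitter:
    """Hypothetical sketch: split streamed text into reasoning and content."""

    def __init__(self) -> None:
        self.buffer = ""          # text received but not yet emitted
        self.in_reasoning = False

    def _state(self) -> tuple[str, str]:
        # The label for emitted text and the tag that would end this state.
        if self.in_reasoning:
            return "reasoning", THINK_END
        return "content", THINK_START

    def feed(self, delta: str) -> list[tuple[str, str]]:
        """Consume one text delta and return the (kind, text) pieces ready to emit."""
        self.buffer += delta
        out: list[tuple[str, str]] = []
        while True:
            kind, tag = self._state()
            idx = self.buffer.find(tag)
            if idx == -1:
                break
            if idx > 0:
                out.append((kind, self.buffer[:idx]))
            # Drop the complete tag and switch state.
            self.buffer = self.buffer[idx + len(tag):]
            self.in_reasoning = not self.in_reasoning
        # Hold back a trailing fragment that could still grow into a tag.
        kind, tag = self._state()
        keep = partial_tag_len(self.buffer, tag)
        cut = len(self.buffer) - keep
        if cut > 0:
            out.append((kind, self.buffer[:cut]))
        self.buffer = self.buffer[cut:]
        return out

    def flush(self) -> list[tuple[str, str]]:
        """Call at end of stream so a held-back partial tag is not lost."""
        kind, _ = self._state()
        out = [(kind, self.buffer)] if self.buffer else []
        self.buffer = ""
        return out
```

In this sketch, a stream that ends with "abc</thi" inside a reasoning block emits "abc" from feed and the leftover "</thi" from flush; without the final flush call that fragment would be dropped silently.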

vllm/reasoning/olmo3_reasoning_parser.py (outdated, resolved)

vllm/reasoning/olmo3_reasoning_parser.py (outdated, resolved)

soldni added 3 commits

October 2, 2025 02:58


          fixed dead code

100da8a

Signed-off-by: Luca Soldaini <luca@soldaini.net>


          fixes from yapf

1fa4108

Signed-off-by: Luca Soldaini <luca@soldaini.net>


          doc building failure

fbc3985

Signed-off-by: Luca Soldaini <luca@soldaini.net>

aarnphm approved these changes


          Merge branch 'main' into main

aa3fa8d

mgoin added new-model ready tool-calling labels

github-project-automation bot added this to Tool Calling

DarkLight1337 merged commit d0df145 into vllm-project:main

47 checks passed

github-project-automation bot moved this to Done in Tool Calling

Member

DarkLight1337 commented Oct 4, 2025

Can you open a follow-up PR to add it to https://docs.vllm.ai/en/latest/features/reasoning_outputs.html?

tomeras91 pushed a commit to tomeras91/vllm that referenced this pull request


          Add Olmo 3 reasoning parser (vllm-project#26054)

4aa7dd6

Signed-off-by: Luca Soldaini <luca@soldaini.net>
Signed-off-by: Tomer Asida <57313761+tomeras91@users.noreply.github.com>

karan pushed a commit to karan/vllm that referenced this pull request


          Add Olmo 3 reasoning parser (vllm-project#26054)

6a35391

Signed-off-by: Luca Soldaini <luca@soldaini.net>
Signed-off-by: Karan Goel <3261985+karan@users.noreply.github.com>

southfreebird pushed a commit to southfreebird/vllm that referenced this pull request


          Add Olmo 3 reasoning parser (vllm-project#26054)

082cdda

Signed-off-by: Luca Soldaini <luca@soldaini.net>

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request


          Add Olmo 3 reasoning parser (vllm-project#26054)

5c521c5

Signed-off-by: Luca Soldaini <luca@soldaini.net>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>

lywa1998 pushed a commit to lywa1998/vllm that referenced this pull request


          Add Olmo 3 reasoning parser (vllm-project#26054)

Signed-off-by: Luca Soldaini <luca@soldaini.net>

alhridoy pushed a commit to alhridoy/vllm that referenced this pull request


          Add Olmo 3 reasoning parser (vllm-project#26054)

5178a0c

Signed-off-by: Luca Soldaini <luca@soldaini.net>

xuebwang-amd pushed a commit to xuebwang-amd/vllm that referenced this pull request


          Add Olmo 3 reasoning parser (vllm-project#26054)

50d9216

Signed-off-by: Luca Soldaini <luca@soldaini.net>
Signed-off-by: xuebwang-amd <xuebwang@amd.com>

rtourgeman pushed a commit to rtourgeman/vllm that referenced this pull request


          Add Olmo 3 reasoning parser (vllm-project#26054)

910e23e

Signed-off-by: Luca Soldaini <luca@soldaini.net>

Labels

new-model ready tool-calling