Add OpenRouterModel as OpenAIChatModel subclass #3089

ajac-zero · 2025-10-05T05:43:04Z

Hi! This pull request takes a shot at implementing a dedicated OpenRouterModel model. Closes #2936.

The differentiator for this PR is that this implementation minimizes code duplication as much as possible by delegating the main logic to OpenAIChatModel, such that the new model class serves as a convenience layer for OpenRouter specific features.

The main thinking behind this solution is that as long as the OpenRouter API is still fully accessible via the openai package, it would be inefficient to reimplement the internal logic using this same package again. We can instead use hooks to achieve the requested features.

I would like to get some thoughts on this implementation before starting to update the docs.

Addressed issues

Closes Store OpenRouter provider metadata in ModelResponse vendor details #1849

Provider metadata can now be accessed via the 'downstream_provider' key in ModelMessage.provider_details:

from pydantic_ai import ModelRequest
from pydantic_ai.direct import model_request_sync
from pydantic_ai.models.openrouter import OpenRouterModel

model = OpenRouterModel('moonshotai/kimi-k2-0905')

response = model_request_sync(model, [ModelRequest.user_text_prompt('Who are you')])

assert response.provider_details is not None
print(response.provider_details['downstream_provider'])  # <-- Final provider that was routed to
# Output: AtlasCloud

Closes Can I get thinking part from openrouter provider using google/gemini-2.5-pro? #2999

The new OpenRouterModelSettings allows for the reasoning parameter by OpenRouter, the thinking can then be accessed as a ThinkingPart in the model response:

from pydantic_ai import ModelRequest
from pydantic_ai.direct import model_request_sync
from pydantic_ai.models.openrouter import OpenRouterModel, OpenRouterModelSettings

model = OpenRouterModel('google/gemini-2.5-pro')

settings = OpenRouterModelSettings(openrouter_reasoning={'effort': 'high'})

response = model_request_sync(model, [ModelRequest.user_text_prompt('Who are you')], model_settings=settings)

print(response.parts[0])
# Output: ThinkingPart(content='**Identifying the Core Inquiry**\n\nI\'m grappling with the core question: "Who am I?" Initially, I\'m identifying the root of the query. The user wants a fundamental identity explained, and I\'ve begun by pinpointing the key words and associations. AI, specifically. Next step, I\'ll move onto broadening this.\n\n\n**Clarifying My Nature**\n\nI\'m now dissecting the definition of "language model," focusing on what that *means* in practical terms. I\'ve moved past simply stating the term and am now delving into how my functions—answering, generating, translating—are executed. This requires explaining my training on vast datasets and my lack of personal experience, which is key to the identity question. I am trying to find the right framing for this complex process.\n\n\n**Formulating a Direct Response**\n\nI\'m now trying to directly answer the question, avoiding technical jargon where possible. I\'m organizing my response. The essential elements have been identified: My nature, my capabilities, and what I *cannot* do. I\'m thinking of ways to explain these facts in a concise, accessible format, focusing on clarity for the user.\n\n\n**Constructing a Detailed Answer**\n\nI\'m now translating the structured plan into actual sentences. I\'m working on the opening, the "I am..." statement, and aiming for a direct, clear tone. Then, I am carefully crafting the explanation of my capabilities and limitations to avoid misunderstandings. I\'m actively searching for concise and impactful language.\n\n\n**Drafting the Final Response**\n\n\\n\\n\n\nI\'m now integrating all the elements I\'ve identified. I\'m beginning the final draft. I\'m focusing on flow and readability, weaving the key points—my nature, my origin, my abilities, and my constraints—into a cohesive narrative. The goal is a concise and informative self-description, tailored to the user\'s inquiry.\n\n\n', id='reasoning', provider_name='openrouter')

Closes Handle error response from OpenRouter as exception instead of validation failure #2323. Closes OpenRouter uses non-compatible finish reason #2844

These are dependent on some downstream logic from OpenRouter or their own downstream providers (that a response of type 'error' will have a >= 400 status code), but for most cases I would say it works as one would expect:

from pydantic_ai import ModelHTTPError, ModelRequest
from pydantic_ai.direct import model_request_sync
from pydantic_ai.models.openrouter import OpenRouterModel, OpenRouterModelSettings

model = OpenRouterModel('google/gemini-2.5-pro')

settings = OpenRouterModelSettings(
    openrouter_preferences={'only': ['azure']}  # Gemini is not available in Azure; Guaranteed failure.
)

try:
    response = model_request_sync(model, [ModelRequest.user_text_prompt('Who are you')], model_settings=settings)
except ModelHTTPError as e:
    print(e)
# status_code: 404, model_name: google/gemini-2.5-pro, body: {'message': 'No allowed providers are available for the selected model.', 'code': 404}

Add OpenRouterModel #1870 (comment)

Add some additional type support to set the provider routing options from OpenRouter:

from pydantic_ai import ModelRequest
from pydantic_ai.direct import model_request_sync
from pydantic_ai.models.openrouter import OpenRouterModel, OpenRouterModelSettings

model = OpenRouterModel('moonshotai/kimi-k2-0905')

settings = OpenRouterModelSettings(
    openrouter_preferences={
        'order': ['moonshotai', 'deepinfra', 'fireworks', 'novita'],
        'allow_fallbacks': True,
        'require_parameters': True,
        'data_collection': 'allow',
        'zdr': True,
        'only': ['moonshotai', 'fireworks'],
        'ignore': ['deepinfra'],
        'quantizations': ['fp8'],
        'sort': 'throughput',
        'max_price': {'prompt': 1},
    }
)

response = model_request_sync(model, [ModelRequest.user_text_prompt('Who are you')], model_settings=settings)
assert response.provider_details is not None
print(response.provider_details['downstream_provider'])
# Output: Fireworks

DouweM

@ajac-zero Muchas gracias Anibal!

pydantic_ai_slim/pydantic_ai/models/openrouter.py

Co-authored-by: Douwe Maan <me@douwe.me>

pydantic_ai_slim/pydantic_ai/models/openrouter.py

ajac-zero · 2025-10-13T13:10:55Z

Buen día @DouweM, can you take a look when you get the chance?

DouweM

Gracias!

It'd be interesting to add support for the WebSearchTool built-in tool as well, shouldn't be too complicated I think: https://openrouter.ai/docs/features/web-search

pydantic_ai_slim/pydantic_ai/models/openrouter.py

DouweM · 2025-10-21T17:03:37Z

@ajac-zero We can also remove this comment from openai.py:

pydantic-ai/pydantic_ai_slim/pydantic_ai/models/openai.py

Lines 558 to 560 in c5b1495

    
           # NOTE: We don't currently handle OpenRouter `reasoning_details`: 
        
           # - https://openrouter.ai/docs/use-cases/reasoning-tokens#preserving-reasoning-blocks 
        
           # If you need this, please file an issue.

Co-authored-by: Douwe Maan <me@douwe.me>

xcpky · 2025-10-26T12:04:29Z

Hi, just found this useful pr and I think top_k and other missing model config should be added to align with the Request Schema documented here https://openrouter.ai/docs/api-reference/overview.

DouweM · 2025-10-28T14:15:44Z

@ajac-zero Please have a look at the failing linting & coverage!

ajac-zero · 2025-10-29T19:46:32Z

@DouweM This part from OpenAIChatModel is causing some unexpected behavior with the thinking content, because it appends it to the message content, which OpenRouter doesn't want.

pydantic-ai/pydantic_ai_slim/pydantic_ai/models/openai.py

Lines 684 to 689 in 63d1b84

    
           elif isinstance(item, ThinkingPart): 
        
               # NOTE: DeepSeek `reasoning_content` field should NOT be sent back per https://api-docs.deepseek.com/guides/reasoning_model, 
        
               # but we currently just send it in `<think>` tags anyway as we don't want DeepSeek-specific checks here. 
        
               # If you need this changed, please file an issue. 
        
               start_tag, end_tag = self.profile.thinking_tags 
        
               texts.append('\n'.join([start_tag, item.content, end_tag]))

I fixed it by regexing the content afterward, but I don't like this solution very much.

pydantic-ai/pydantic_ai_slim/pydantic_ai/models/openrouter.py

Lines 474 to 477 in 63d1b84

    
           if openai_message['role'] == 'assistant' and isinstance( 
        
               contents := openai_message.get('content'), str 
        
           ):  # pragma: lax no cover 
        
               openai_message['content'] = re.sub(r'<think>.*?</think>\s*', '', contents, flags=re.DOTALL).strip()

I am hesitant on changing OpenAIChatModel at all, but maybe that we could wrap the ThinkingPart logic in a function, and then override it from OpenRouterModel and any future models?

DouweM · 2025-10-29T22:22:19Z

@ajac-zero Yep I think pulling this into a method that can be overridden in a subclass is a good idea. It shouldn't be specific for ThinkingPart, but for the entire ModelResponsePart, so that it has that isinstance(...) branching it, and and the overridden method can short-circuit by handling ThinkingPart itself.

pydantic_ai_slim/pydantic_ai/models/openai.py

DouweM · 2025-11-03T21:42:33Z

pydantic_ai_slim/pydantic_ai/models/openai.py

-                        maybe_event.part.id = 'content'
-                        maybe_event.part.provider_name = self.provider_name
-                    yield maybe_event
+    async def _validate_response(self):


This and the other new methods need a return type hint

DouweM · 2025-11-03T21:42:51Z

pydantic_ai_slim/pydantic_ai/models/openai.py

+        By default, this is a no-op since `ChatCompletionChunk` is already validated.
+        """
+        async for chunk in self._response:
+            yield chunk


Could this just return self._response

I tested it but no :( If we use return the method returns a coroutine instead, we have to use yield for it to behave as an async iterable

We could make the method sync though, and then return the coroutine, and I think it'll work

pydantic_ai_slim/pydantic_ai/models/openai.py

DouweM · 2025-11-03T22:01:15Z

pydantic_ai_slim/pydantic_ai/models/openrouter.py

+    @override
+    def _process_reasoning(self, response: chat.ChatCompletion) -> list[ThinkingPart]:
+        # We can cast with confidence because response was validated in `_validate_completion`
+        response = cast(OpenRouterChatCompletion, response)


Can we do an assert isinstance instead to raise an error if we somehow get here with an unexpected type?

pydantic_ai_slim/pydantic_ai/models/openrouter.py

DouweM · 2025-11-03T22:02:45Z

pydantic_ai_slim/pydantic_ai/models/openrouter.py

+        message = response.choices[0].message
+        items: list[ThinkingPart] = []
+
+        if reasoning_details := message.reasoning_details:


Are we sure we can entirely drop the .reasoning field processing we have in the superclass? Will OpenRouter always have reasoning_details as well?

DouweM · 2025-11-03T22:03:06Z

pydantic_ai_slim/pydantic_ai/models/openrouter.py

+        provider_details = super()._process_provider_details(response)
+
+        provider_details['downstream_provider'] = response.provider
+        provider_details['native_finish_reason'] = response.choices[0].native_finish_reason


Are we 100% sure there's more than 1 choice at this point?

DouweM · 2025-11-03T22:03:55Z

pydantic_ai_slim/pydantic_ai/models/openrouter.py

+            if isinstance(item, TextPart):
+                texts.append(item.content)
+            elif isinstance(item, ThinkingPart):
+                if item.provider_name == self.system and isinstance(item, OpenRouterThinkingPart):


See above, this unfortunately won't work right :(

ajac-zero added 3 commits October 3, 2025 18:40

Add OpenRouter support and test coverage

227e873

Add OpenRouter reasoning config and refactor response details

c3c1546

Move OpenRouterModelSettings import into try block

10a1a17

DouweM self-assigned this Oct 7, 2025

DouweM requested changes Oct 7, 2025

View reviewed changes

DouweM mentioned this pull request Oct 7, 2025

Add native OpenRouter model #2409

Closed

DouweM added the awaiting author revision label Oct 7, 2025

ajac-zero and others added 5 commits October 7, 2025 13:36

Update pydantic_ai_slim/pydantic_ai/models/openrouter.py

5e64a62

Co-authored-by: Douwe Maan <me@douwe.me>

Merge branch 'main' into main

5e787da

Handle OpenRouter errors and extract response metadata

e219b8c

Merge branch 'pydantic:main' into main

c5e0600

Add type ignores to tests

6f99fb2

DouweM requested changes Oct 9, 2025

View reviewed changes

pydantic_ai_slim/pydantic_ai/models/openrouter.py Outdated Show resolved Hide resolved

pydantic_ai_slim/pydantic_ai/models/openrouter.py Outdated Show resolved Hide resolved

pydantic_ai_slim/pydantic_ai/models/openrouter.py Outdated Show resolved Hide resolved

DouweM mentioned this pull request Oct 10, 2025

Create LiteLLMModel to fix thinking parts not being sent to Anthropic on Vertex via LiteLLM with OpenAIChatModel #3113

Open

2 tasks

ajac-zero added 3 commits October 10, 2025 09:17

Merge branch 'pydantic:main' into main

83d14b1

Send back reasoning_details/signature

ef3c6dd

Merge branch 'pydantic:main' into main

0ba3691

DouweM requested changes Oct 13, 2025

View reviewed changes

ajac-zero added 2 commits October 16, 2025 11:08

add OpenRouterChatCompletion model

ed9e7df

Merge branch 'pydantic:main' into main

0689e29

DouweM requested changes Oct 21, 2025

View reviewed changes

ajac-zero and others added 6 commits October 24, 2025 06:48

Update pydantic_ai_slim/pydantic_ai/models/openrouter.py

75adbb4

Co-authored-by: Douwe Maan <me@douwe.me>

Update pydantic_ai_slim/pydantic_ai/models/openrouter.py

ab9d690

Co-authored-by: Douwe Maan <me@douwe.me>

Merge branch 'main' into main

db1630d

fix spelling mistake

5700a19

add openrouter web plugin

ee93121

WIP build reasoning_details from ThinkingParts

ca45f8a

ajac-zero added 11 commits October 28, 2025 10:18

Merge branch 'main' into main

91bee62

wip reasoning details conversion

b325816

finish openrouter thinking part

1db529f

add preserve reasoning tokens test

3d7f1b4

fix typing

1d7a8a4

Merge branch 'main' into main

e81621b

Merge branch 'main' into main

c6aca8d

remove <thinking> tags from content

516e823

Merge branch 'main' into main

b8406d0

fix typing

c16c960

Merge branch 'main' into main

63d1b84

DouweM mentioned this pull request Oct 30, 2025

Unsupported binary content type: video/mp4 #3121

Closed

2 tasks

ajac-zero added 7 commits November 1, 2025 08:33

Merge branch 'main' into main

0835073

add _map_model_response method

baede41

move assert_never import to typing_extensions

89ef9a8

add tool calling test

ebc8d08

replace process_response with hooks

21a78e4

add stream hooks

0b37792

simplify hooks

8d090f0

DouweM mentioned this pull request Nov 3, 2025

Unlock VideoUrl for vLLM Models #3306

Open

ajac-zero added 4 commits November 3, 2025 14:51

fix coverage/linting

e8c3c81

Merge branch 'main' into main

02b8527

Merge branch 'main' into main

895ea03

fix lint

7c50f07

DouweM requested changes Nov 3, 2025

View reviewed changes

ajac-zero added 2 commits November 4, 2025 07:49

replace OpenRouterThinking with encoding in 'id'

8e32475

Merge branch 'main' into main

9d57be0

Add OpenRouterModel as OpenAIChatModel subclass #3089

Are you sure you want to change the base?

Add OpenRouterModel as OpenAIChatModel subclass #3089

Uh oh!

Conversation

ajac-zero commented Oct 5, 2025 • edited by DouweM Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Addressed issues

Uh oh!

DouweM left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ajac-zero commented Oct 13, 2025

Uh oh!

DouweM left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

DouweM commented Oct 21, 2025

Uh oh!

xcpky commented Oct 26, 2025

Uh oh!

DouweM commented Oct 28, 2025

Uh oh!

ajac-zero commented Oct 29, 2025

Uh oh!

DouweM commented Oct 29, 2025

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ajac-zero commented Oct 5, 2025 •

edited by DouweM

Loading