Conversation

@dmontagu
Contributor

This came up in a discussion in the public Slack with @mpfaffenberger: https://pydanticlogfire.slack.com/archives/C083V7PMHHA/p1752430089758299.

Some things to resolve before merging:

  • Are we okay with this module naming? Should we call it pydantic_ai.retries or similar instead of pydantic_ai.tenacity?
  • Are the tests/docs adequate?
  • Do we need to add some integration for Model? I have a tenacity-integrated WrapperModel, but I'm not sure if it's necessary/very useful on top of the async transport stuff currently included in this PR.

@dmontagu
Contributor Author

In case it's useful, here is the RetryModel implementation for future reference.

from __future__ import annotations as _annotations

from collections.abc import AsyncIterator
from contextlib import asynccontextmanager
from dataclasses import dataclass
from typing import Literal

from tenacity import AsyncRetrying

from . import KnownModelName, Model, ModelRequestParameters, StreamedResponse
from .wrapper import WrapperModel
from ..messages import ModelMessage, ModelResponse
from ..settings import ModelSettings


@dataclass(init=False)
class RetryModel(WrapperModel):
    def __init__(
        self,
        wrapped: Model | KnownModelName,
        retry: AsyncRetrying | None = None,
        retry_stream: AsyncRetrying | Literal[False] | None = None,
    ):
        super().__init__(wrapped)
        self.controller = retry
        self.stream_controller = retry if retry_stream is None else retry_stream

    async def request(
        self,
        messages: list[ModelMessage],
        model_settings: ModelSettings | None,
        model_request_parameters: ModelRequestParameters,
    ) -> ModelResponse:
        if self.controller is None:
            # No retry controller was provided; make a single request without retrying.
            return await super().request(messages, model_settings, model_request_parameters)
        async for attempt in self.controller:
            with attempt:
                return await super().request(messages, model_settings, model_request_parameters)
        raise RuntimeError('The retry controller did not make any attempts')

    @asynccontextmanager
    async def request_stream(
        self,
        messages: list[ModelMessage],
        model_settings: ModelSettings | None,
        model_request_parameters: ModelRequestParameters,
    ) -> AsyncIterator[StreamedResponse]:
        if not self.stream_controller:
            # No special retrying logic for streaming in this case:
            async with super().request_stream(messages, model_settings, model_request_parameters) as stream:
                yield stream
                return

        entered_stream = False
        async for attempt in self.stream_controller:
            # Enter the attempt manually so it can be closed before the stream is handed to the caller:
            attempt.__enter__()
            try:
                async with super().request_stream(messages, model_settings, model_request_parameters) as stream:
                    entered_stream = True
                    attempt.__exit__(None, None, None)  # mark the attempt as successful before yielding
                    yield stream
                    return
            finally:
                if not entered_stream:
                    attempt.__exit__(None, None, None)
        raise RuntimeError('The retry controller did not make any attempts')
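
In case someone wants to see it wired up: a minimal usage sketch, where the retry policy (three attempts with exponential backoff) and the model name are illustrative choices, not anything this PR prescribes:

from tenacity import AsyncRetrying, stop_after_attempt, wait_exponential

# Hypothetical wiring: wrap a model with a retry controller.
retrying = AsyncRetrying(
    stop=stop_after_attempt(3),
    wait=wait_exponential(max=10),
    reraise=True,
)
model = RetryModel('openai:gpt-4o', retry=retrying)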

@hyperlint-ai
Contributor

hyperlint-ai bot commented Jul 23, 2025

PR Change Summary

Added tenacity utilities for improved retry handling in HTTP requests, enhancing error resilience and user experience.

  • Introduced the pydantic_ai.tenacity module for retry functionality in HTTP requests.
  • Added detailed documentation on using tenacity for handling transient failures.
  • Implemented transport classes for both asynchronous and synchronous HTTP clients.

Added Files

  • docs/api/tenacity.md
  • docs/retries.md

How can I customize these reviews?

Check out the Hyperlint AI Reviewer docs for more information on how to customize the review.

If you just want to ignore it on this PR, you can add the hyperlint-ignore label to the PR. Future changes won't trigger a Hyperlint review.

Note on link checks: we only check the first 30 links in a file, and we cache the results for several hours (so if you just added a page, the cached result may be stale). Our recommendation is to add hyperlint-ignore to the PR to ignore the link check for this PR.

@github-actions

github-actions bot commented Jul 23, 2025

Docs Preview

commit: 3cd330d
Preview URL: https://a88e5ec5-pydantic-ai-previews.pydantic.workers.dev

@DouweM
Collaborator

DouweM commented Jul 23, 2025

  • Are we okay with this module naming? Should we call it pydantic_ai.retries or similar instead of pydantic_ai.tenacity?

I think that'd be better

  • Are the tests/docs adequate?

I think so, save for some comments I left

  • Do we need to add some integration for Model? I have a tenacity-integrated WrapperModel, but I'm not sure if it's necessary/very useful on top of the async transport stuff currently included in this PR.

I don't think we need it; acting directly on the HTTP client level is more powerful.
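
To make "the HTTP client level" concrete, here is a rough sketch of what a tenacity-retrying httpx transport can look like; the class name, status-code list, and retry policy are illustrative and not necessarily the API this PR ships:

import httpx
from tenacity import AsyncRetrying, retry_if_exception_type, stop_after_attempt, wait_exponential


class RetryingAsyncTransport(httpx.AsyncBaseTransport):
    """Wrap another transport and retry failed requests with tenacity (sketch)."""

    def __init__(self, wrapped: httpx.AsyncBaseTransport, controller: AsyncRetrying):
        self.wrapped = wrapped
        self.controller = controller

    async def handle_async_request(self, request: httpx.Request) -> httpx.Response:
        async for attempt in self.controller:
            with attempt:
                response = await self.wrapped.handle_async_request(request)
                response.request = request  # so raise_for_status() has a request to reference
                if response.status_code in (429, 502, 503, 504):
                    response.raise_for_status()  # raises HTTPStatusError, which the controller can retry
                return response
        raise RuntimeError('The retry controller did not make any attempts')


controller = AsyncRetrying(
    retry=retry_if_exception_type(httpx.HTTPStatusError),
    wait=wait_exponential(max=30),
    stop=stop_after_attempt(5),
    reraise=True,
)
client = httpx.AsyncClient(transport=RetryingAsyncTransport(httpx.AsyncHTTPTransport(), controller))

Since the retrying lives inside the transport, any provider that accepts a custom http_client picks it up for every request, without per-model wrapping.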

@mpfaffenberger
Contributor

Watching you guys work is awesome. Appreciate this! Thank you!

@DouweM DouweM enabled auto-merge (squash) July 25, 2025 17:24
@DouweM DouweM merged commit 4941468 into main Jul 25, 2025
17 checks passed
@DouweM DouweM deleted the dmontagu/retry-handling branch July 25, 2025 17:34
def should_retry_status(response):
    """Raise exceptions for retryable HTTP status codes."""
    if response.status_code in (429, 502, 503, 504):
        response.raise_for_status()  # This will raise HTTPStatusError
@odedva

I've been trying to follow this approach, but for some reason, when the code reaches this line it explodes because the request is not set on the response object.
I wonder if it's related to this new change.

Collaborator

@odedva Thanks for the report, can you please file a new issue for this?
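
For whoever picks that issue up, one plausible cause, offered as an assumption rather than a diagnosis: httpx only attaches the request to the response at the client level, after the transport returns, so a validator like should_retry_status that runs inside a transport wrapper can see response.request unset, and raise_for_status() then fails. If that is indeed the cause, attaching the request before validating works around it; a minimal sketch of the transport method, with self.wrapped assumed from the surrounding wrapper:

async def handle_async_request(self, request: httpx.Request) -> httpx.Response:
    response = await self.wrapped.handle_async_request(request)
    # httpx's Client normally sets this after the transport returns; setting it here
    # lets raise_for_status() inside the validator construct its HTTPStatusError.
    response.request = request
    should_retry_status(response)
    return response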

