Load balancing

### Description

I am running into trouble with rate limits in some scenarios. I would love to see a feature in pydantic ai where I can provide the agent with (multiple) models/fallback models, so the agent could then use the other models if rate limits are reached.

Right now, the only way to achieve this is to define multiple agents, give each a model and wrap them all over the system prompts and tools, which is already really messy and then try catch the rate limits and balance the load elsewhere.

### References

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Load balancing #1983

Description

References

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Load balancing #1983

Description

Description

References

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions