Skip to content

Supporting OpenAI's Flex Processing #1919

@empezarcero

Description

@empezarcero

Description

OpenAI released a new service tier called flex.
It reduces the cost of the tokens at the expense of performance/availability.
Is it possible to add a service_tier parameter to the OpenAIModel and the ability to control the timeout value?

References

OpenAI documentation

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions