Description
OpenAI released a new service tier called flex.
It reduces the cost of the tokens at the expense of performance/availability.
Is it possible to add a service_tier parameter to the OpenAIModel and the ability to control the timeout value?
References
OpenAI documentation