feat: Add OpenAI Rate limiting #1805
Conversation
Looks like this is adding rate limiting to both the OpenAI model and the Bedrock model. Why not add to all models?
@axiomofjoy I checked the other models and we were not catching any kind of rate limiting error in their implementations, so I think it's out of scope to try and add that functionality in this PR.
LGTM. I am wondering whether, long-term, the retry behavior would more naturally belong on the executor and be handled via a priority queue. An issue for another day.
* Implement adaptive rate limiter for OpenAI
* Add adaptive rate limiter to Bedrock model
* Use a sensible default maximum request rate
* Ruff 🐶
* Mark test as xfail after llama_index update
* Do not retry on rate limit errors with tenacity
* Remove xfail after llama_index version lock
* Use events and locks instead of nesting asyncio.run
* Ensure that events are always set after rate limit handling
* Retry on httpx ReadTimeout errors
* Update rate limiters with verbose generation info
* Improve end of queue handling in AsyncExecutor
* improve types to remove the need for casts (#1817)
* Improve interrupt handling
* Exit early from queue.join on termination events
* Properly cancel running tasks
* Add pytest-asyncio to hatch env
* Do not await cancelled tasks
* Improve task_done marking logic
* Increase default concurrency

Co-authored-by: Xander Song <axiomofjoy@gmail.com>
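Two of the commits above adjust retry behavior: rate limit errors are deliberately not retried by tenacity (so the adaptive limiter can react to them), while transient httpx ReadTimeout errors are retried. A minimal sketch of that split, using a hypothetical `call_model` helper rather than the PR's actual code, might look like:

```python
# Illustrative sketch only: retry transient read timeouts with tenacity,
# but let rate limit errors propagate to the adaptive rate limiter.
import httpx
import openai
from tenacity import retry, retry_if_exception_type, stop_after_attempt, wait_exponential


@retry(
    # Only httpx.ReadTimeout triggers a retry; openai.RateLimitError is
    # not listed, so it propagates and the limiter can lower its rate.
    retry=retry_if_exception_type(httpx.ReadTimeout),
    wait=wait_exponential(multiplier=1, max=30),
    stop=stop_after_attempt(5),
)
def call_model(client: openai.OpenAI, prompt: str) -> str:
    # Hypothetical helper; model name is an arbitrary example.
    response = client.chat.completions.create(
        model="gpt-4",
        messages=[{"role": "user", "content": prompt}],
    )
    return response.choices[0].message.content
```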
resolves #1663
Implements an adaptive rate limiter that gradually increases the request submission rate until a rate limit error is encountered; it then lowers the rate and blocks until the rate-limited request can complete.
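As a rough sketch of that behavior (the class and parameter names below are illustrative, not the PR's actual implementation), the limiter can be thought of as a paced clock whose rate grows multiplicatively while requests succeed and is cut on a rate limit error:

```python
# Hypothetical sketch of the adaptive strategy described above.
import time


class AdaptiveRateLimiter:
    def __init__(
        self,
        initial_rate: float = 1.0,  # requests per second
        max_rate: float = 10.0,     # sensible default maximum request rate
        growth: float = 1.05,       # multiplicative probe on each success
        backoff: float = 0.5,       # multiplicative cut on a rate limit error
    ) -> None:
        self.rate = initial_rate
        self.max_rate = max_rate
        self.growth = growth
        self.backoff = backoff
        self._next_slot = time.monotonic()

    def acquire(self) -> None:
        """Block until the next request slot opens at the current rate."""
        now = time.monotonic()
        if now < self._next_slot:
            time.sleep(self._next_slot - now)
        self._next_slot = max(now, self._next_slot) + 1.0 / self.rate

    def on_success(self) -> None:
        # Gradually increase the submission rate while requests succeed.
        self.rate = min(self.rate * self.growth, self.max_rate)

    def on_rate_limit_error(self) -> None:
        # Cut the rate (with a small floor) and push the next slot out,
        # so the rate-limited request blocks until it can be retried.
        self.rate = max(self.rate * self.backoff, 0.01)
        self._next_slot = time.monotonic() + 1.0 / self.rate
```

A caller would invoke `acquire()` before each request, `on_success()` after a successful response, and `on_rate_limit_error()` when catching a rate limit exception before retrying the blocked request.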