[Tests] Disable retries and use context manager for openai client #7565

njhill · 2024-08-15T20:57:26Z

The openai python client by default retries failed requests up to two times. In our tests I think we should disable this to avoid hiding issues.

Doing this actually caused some failures (at least with another PR I'm working on), which seem to be related to how a single client fixture is shared between multiple async tests.

The openai docs suggest it should be used via a context manager so I have updated the various usages to do so and have reduced the scope of associated client fixtures.

github-actions · 2024-08-15T20:57:42Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which consists a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of default ones by unblocking the steps in your fast-check build on Buildkite UI.

Once the PR is approved and ready to go, please make sure to run full CI as it is required to merge (or just use auto-merge).

To run full CI, you can do one of these:

Comment /ready on the PR
Add ready label to the PR
Enable auto-merge.

🚀

youkaichao

LGTM, thanks for the fix!

DarkLight1337 · 2024-08-16T00:22:54Z

The guided decoding tests have been shown to be fragile (see #5526 (comment)). Seems like changing the usage of OpenAI client in this way breaks them as well.

njhill · 2024-08-27T00:13:52Z

Thanks @DarkLight1337. I was digging into the cause of the failures here but got diverted, going to resume that now :)

The openai python client by default retries failed requests up to two times. In our tests I think we should disable this to avoid hiding issues. Doing this actually caused some other failures, which seem to be related to how a single client fixture is shared between multiple async tests. The openai docs suggest it should be used via a context manager so I have updated the various usages to do so and have reduced the scope of associated client fixtures.

…lm-project#7565)

…lm-project#7565) Signed-off-by: Alvant <alvasian@yandex.ru>

…lm-project#7565)

youkaichao approved these changes Aug 15, 2024

View reviewed changes

njhill mentioned this pull request Aug 15, 2024

[Core] Add engine option to return only deltas or final output #7381

Merged

njhill added 2 commits August 26, 2024 17:14

more haste less speed

8d21009

njhill force-pushed the oai-client-retries branch from e562aba to 8d21009 Compare August 27, 2024 00:15

Update more-recently-added tests

dc839c7

njhill added the ready ONLY add when PR is ready to merge/full CI is needed label Aug 27, 2024

One more

bdaf563

njhill merged commit 39178c7 into vllm-project:main Aug 27, 2024
32 checks passed

njhill deleted the oai-client-retries branch August 27, 2024 04:33

mgoin mentioned this pull request Sep 3, 2024

[Feature] OpenAI-Compatible Tools API + Streaming for Hermes & Mistral models #5649

Merged

20 tasks

K-Mistele added a commit to Constellate-AI/vllm that referenced this pull request Sep 4, 2024

fix(tests): update pytest fixture for client based on vllm-project#7565

4972a89

triple-Mu pushed a commit to triple-Mu/vllm_official that referenced this pull request Sep 4, 2024

[Tests] Disable retries and use context manager for openai client (vl…

4753d8a

…lm-project#7565)

Jeffwan pushed a commit to aibrix/vllm that referenced this pull request Sep 19, 2024

[Tests] Disable retries and use context manager for openai client (vl…

b86824a

…lm-project#7565)

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request Oct 26, 2024

[Tests] Disable retries and use context manager for openai client (vl…

f9c9a4d

…lm-project#7565) Signed-off-by: Alvant <alvasian@yandex.ru>

KuntaiDu pushed a commit to KuntaiDu/vllm that referenced this pull request Nov 20, 2024

[Tests] Disable retries and use context manager for openai client (vl…

d13cb0c

…lm-project#7565)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Tests] Disable retries and use context manager for openai client #7565

[Tests] Disable retries and use context manager for openai client #7565

njhill commented Aug 15, 2024

github-actions bot commented Aug 15, 2024

youkaichao left a comment

DarkLight1337 commented Aug 16, 2024 •

edited

Loading

njhill commented Aug 27, 2024

[Tests] Disable retries and use context manager for openai client #7565

[Tests] Disable retries and use context manager for openai client #7565

Conversation

njhill commented Aug 15, 2024

github-actions bot commented Aug 15, 2024

youkaichao left a comment

Choose a reason for hiding this comment

DarkLight1337 commented Aug 16, 2024 • edited Loading

njhill commented Aug 27, 2024

DarkLight1337 commented Aug 16, 2024 •

edited

Loading