fix: Fix DB unittest reliability #4548

anticorrelator · 2024-09-07T05:29:34Z

Updates our DB management strategy for unittests, this resolves the majority of stability issues we were seeing in CI.

.github/workflows/python-CI.yml

pyproject.toml

axiomofjoy · 2024-09-12T18:35:47Z

pyproject.toml

@@ -248,21 +248,21 @@ dependencies = [
 ]

 [tool.hatch.envs.default.scripts]
-tests = "pytest -n auto {args}"
+tests = "pytest {args}"


Do we no longer need pytest-xdist?

how many cores are we using if we don't specify this option? do we still need pytest-xdist?

tests/datasets/test_experiments.py

axiomofjoy · 2024-09-12T18:46:01Z

tests/conftest.py

@@ -116,7 +111,7 @@ def openai_api_key(monkeypatch: pytest.MonkeyPatch) -> str:
 postgresql_connection = factories.postgresql("postgresql_proc")


-@pytest.fixture()
+@pytest.fixture(scope="function")


Isn't "function" the default scope?

yes, but it's better to be explicit in case the default changes

okay. fwiw i doubt pytest will change that default since it would be a very disruptive change.

axiomofjoy · 2024-09-19T18:06:16Z

pyproject.toml

@@ -248,21 +248,21 @@ dependencies = [
 ]

 [tool.hatch.envs.default.scripts]
-tests = "pytest -n auto {args}"
+tests = "pytest {args}"


how many cores are we using if we don't specify this option? do we still need pytest-xdist?

* Remove busywait * Ruff 🐶 * Use closure loop * Use nest-asyncio for nested asgi fixture management * Wait for db insertions before reading in test * Ensure the entire experiment has run * Experiment with locks * xfail unstable tests * Use asyncio.sleep before querying database after client interactions * Ruff 🐶 * Reduce number of evaluators to make tests more reliable * Only bypass lock for unittests * Convert to an integration test * Set default loop scope for unit tests * Remove loop policy * xfail tests where evals do not reliably write to the database * Ensure databases are function scoped * Ensure inmemory sqlite testing * Ruff 🐶 * Wipe DBs between tests * Continue github actions on error * Use async sleep in spans test * Remove needless import * Refactor engine setup to potentially reduce deadlock risk * Wait for evaluations for more stable tests * Don't continue on failure * Ruff 🐶 * BulkInsterters insert immediately in tests * Remove xdist * Increase timeout to 30 * Xfail test * Use shared cache * Use tempfile based sqlite db * Use tempdirs for windows compatibility * Xfail test again * Wait a waiter to llm eval test * Skip flaky tests only on windows and mac

dosubot bot added the size:S This PR changes 10-29 lines, ignoring generated files. label Sep 7, 2024

anticorrelator added the DO NOT MERGE label Sep 7, 2024

dosubot bot added size:M This PR changes 30-99 lines, ignoring generated files. size:L This PR changes 100-499 lines, ignoring generated files. and removed size:S This PR changes 10-29 lines, ignoring generated files. size:M This PR changes 30-99 lines, ignoring generated files. labels Sep 7, 2024

anticorrelator force-pushed the dustin/lifespan-testing branch from b3cfce7 to fced295 Compare September 11, 2024 14:43

anticorrelator removed the DO NOT MERGE label Sep 11, 2024

anticorrelator changed the title ~~fix(DO NOT MERGE): Lifespan testing~~ fix: Fix DB unittest reliability Sep 11, 2024

axiomofjoy reviewed Sep 12, 2024

View reviewed changes

axiomofjoy approved these changes Sep 19, 2024

View reviewed changes

anticorrelator force-pushed the dustin/lifespan-testing branch from 3f481ca to da69b6c Compare September 20, 2024 05:48

anticorrelator added 18 commits September 20, 2024 02:08

Remove busywait

0e5a10c

Ruff 🐶

148a6ea

Use closure loop

06e1ff9

Use nest-asyncio for nested asgi fixture management

12b25dd

Wait for db insertions before reading in test

c3d785f

Ensure the entire experiment has run

2397aac

Experiment with locks

9095d7a

xfail unstable tests

aca3124

Use asyncio.sleep before querying database after client interactions

2d73c48

Ruff 🐶

085a5f6

Reduce number of evaluators to make tests more reliable

0618dfe

Only bypass lock for unittests

adae758

Convert to an integration test

aa46b38

Set default loop scope for unit tests

c16c952

Remove loop policy

ee87dad

xfail tests where evals do not reliably write to the database

d445cc4

Ensure databases are function scoped

66b5735

Ensure inmemory sqlite testing

9cb9d1d

anticorrelator added 11 commits September 20, 2024 02:08

Ruff 🐶

ce46a5b

Wipe DBs between tests

9200ed9

Continue github actions on error

f1e1183

Use async sleep in spans test

f2cf7db

Remove needless import

56bcaab

Refactor engine setup to potentially reduce deadlock risk

17e7c36

Wait for evaluations for more stable tests

036a170

Don't continue on failure

165bec1

Ruff 🐶

7b0bcbc

BulkInsterters insert immediately in tests

3ca5b06

Remove xdist

07b218a

anticorrelator force-pushed the dustin/lifespan-testing branch from da69b6c to 07b218a Compare September 20, 2024 06:09

anticorrelator added 8 commits September 20, 2024 02:18

Increase timeout to 30

a48660c

Xfail test

0c7d0fa

Use shared cache

fc4da4c

Use tempfile based sqlite db

b76949f

Use tempdirs for windows compatibility

60b2ad3

Xfail test again

87562c6

Wait a waiter to llm eval test

3861e3a

Skip flaky tests only on windows and mac

4fe6552

anticorrelator merged commit 5e1d0b5 into auth Sep 20, 2024
16 checks passed

anticorrelator deleted the dustin/lifespan-testing branch September 20, 2024 15:40

This was referenced Sep 25, 2024

docs(auth): instrumentation migration #4732

Merged

fix!: deprecate python 3.8 #4766

Merged

chore(main): release arize-phoenix 5.0.0 #4707

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: Fix DB unittest reliability #4548

fix: Fix DB unittest reliability #4548

anticorrelator commented Sep 7, 2024 •

edited

Loading

axiomofjoy Sep 12, 2024

axiomofjoy Sep 19, 2024

axiomofjoy Sep 12, 2024

anticorrelator Sep 12, 2024

axiomofjoy Sep 12, 2024

axiomofjoy Sep 19, 2024

fix: Fix DB unittest reliability #4548

fix: Fix DB unittest reliability #4548

Conversation

anticorrelator commented Sep 7, 2024 • edited Loading

axiomofjoy Sep 12, 2024

Choose a reason for hiding this comment

axiomofjoy Sep 19, 2024

Choose a reason for hiding this comment

axiomofjoy Sep 12, 2024

Choose a reason for hiding this comment

anticorrelator Sep 12, 2024

Choose a reason for hiding this comment

axiomofjoy Sep 12, 2024

Choose a reason for hiding this comment

axiomofjoy Sep 19, 2024

Choose a reason for hiding this comment

anticorrelator commented Sep 7, 2024 •

edited

Loading