Conversation

@afg1 (Contributor) commented Oct 27, 2025

Our unit tests for the pipeline don't run very often, and when they do, they fail on things that are irrelevant, partly because they use live APIs and data changes.

This PR aims to remove a lot of the network access by mocking the API calls. This has been done by recording the output of API calls and using that to build mock data structures, applied by unittest.mock patches so the tests don't use the real API.

This is a work in progress, and will touch many files, so I intend to do incremental PRs to keep reviews manageable.

This batch removes network calls for the OLS, tmRNA, ensembl karyotype and core PDBe tests.
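The record-and-patch approach can be sketched with unittest.mock roughly like this (the module layout, helper names, URL, and payload are illustrative assumptions, not the pipeline's actual code):

```python
import json
import urllib.request
from unittest import mock

# Recorded output of a previous live API call, stored as plain data
# (hypothetical payload; the PR records real OLS/PDBe/Ensembl responses).
RECORDED_RESPONSE = {"id": "GO:0008150", "label": "biological_process"}

def fetch_term(term_id):
    """Real helper: hits the live API. Never called once patched."""
    url = f"https://www.ebi.ac.uk/ols4/api/terms/{term_id}"  # illustrative URL
    with urllib.request.urlopen(url) as response:
        return json.load(response)

def lookup_label(term_id):
    """Code under test: depends on fetch_term, which the test patches out."""
    return fetch_term(term_id)["label"]

def test_lookup_label_without_network():
    # Patch the helper in this module so no network request is made.
    with mock.patch(f"{__name__}.fetch_term",
                    return_value=RECORDED_RESPONSE) as fake:
        assert lookup_label("GO:0008150") == "biological_process"
        fake.assert_called_once_with("GO:0008150")
```

The same pattern generalises across the OLS, tmRNA, karyotype, and PDBe tests: record the payload once, commit it as fixture data, and patch the call site in each test.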

@afg1 afg1 requested review from blakesweeney and pmb59 October 27, 2025 09:16
@afg1 afg1 self-assigned this Oct 27, 2025
@afg1 (Contributor, Author) commented Oct 27, 2025

/gemini review

@gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request introduces changes to test configurations and adds mocking to remove network dependencies in several tests. The changes involve modifying pyproject.toml and pytest.ini to include new test markers, and updating test files to use mocking for external API calls. This aims to improve the reliability and speed of unit tests by eliminating their reliance on live network connections.

Comment on lines 51 to 53


@pytest.mark.ols

high

This test is marked as @pytest.mark.ols but it is missing the mock_ols_api fixture as an argument. This means the test will still attempt to make network calls, defeating the purpose of the mocking fixture. This needs to be corrected.

Suggested change
 @pytest.mark.ols
-def test_can_fetch_a_go_term():
+def test_can_fetch_a_go_term(mock_ols_api):
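For context, a fixture like mock_ols_api typically patches the API helper for the duration of the test and is activated simply by naming it in the test's signature. A minimal sketch (the fixture body and the helper name fetch_go_term are assumptions, not the PR's actual implementation):

```python
import pytest
from unittest import mock

# Hypothetical recorded OLS payload committed alongside the tests.
RECORDED_GO_TERM = {"id": "GO:0008150", "label": "biological_process"}

def fetch_go_term(term_id):
    """Stand-in for the pipeline's real OLS call (hypothetical name)."""
    raise RuntimeError("unit test attempted live network access")

@pytest.fixture
def mock_ols_api():
    # Replace the helper with recorded data while the test runs.
    with mock.patch(f"{__name__}.fetch_go_term",
                    return_value=RECORDED_GO_TERM) as fake:
        yield fake

@pytest.mark.ols
def test_can_fetch_a_go_term(mock_ols_api):
    # Because the fixture is in the signature, this returns recorded data.
    assert fetch_go_term("GO:0008150")["label"] == "biological_process"
```

Without the fixture argument, the patch is never applied and the unpatched helper runs, which is exactly the failure mode the bot flags above.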

"ci: Tests that should run in continuous integration",
"slow: Tests that take a long time",
"db: Test that require access to our database",
"network: Tests that require network",

medium

Adding a network marker to pyproject.toml is useful for categorizing tests, but it is important to ensure that every test that actually uses the network carries the marker, and that no test is marked as requiring network access unnecessarily.
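Concretely, the marker list above lives under pytest's configuration table; a sketch of how it might look in pyproject.toml:

```toml
[tool.pytest.ini_options]
markers = [
    "ci: Tests that should run in continuous integration",
    "slow: Tests that take a long time",
    "db: Test that require access to our database",
    "network: Tests that require network",
]
```

With the marker registered, the network-dependent tests can be deselected in CI with `pytest -m "not network"`.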

}

@pytest.mark.ensembl
@pytest.mark.network

medium

The @pytest.mark.network marker is no longer needed here, as the test has been refactored to use mocking and no longer depends on a network connection. Removing this marker will accurately reflect the test's independence from network resources.

Suggested change
 @pytest.mark.ensembl
-@pytest.mark.network
 def test_builds_with_known_bands():

@blakesweeney (Member)

I'm not sure I like this overall. The upside to these tests failing is that sometimes the APIs and the like change without notice, and these failures do catch that. There are ways to just log which tests are flaky, and I'd rather start with that, to be honest.

@afg1 (Contributor, Author) commented Oct 28, 2025

100%, we need tests to catch changes in the APIs. My plan was to separate them out: we keep our assumptions about the data format in these tests, which run fast and can go into CI on GitHub Actions or similar. Then we have separate tests that run less frequently (a month before release?) to check that the API still matches our assumptions.

My problem is that these tests fail for completely irrelevant reasons, like a citation count changing or an extra full stop appearing in an ontology term definition. If we keep them on the live API they will always fail, and my worry is that we will assume a failure is due to one of these dumb reasons when in fact a bigger API change has happened.

I would propose having bigger integration tests that use the live API but don't check fine details the way these do, or specific API data-contract checks that compare the mocked structure to the real one.

This was just something I decided should be fixed one evening, so very happy to have any input!
