Fix some flakey tests #851

michelletran-codecov · 2024-11-04T15:28:55Z

Extracted this from #729

A few of the tests become flaky when migrating to the Django models because of some unique references that's defined in Django, but not SQLAlchemy.

In particular, the way that FactoryBoy creates references can be flakey (i.e. Commits referencing Pulls. If the original pull is not flushed, there is a chance that the commit gets created first, which inadvertently creates a separate pull with the id of the one that you specified to create). So, doing an explicit flush will of pull before creating the commit will fix this.

Also fixed a test where DB returned results are not necessarily ordered.

Legal Boilerplate

Look, I get it. The entity doing business as "Sentry" was incorporated in the State of Delaware in 2015 as Functional Software, Inc. In 2022 this entity acquired Codecov and as result Sentry is going to need some rights from me in order to utilize my contributions in this PR. So here's the deal: I retain all rights, title and interest in and to my contributions, and by keeping this boilerplate intact I confirm that Sentry can use, modify, copy, and redistribute my contributions, under Sentry's choice of terms.

codecov · 2024-11-04T15:34:53Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.02%. Comparing base (8fec23c) to head (1adff7e).
Report is 5 commits behind head on main.

✅ All tests successful. No failed tests found.

Additional details and impacted files

@@           Coverage Diff           @@
##             main     #851   +/-   ##
=======================================
  Coverage   98.02%   98.02%           
=======================================
  Files         444      444           
  Lines       36003    36016   +13     
=======================================
+ Hits        35292    35305   +13     
  Misses        711      711

Flag	Coverage Δ
integration	`98.02% <100.00%> (+<0.01%)`	⬆️
unit	`98.02% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Components	Coverage Δ
NonTestCode	`95.97% <ø> (+<0.01%)`	⬆️
OutsideTasks	`98.05% <ø> (+<0.01%)`	⬆️

Files with missing lines	Coverage Δ
tasks/tests/integration/test_upload_e2e.py	`100.00% <100.00%> (ø)`
tasks/tests/unit/test_manual_trigger.py	`97.50% <100.00%> (+0.20%)`	⬆️
...sks/tests/unit/test_test_results_processor_task.py	`100.00% <100.00%> (ø)`
tasks/tests/unit/test_upload_finisher_task.py	`100.00% <100.00%> (ø)`

... and 6 files with indirect coverage changes

codecov-notifications · 2024-11-04T15:34:59Z

Codecov Report

All modified and coverable lines are covered by tests ✅

@@           Coverage Diff           @@
##             main     #851   +/-   ##
=======================================
  Coverage   98.02%   98.02%           
=======================================
  Files         444      444           
  Lines       36003    36016   +13     
=======================================
+ Hits        35292    35305   +13     
  Misses        711      711

Flag	Coverage Δ
integration	`98.02% <100.00%> (+<0.01%)`	⬆️
unit	`98.02% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Components	Coverage Δ
NonTestCode	`95.97% <ø> (+<0.01%)`	⬆️
OutsideTasks	`98.05% <ø> (+<0.01%)`	⬆️

Files with missing lines	Coverage Δ
tasks/tests/integration/test_upload_e2e.py	`100.00% <100.00%> (ø)`
tasks/tests/unit/test_manual_trigger.py	`97.50% <100.00%> (+0.20%)`	⬆️
...sks/tests/unit/test_test_results_processor_task.py	`100.00% <100.00%> (ø)`
tasks/tests/unit/test_upload_finisher_task.py	`100.00% <100.00%> (ø)`

... and 6 files with indirect coverage changes

codecov-qa · 2024-11-04T15:35:05Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.02%. Comparing base (8fec23c) to head (1adff7e).
Report is 5 commits behind head on main.

✅ All tests successful. No failed tests found.

@@           Coverage Diff           @@
##             main     #851   +/-   ##
=======================================
  Coverage   98.02%   98.02%           
=======================================
  Files         444      444           
  Lines       36003    36016   +13     
=======================================
+ Hits        35292    35305   +13     
  Misses        711      711

Flag	Coverage Δ
integration	`98.02% <100.00%> (+<0.01%)`	⬆️
unit	`98.02% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Components	Coverage Δ
NonTestCode	`95.97% <ø> (+<0.01%)`	⬆️
OutsideTasks	`98.05% <ø> (+<0.01%)`	⬆️

Files with missing lines	Coverage Δ
tasks/tests/integration/test_upload_e2e.py	`100.00% <100.00%> (ø)`
tasks/tests/unit/test_manual_trigger.py	`97.50% <100.00%> (+0.20%)`	⬆️
...sks/tests/unit/test_test_results_processor_task.py	`100.00% <100.00%> (ø)`
tasks/tests/unit/test_upload_finisher_task.py	`100.00% <100.00%> (ø)`

... and 6 files with indirect coverage changes

codecov-public-qa · 2024-11-04T15:35:19Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 98.02%. Comparing base (8fec23c) to head (1adff7e).
Report is 4 commits behind head on main.

✅ All tests successful. No failed tests found.

@@           Coverage Diff           @@
##             main     #851   +/-   ##
=======================================
  Coverage   98.02%   98.02%           
=======================================
  Files         444      444           
  Lines       36003    36016   +13     
=======================================
+ Hits        35292    35305   +13     
  Misses        711      711

Flag	Coverage Δ
integration	`98.02% <100.00%> (+<0.01%)`	⬆️
unit	`98.02% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Components	Coverage Δ
NonTestCode	`95.97% <ø> (+<0.01%)`	⬆️
OutsideTasks	`98.05% <ø> (+<0.01%)`	⬆️

Files	Coverage Δ
tasks/tests/integration/test_upload_e2e.py	`100.00% <100.00%> (ø)`
tasks/tests/unit/test_manual_trigger.py	`97.50% <100.00%> (+0.20%)`	⬆️
...sks/tests/unit/test_test_results_processor_task.py	`100.00% <100.00%> (ø)`
tasks/tests/unit/test_upload_finisher_task.py	`100.00% <100.00%> (ø)`

... and 6 files with indirect coverage changes

Swatinem · 2024-11-05T09:57:23Z

tasks/tests/unit/test_test_results_processor_task.py

+        # assert that test_flag_bridges is a subset of tests
+        assert set(bridge.test_id for bridge in test_flag_bridges).issubset(
+            set(x.id for x in tests)
+        )


previously this was testing for equality, not a subset relation. is this really what we want?

Hmm, good point, the subset test doesn't really make sense. We might actually want to check that the test_ids in test_flag_bridge match the ones in test_instances (non-failure, I'm guessing). The flakiness with this test is that it assumes ordering of tests, where we should not have that assumption.

Since I'm having to guess the intent of this check, I can also be convince that this test isn't super useful, and just remove it. I'm leaning towards just modifying it to match the ids in test_instance out of conservatism.

This is fragile because hard-coding the pullid is more likely to run into conflicts

The first flush to create the commit wit pullid=12 actually creates it, but then it gets added again later. This just avoids flushing multiple times and attempts to create the pull first.

This avoids any other factories that might potentially use the pull object from recreating it in the database

…l object

Flushing the pull before creating the commit seems to reduce the chance of id collision. This is probably because commit _tries_ to create a pull. Flushing the pull logs the pull object in the DB, which disables the CommitFactory from also creating the pull object. This is my best understanding.

michelletran-codecov marked this pull request as ready for review November 4, 2024 16:34

michelletran-codecov requested a review from a team November 4, 2024 16:34

Swatinem approved these changes Nov 5, 2024

View reviewed changes

michelletran-codecov added 5 commits November 5, 2024 10:06

Autogenerate pullid for test

7ab6f6d

This is fragile because hard-coding the pullid is more likely to run into conflicts

Ensure that only one pull gets created

b97b20f

The first flush to create the commit wit pullid=12 actually creates it, but then it gets added again later. This just avoids flushing multiple times and attempts to create the pull first.

Create the pull object earlier

b2d4e1f

This avoids any other factories that might potentially use the pull object from recreating it in the database

Separate the pull flushing from other objects that might create a pul…

68cb3cf

…l object

michelletran-codecov force-pushed the fix_some_flakey_tests branch from 2982784 to 49fb6b4 Compare November 5, 2024 15:06

Make TestUploadTestProcessor less flaky

1adff7e

michelletran-codecov force-pushed the fix_some_flakey_tests branch from 49fb6b4 to 1adff7e Compare November 5, 2024 15:17

michelletran-codecov added this pull request to the merge queue Nov 5, 2024

Merged via the queue into main with commit bf934d6 Nov 5, 2024
26 of 27 checks passed

michelletran-codecov deleted the fix_some_flakey_tests branch November 5, 2024 16:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix some flakey tests #851

Fix some flakey tests #851

michelletran-codecov commented Nov 4, 2024 •

edited

Loading

codecov bot commented Nov 4, 2024 •

edited

Loading

codecov-notifications bot commented Nov 4, 2024 •

edited

Loading

codecov-qa bot commented Nov 4, 2024 •

edited

Loading

codecov-public-qa bot commented Nov 4, 2024 •

edited

Loading

Swatinem Nov 5, 2024

michelletran-codecov Nov 5, 2024

Fix some flakey tests #851

Fix some flakey tests #851

Conversation

michelletran-codecov commented Nov 4, 2024 • edited Loading

Legal Boilerplate

codecov bot commented Nov 4, 2024 • edited Loading

Codecov Report

codecov-notifications bot commented Nov 4, 2024 • edited Loading

Codecov Report

codecov-qa bot commented Nov 4, 2024 • edited Loading

Codecov Report

codecov-public-qa bot commented Nov 4, 2024 • edited Loading

Codecov Report

Swatinem Nov 5, 2024

Choose a reason for hiding this comment

michelletran-codecov Nov 5, 2024

Choose a reason for hiding this comment

michelletran-codecov commented Nov 4, 2024 •

edited

Loading

codecov bot commented Nov 4, 2024 •

edited

Loading

codecov-notifications bot commented Nov 4, 2024 •

edited

Loading

codecov-qa bot commented Nov 4, 2024 •

edited

Loading

codecov-public-qa bot commented Nov 4, 2024 •

edited

Loading