
Finish restructuring the tests to follow the structure of the code #6125

Merged (12 commits) on Oct 1, 2022

Conversation

Armavica (Member)

What is this PR about?
This PR restructures the last test files to follow the structure of the code, as discussed in #5777.

I think that this closes #5777. One remaining open question is whether we want to move tests out of pymc, as is done in aesara, for example.

Checklist

Major / Breaking Changes

  • ...

Bugfixes / New features

  • ...

Docs / Maintenance

  • Finish restructuring the tests to follow the structure of the code

codecov bot commented Sep 13, 2022

Codecov Report

Merging #6125 (a9eca4a) into main (244c37d) will increase coverage by 0.33%.
The diff coverage is 99.77%.

Additional details and impacted files


@@            Coverage Diff             @@
##             main    #6125      +/-   ##
==========================================
+ Coverage   93.04%   93.37%   +0.33%     
==========================================
  Files          91      100       +9     
  Lines       20813    21896    +1083     
==========================================
+ Hits        19365    20445    +1080     
- Misses       1448     1451       +3     
Impacted Files Coverage Δ
pymc/tests/smc/test_smc.py 100.00% <ø> (ø)
pymc/tests/variational/test_inference.py 99.02% <99.02%> (ø)
pymc/tests/step_methods/hmc/test_hmc.py 98.14% <100.00%> (ø)
pymc/tests/step_methods/hmc/test_nuts.py 100.00% <100.00%> (ø)
pymc/tests/step_methods/test_compound.py 100.00% <100.00%> (ø)
pymc/tests/step_methods/test_metropolis.py 100.00% <100.00%> (ø)
pymc/tests/step_methods/test_slicer.py 100.00% <100.00%> (ø)
pymc/tests/variational/test_approximations.py 100.00% <100.00%> (ø)
pymc/tests/variational/test_callbacks.py 100.00% <100.00%> (ø)
pymc/tests/variational/test_opvi.py 100.00% <100.00%> (ø)
... and 3 more

Armavica (Member Author) commented Sep 16, 2022

These metropolis tests are flaky even though they are supposed to be SeededTests. This is because _get_seeds_per_chain creates a numpy.random.default_rng(None), which is seeded from OS entropy rather than from the global numpy random generator:

pymc/pymc/sampling.py

Lines 287 to 290 in 4ea8dde

if random_state is None or isinstance(random_state, int):
    if chains == 1 and isinstance(random_state, int):
        return (random_state,)
    return _get_unique_seeds_per_chain(np.random.default_rng(random_state).integers)

My suggestion is to use a seed from the global numpy random generator in case number 4 described in this function's docstring: this would make these tests deterministic. Concretely, I would add these two lines before L287 above:

if random_state is None:
    random_state = np.random.randint(2**30)

Would that be okay?
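For illustration only (not part of the PR), a minimal sketch of the behavior described above: seeding the global generator has no effect on np.random.default_rng(None), because passing None draws fresh OS entropy, while an explicit seed is fully reproducible.

```python
import numpy as np

# Seeding the global generator does NOT make default_rng(None)
# deterministic: None means "seed from OS entropy", not "use the
# global numpy random state".
np.random.seed(42)
a = np.random.default_rng(None).integers(2**30)
np.random.seed(42)
b = np.random.default_rng(None).integers(2**30)
print(a != b)  # almost certainly True: entropy-based each time

# By contrast, an explicit seed is fully reproducible:
c = np.random.default_rng(123).integers(2**30)
d = np.random.default_rng(123).integers(2**30)
print(c == d)  # True
```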

ricardoV94 (Member) commented Sep 16, 2022

No, we officially no longer allow users to control seeding via the global numpy generator (even though we still use global seeding internally), because global seeds are of lower quality.

The solution is to explicitly pass random_seed to the sampling routines in those tests.

But how often does it fail? I haven't seen it fail before for a long time now.
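A sketch of the suggested fix, with a hypothetical stand-in for the sampling routine (the function here is illustrative only, not pymc's actual API): the test passes random_seed explicitly instead of relying on a globally seeded state.

```python
import numpy as np

# Hypothetical stand-in for a sampling routine that, like pymc's
# samplers, accepts an explicit random_seed. Name and signature are
# illustrative only.
def sample(draws, random_seed=None):
    rng = np.random.default_rng(random_seed)
    return rng.normal(size=draws)

# In a test, pass the seed explicitly rather than seeding globally:
draws_a = sample(500, random_seed=20220916)
draws_b = sample(500, random_seed=20220916)
print(np.array_equal(draws_a, draws_b))  # True: fully reproducible
```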

pymc/tests/helpers.py (review thread, resolved)
Armavica (Member Author)

I see, that makes sense. I will change the name of the class and we will see if that repeats.

But doesn't this mean that SeededTest in its current form is not doing much?

ricardoV94 (Member)

> I see, that makes sense. I will change the name of the class and we will see if that repeats.
>
> But doesn't this mean that SeededTest in its current form is not doing much?

Yes, it's not doing much anymore, other than providing a seed when requested.

@@ -149,7 +149,7 @@ jobs:
- pymc/tests/test_variational_inference.py pymc/tests/test_initial_point.py
- pymc/tests/test_model.py pymc/tests/test_step.py
- pymc/tests/gp/test_cov.py pymc/tests/gp/test_gp.py pymc/tests/gp/test_mean.py pymc/tests/gp/test_util.py pymc/tests/ode/test_ode.py pymc/tests/ode/test_utils.py pymc/tests/test_smc.py pymc/tests/test_parallel_sampling.py
- pymc/tests/test_sampling.py pymc/tests/test_posteriors.py
- pymc/tests/test_sampling.py pymc/tests/step_methods/test_metropolis.py pymc/tests/step_methods/test_slicer.py pymc/tests/step_methods/hmc/test_nuts.py
Member

Isn't it enough to do -pymc/tests/step_methods/ ?

Armavica (Member Author), Sep 16, 2022

Yes, it is, but here I selected only the few files where tests from test_posteriors ended up, so as not to increase the runtime of the tests too much. The rest is already tested on other platforms. I am not sure how much of a difference that makes, though. If we test the whole of step_methods, I will also check that the check_all_tests_are_covered.py script understands what happens.

Armavica (Member Author) commented Sep 16, 2022

Two tests have been consistently failing on this PR, even though they pass locally on my computer: TestKmeansInducing::test_kmeans (run on both Windows and Linux on the GitHub runners, but failing only on Windows) and TestDEMetropolisZ::test_tuning_reset. I will check if I did something wrong.

ricardoV94 (Member)

@Armavica Do you want to try rebasing this on main, to see if we can get it merged quickly? One of the flaky tests you mention was fixed recently.

ricardoV94 (Member)

Otherwise it might be easier to do it one module at a time

Armavica (Member Author)

The TestDEMetropolisZ::test_tuning_reset does seem to fail about 10-20% of the time on my laptop. Should I just add a seed?

ricardoV94 (Member)

> The TestDEMetropolisZ::test_tuning_reset does seem to fail about 10-20% of the time on my laptop. Should I just add a seed?

Yes
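For illustration, a hypothetical sketch of what "just add a seed" looks like for a flaky statistical test (this is not the real test_tuning_reset code): a statistical assertion that fails intermittently under fresh entropy becomes exactly reproducible once the generator is seeded.

```python
import numpy as np

# Hypothetical sketch, not the actual pymc test: pinning the RNG with
# a fixed seed makes the statistical assertion deterministic across runs.
def test_tuning_reset_sketch():
    rng = np.random.default_rng(2022)  # fixed seed: no more flakiness
    samples = rng.normal(loc=0.0, scale=1.0, size=5000)
    # Loose tolerance (~7 standard errors), and exactly the same
    # samples on every run under this seed.
    assert abs(samples.mean()) < 0.1

test_tuning_reset_sketch()
print("ok")
```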

@Armavica Armavica requested a review from ricardoV94 September 30, 2022 22:31
@ricardoV94 ricardoV94 merged commit f515e91 into pymc-devs:main Oct 1, 2022
ricardoV94 (Member)

Awesome work @Armavica, are we done?

Armavica (Member Author) commented Oct 2, 2022

> Awesome work @Armavica, are we done?

Yes, that was the last batch!

@Armavica Armavica deleted the structure-tests branch October 7, 2024 14:59

Successfully merging this pull request may close these issues:

Mirror codebase structure in tests