Uniform and better MCMC params for the tests #1107
So I think that some tests were "ill-parametrized", in the sense that they had very few warmup samples (down to 1), very many chains (up to 20), and high thinning (10-20). I am currently trying to find one hyper-parameter setting that works for all tests. There was a file-based solution for this in place, which was partially overwritten for tests. I kept the overrides; some of them could surely be deleted.

For now, I will not change the MCMC parameterization in the examples, since they might be WIP.
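As a purely illustrative sketch of the shared-parameter idea (the fixture name and values here are hypothetical, not necessarily what this PR ends up with): a single pytest fixture can hold the uniform MCMC settings, and individual tests can still override single entries where they need to.

```python
import pytest

@pytest.fixture
def mcmc_params():
    # Hypothetical uniform defaults for all MCMC-based tests; the
    # concrete numbers used in the PR may differ.
    return dict(num_chains=20, thin=2, warmup_steps=50)

def test_posterior_sampling(mcmc_params):
    # A test that needs a more accurate run overrides single entries
    # instead of redefining the whole parameter set.
    params = {**mcmc_params, "warmup_steps": 200}
    assert params["num_chains"] == 20
```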
```diff
@@ -118,7 +118,8 @@ testpaths = [
 ]
 markers = [
     "slow: marks tests as slow (deselect with '-m \"not slow\"')",
-    "gpu: marks tests that require a gpu (deselect with '-m \"not gpu\"')"
+    "gpu: marks tests that require a gpu (deselect with '-m \"not gpu\"')",
+    "mcmc: marks tests that require MCMC sampling (deselect with '-m \"not mcmc\"')"
```
Great. How long do the MCMC tests run currently?

Edit: I see these are the current tests but with an additional flag, so it's fine.
Without `-n auto`, it takes about 850 s on my machine (this includes `slow`). With `-n auto`, it's 4500 s. I will need to investigate this further, but I can imagine that with multiple chains we might cause more harm than good with test parallelization. This could interest you, too, @Baschdl.
In this case, we could think about running the MCMC tests sequentially (e.g., in CI), but a more sustainable solution might be to add a `pytest.mark` for tests that should only be executed sequentially; a sketch of that idea follows below. At this point, a legitimate question is whether the parallelization of the tests brings enough benefit to outweigh the maintenance cost.
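As a hedged sketch of one way to get that behavior (not what this PR implements): pytest-xdist ships an `xdist_group` marker that pins all tests in a group to the same worker, which effectively serializes them while the rest of the suite still runs in parallel. It requires pytest-xdist >= 2.5 and the `--dist loadgroup` mode; the group and test names below are made up.

```python
import pytest

# All tests in the "mcmc_sequential" group run on the same xdist worker,
# so multi-chain MCMC runs do not compete with other workers for cores.
# Invoke with: pytest -n auto --dist loadgroup
@pytest.mark.mcmc
@pytest.mark.xdist_group(name="mcmc_sequential")
def test_heavy_mcmc_sampling():
    ...
```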
This is great, thanks @famura. Just one comment about the `num_chains` default.
```python
@pytest.mark.parametrize(
    "method", (SNPE, pytest.param(SNLE, marks=[pytest.mark.slow, pytest.mark.mcmc]))
)
def mdn_inference_with_different_methods(method):
```
👍
Looks good, all tests also pass locally on my machine using `pytest tests -m "mcmc and not gpu" -v`.

This will currently just add the capability to rapidly test new MCMC parameters, but does not actually change them, right? So for that we can already merge it. I will leave it to @janfb to approve.
I want to add that I pulled some params out of thin air, and some tests might run longer or shorter now. Actually, I never ran the slow tests locally before, so I have no reference. I will scan for the …
```
# Conflicts:
#	tests/conftest.py
```
From my point of view, this PR is ready.
Thanks for taking the extra time. Looks all good; I will double-check the `num_chains` things and take it from here into `main`.
I added separate fixtures for …
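A minimal sketch of what separate fixtures could look like, assuming a fast/accurate split (the fixture names and values are illustrative; the truncated comment above does not show the actual ones):

```python
import pytest

@pytest.fixture
def mcmc_params_fast():
    # Cheap settings for API/smoke tests where accuracy does not matter.
    return dict(num_chains=1, thin=1, warmup_steps=1)

@pytest.fixture
def mcmc_params_accurate():
    # Costlier settings for tests that actually check posterior accuracy.
    return dict(num_chains=20, thin=2, warmup_steps=50)
```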
I think that's a good change; I was just lacking the insight into which tests should be solved accurately. One final thing before merging this PR: be aware that …
Thanks for fixing the remaining override! Setting …
I see. Once the ruff check runs (dunno what that error is, tbh), we can merge it :)
`ruff format` and `ruff check` show nothing (that is related to this PR).
This is what the workflow is complaining about, which also seems unrelated to the changes here: `sbi/neural_nets/density_estimators/mixed_density_estimator.py:153:89: E501 Line too long (90 > 88)`
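For context, E501 is the line-too-long rule: the `(90 > 88)` means the offending line exceeds the configured limit of 88 characters (Ruff's default `line-length`), so it can be fixed independently of this PR.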
Yeah, I think these are a problem from `main`. The files are not changed here, so merging should not propagate that error into `main`.
Well, `main` is currently also failing. I can locally reproduce these four errors.
Nice, thanks guys. So this PR is done.
What does this implement/fix? Explain your changes
A test errored during MCMC sampling for some runs.
Does this close any currently open issues?
Fixes #1090 (somebody else should test it on their machine, too, since it is/was a stochastic bug).
Any relevant code examples, logs, error output, etc?
To test whether the issue is fixed, run:
`pytest tests/linearGaussian_snre_test.py -k test_api_snre_multiple_trials_and_rounds_map`
Any other comments?
Related to issue #910 and indirectly PR #1053
To run the newly flagged tests locally, use `pytest tests -m mcmc`.
Checklist
Put an `x` in the boxes that apply. You can also fill these out after creating the PR. If you're unsure about any of them, don't hesitate to ask. We're here to help! This is simply a reminder of what we are going to look for before merging your code.
- [ ] I have followed the contribution guidelines.
- [ ] Slow tests are marked with `pytest.mark.slow`.
- [ ] I have followed the coding guidelines.
- [ ] I rebased on `main` (or there are no conflicts with `main`).