Tests for task's API #29

psteinb · 2021-11-09T16:27:32Z

As discussed in #19 this PR isolated the unit test for two_moons to serve as a starting point establishing tests for all tasks.

jan-matthis · 2021-11-10T14:59:36Z

This is great, cheers!

I'd suggest that we parametrizing those tests such that they run for all tasks, i.e., something along the lines of:

@pytest.mark.parametrize(
    "task_name",
    [
        (task_name,) for task_name in sbibm.get_available_tasks()
    ],
)

What do you think?

psteinb · 2021-11-10T15:08:19Z

I think that this makes a lot of sense. I can add that if you want.
I suggest to be a bit more selective though in terms of what is tested, in order not to extend the runtime of the tests. One way to go would perhaps be, to do these automated tests in /repo/tests/task as they mostly test the interface of the tests which is settled in /repo/sbibm/tasks/task.py.
For tests potentially running longer, we could mirror the same structure as in /repo/sbibm/tasks/ and allow for specialized tests (e.g. see the .demo.test cases in this PR) in the/repo/tests/tasks/two_moons/test_special.py`. I think this keeps the balance between a "god test suite" and more specialized tests if they are needed. And I'd hope, it makes contributing easier.

jan-matthis · 2021-11-10T15:19:22Z

I think that this makes a lot of sense. I can add that if you want.

That would be great!

I suggest to be a bit more selective though in terms of what is tested, in order not to extend the runtime of the tests. One way to go would perhaps be, to do these automated tests in /repo/tests/task as they mostly test the interface of the tests which is settled in /repo/sbibm/tasks/task.py.
For tests potentially running longer, we could mirror the same structure as in /repo/sbibm/tasks/ and allow for specialized tests (e.g. see the .demo.test cases in this PR) in the/repo/tests/tasks/two_moons/test_special.py`. I think this keeps the balance between a "god test suite" and more specialized tests if they are needed. And I'd hope, it makes contributing easier.

That's an excellent suggestion. I agree that, depending on the task, we will probably want specialized tests as well -- probably mostly marked as slow for CI execution.

- 3 types of test suites added + test_task_interface.py for mere API tests + test_task_rej_abc_demo.py for testing the API demonstrated on the landing page (README.md) + test_task_benchmark.py to see/document if the benchmarks work - added some "noref" sentinels for tasks which do not have a reference posterior - using sets to better work with list of tasks to run tests for - as of now, the tests exclude julia based tests

psteinb · 2021-11-11T17:07:00Z

@jan-matthis done for now. I am happy to adapt #18 if this can be merged earlier.
Please review whenever you can find the time.

jan-matthis

Great, I think this provides a good foundation for testing existing and future tasks. Thanks a lot for your work on this, much appreciated!

I only left small comments

tests/tasks/test_task_benchmarks.py

tests/tasks/test_task_interface.py

- include noref tasks as we don't use the reference posterior - add TODO for later

psteinb · 2021-11-12T09:59:24Z

Thanks for the review. I hope I implemented those comments alright.

jan-matthis

Except for a single comment this looks good to go from my side, cheers!

tests/tasks/two_moons/test_task.py

jan-matthis · 2021-11-12T16:23:35Z

Cheers!

unit test for two moons, basis for testing other tasks

a9a0fc6

jan-matthis mentioned this pull request Nov 10, 2021

Add a forward-only task to sbibm #19

Closed

psteinb added a commit to psteinb/sbibm that referenced this pull request Nov 11, 2021

removed test of two_moons in favor of sbi-benchmark#29

a676e65

jan-matthis linked an issue Nov 11, 2021 that may be closed by this pull request

Improvements to unit tests #23

Closed

psteinb force-pushed the two-moons-task-test branch from b4f3adb to b7b78f7 Compare November 11, 2021 17:02

psteinb marked this pull request as ready for review November 11, 2021 17:03

placeholder test for task-only code

c224755

jan-matthis reviewed Nov 11, 2021

View reviewed changes

tests/tasks/test_task_benchmarks.py Outdated Show resolved Hide resolved

tests/tasks/test_task_benchmarks.py Outdated Show resolved Hide resolved

tests/tasks/test_task_interface.py Outdated Show resolved Hide resolved

jan-matthis changed the title ~~unit test for two moons, basis for testing other tasks~~ Tests for task's API Nov 12, 2021

psteinb added 2 commits November 12, 2021 10:50

removed superfluous code

26d92f5

- include noref tasks as we don't use the reference posterior - add TODO for later

using pyro set_rng_seed utility to fix seed in tests

9e91818

jan-matthis reviewed Nov 12, 2021

View reviewed changes

tests/tasks/two_moons/test_task.py Outdated Show resolved Hide resolved

using pyro utils to set seed

abc4a85

psteinb force-pushed the two-moons-task-test branch from 9ffc0f6 to abc4a85 Compare November 12, 2021 16:04

psteinb requested a review from jan-matthis November 12, 2021 16:05

jan-matthis merged commit 60c1210 into sbi-benchmark:main Nov 12, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Tests for task's API #29

Tests for task's API #29

Uh oh!

psteinb commented Nov 9, 2021

Uh oh!

jan-matthis commented Nov 10, 2021 •

edited

Loading

Uh oh!

psteinb commented Nov 10, 2021

Uh oh!

jan-matthis commented Nov 10, 2021

Uh oh!

psteinb commented Nov 11, 2021 •

edited

Loading

Uh oh!

jan-matthis left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

psteinb commented Nov 12, 2021

Uh oh!

jan-matthis left a comment

Uh oh!

Uh oh!

jan-matthis commented Nov 12, 2021

Uh oh!

Uh oh!

Tests for task's API #29

Tests for task's API #29

Uh oh!

Conversation

psteinb commented Nov 9, 2021

Uh oh!

jan-matthis commented Nov 10, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

psteinb commented Nov 10, 2021

Uh oh!

jan-matthis commented Nov 10, 2021

Uh oh!

psteinb commented Nov 11, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jan-matthis left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

psteinb commented Nov 12, 2021

Uh oh!

jan-matthis left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jan-matthis commented Nov 12, 2021

Uh oh!

Uh oh!

jan-matthis commented Nov 10, 2021 •

edited

Loading

psteinb commented Nov 11, 2021 •

edited

Loading