Checkpointing #273

michaeldeistler · 2020-07-23T08:55:45Z

Checkpointing and new API for multi-round

API

# Run additional rounds with the last posterior as new proposal.
proposal = None
posteriors = []
for round_ in range(num_rounds):
    posterior = infer(num_simulations=200, proposal=proposal)
    proposal = posterior.set_default_x(x_o)
    posteriors.append(posterior)

Main changes

no more x_o argument
no more _x_o_trained_on attribute
proposal needs to be passed as a function only in theta

michaeldeistler · 2020-07-23T08:57:41Z

Hi @jan-matthis @janfb @Meteore

for my STG project, I need the mechanism described above, so I would like to implement it in sbi soon. Please let me know if you have a preference for suggestion 1 or suggestion 2.

To seed this discussion:
I prefer suggestion 1 because:
a) it does not require an additional argument to __call__()
b) having a .continue() function makes it very explicit to the user what is happing.

alvorithm · 2020-07-23T09:04:22Z

Hi @jan-matthis @janfb @Meteore

for my STG project, I need the mechanism described above, so I would like to implement it in sbi soon. Please let me know if you have a preference for suggestion 1 or suggestion 2.

To seed this discussion:
I prefer suggestion 1 because:
a) it does not require an additional argument to __call__()
b) having a .continue() function makes it very explicit to the user what is happing.

I would like us to list somewhere the state that we are carrying around and cannot be managed as return values that get passed again into one round of inference. I mentioned something similar on the PR about external data and I think it is a discussion worth to have, possibly in a dedicated meeting.

PS. not sure I understand what start_new_round is doing and how it interacts with num_rounds.

michaeldeistler · 2020-07-23T09:17:41Z

Here's a list of the state:

_theta_roundwise
_x_roundwise
_prior_masks
_data_round_index
_posterior
_model_bank

michaeldeistler · 2020-07-23T09:20:22Z

In the example above, start_new_round=False and hence the next simulations will still come from the same distribution as in the round before, in this case from the prior. If it were True, we would start a new round, i.e. a second round and hence simulate from the posterior.

num_rounds simply indicates how many rounds are being run in the current call to __call__() or .continue(), respectively.

michaeldeistler · 2020-07-24T07:07:44Z

@jan-matthis @janfb @Meteore After quite some discussions yesterday, we thought that it might be a good idea to change the API of multi-round. In the description of this PR, I outline one way to do it. Have a look and let me know what you think.

The code for this is also "ready" (despite only for snpe and not documented yet), so, if you want, also have a look at that.

jan-matthis

Had a first look at the PR, looking good! I guess SNLE, SNRE, tests and changes to infer() are still forthcoming

sbi/inference/base.py

sbi/inference/snpe/snpe_base.py

sbi/inference/snpe/snpe_c.py

sbi/inference/snpe/snpe_base.py

jan-matthis · 2020-07-24T08:31:10Z

sbi/inference/posterior.py

@@ -93,6 +94,16 @@ def __init__(
        # Correction factor for leakage, only applicable to SNPE-family methods.
        self._leakage_density_correction_factor = None

+    def focus_training_on(self, x) -> "NeuralPosterior":


I'm seeing no invocation of this method in the code, shouldn't it be called at some point?

The user has to call it after inference. See the API example in the PR description.

Ah, I see. I missed the edited version. How about sticking to method names that unspecific to training/inference?

For example:

# Single round inference with prior as proposal posterior = infer(num_simulations=200, proposal=None) # proposal=None is also default. posteriors = [posterior] # Run additional rounds with the last posterior as new proposal. for round_ in range(1, num_rounds): posteriors.append(infer(num_simulations=200, proposal=posteriors[round_-1].set_default_x(x_o)))

Or shorter:

posteriors = [] proposal = None for _ in range(num_rounds): posterior = infer(num_simulations=200, proposal=proposal) proposal = posterior.set_default_x(x_o) posteriors.append(posterior)

More generally, I wonder if infer() should keep a num_rounds keyword and build such a loop internally

michaeldeistler added enhancement New feature or request API changes This impacts the public API of the project (e.g. inference class). labels Jul 23, 2020

michaeldeistler requested review from alvorithm, jan-matthis and janfb July 23, 2020 08:55

michaeldeistler self-assigned this Jul 23, 2020

jan-matthis requested changes Jul 24, 2020

View reviewed changes

jan-matthis reviewed Jul 24, 2020

View reviewed changes

michaeldeistler force-pushed the checkpointing branch from 7e97cc0 to 2ec0929 Compare July 24, 2020 11:37

michaeldeistler linked an issue Jul 24, 2020 that may be closed by this pull request

Warn if invalid simulations + multiround SNPE-C #271

Closed

michaeldeistler force-pushed the checkpointing branch 3 times, most recently from c55e075 to 7a5a625 Compare July 24, 2020 15:27

michaeldeistler added 12 commits July 27, 2020 11:15

loop over rounds now part of API

961a338

check proposal distribution

2253166

mle at first call, regardless of proposal

843e3b0

documentation

2a0123d

new multi-round interface adapted for snle

f0b22f9

new multi-round interface for snre

82e3352

remove num_rounds from call made by simple interface

c03dcf9

updated tests for new api

17e3cde

warn if multi-round snpe-c and invalid simulations

bc7fe1f

if proposal is used in first round, we use atomic loss

4ba936a

getting rid of x_o_trained_on

2709f92

Adapting examples/tutorials

359461a

added changes to CHANGELOG.md for v0.11.0

241cf69

michaeldeistler force-pushed the checkpointing branch 2 times, most recently from 1762c55 to 042f584 Compare July 28, 2020 20:43

ran isort and black

042f584

michaeldeistler merged commit 4fbc2ba into main Jul 28, 2020

michaeldeistler deleted the checkpointing branch July 28, 2020 21:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Checkpointing #273

Checkpointing #273

michaeldeistler commented Jul 23, 2020 •

edited

Loading

michaeldeistler commented Jul 23, 2020

alvorithm commented Jul 23, 2020

michaeldeistler commented Jul 23, 2020

michaeldeistler commented Jul 23, 2020 •

edited

Loading

michaeldeistler commented Jul 24, 2020 •

edited

Loading

jan-matthis left a comment •

edited

Loading

jan-matthis Jul 24, 2020

michaeldeistler Jul 24, 2020

jan-matthis Jul 24, 2020

jan-matthis Jul 24, 2020

jan-matthis Jul 24, 2020

jan-matthis Jul 24, 2020

Checkpointing #273

Checkpointing #273

Conversation

michaeldeistler commented Jul 23, 2020 • edited Loading

Checkpointing and new API for multi-round

API

Main changes

michaeldeistler commented Jul 23, 2020

alvorithm commented Jul 23, 2020

michaeldeistler commented Jul 23, 2020

michaeldeistler commented Jul 23, 2020 • edited Loading

michaeldeistler commented Jul 24, 2020 • edited Loading

jan-matthis left a comment • edited Loading

Choose a reason for hiding this comment

jan-matthis Jul 24, 2020

Choose a reason for hiding this comment

michaeldeistler Jul 24, 2020

Choose a reason for hiding this comment

jan-matthis Jul 24, 2020

Choose a reason for hiding this comment

jan-matthis Jul 24, 2020

Choose a reason for hiding this comment

jan-matthis Jul 24, 2020

Choose a reason for hiding this comment

jan-matthis Jul 24, 2020

Choose a reason for hiding this comment

michaeldeistler commented Jul 23, 2020 •

edited

Loading

michaeldeistler commented Jul 23, 2020 •

edited

Loading

michaeldeistler commented Jul 24, 2020 •

edited

Loading

jan-matthis left a comment •

edited

Loading