DEMetropolis: tune lambda instead of epsilon #3720

michaelosthege · 2019-12-10T22:05:08Z

Our DEMetropolis tunes the scaling factor of the noise distribution:
https://github.com/pymc-devs/pymc3/blob/1c30a6f487afaeef73464a98320e35961b11873f/pymc3/step_methods/metropolis.py#L572-L581

My feeling these days is that tuning the noise distribution is a bit pointless after the first few iterations & could obscure the warmup, or even lead to slingshots if it overshoots.

Instead, we could tune lambda parameter. It's optimal value depends on the (dimensionality of the) target density (ter Braak (2006)), so it should be a good candidate for tuning.
This approach is described in Nelson et al. (2013), section 4.1.2.

The text was updated successfully, but these errors were encountered:

closes pymc-devs#3720

michaelosthege · 2019-12-17T15:42:24Z

On my test problem (50-dim MvNormal) the tuning converges to the same rule of thumb that was used before (2.38 / sqrt(2*ndim)).
Here the progression of the tuned parameter (3000 tuning its)

I also see no significant difference in the effective sample size...

I'm thinking to ditch tuning of scaling/lambda alltogether. What do you think?
cc @junpenglao

junpenglao · 2019-12-17T16:03:57Z

Could you try on a ODE example?

michaelosthege · 2019-12-17T16:23:42Z

Could you try on a ODE example?

I tried with Demetris benchmark example, but it was very slow & inefficient (--> noisy) while having just 2 dimensions.

junpenglao · 2019-12-17T16:25:12Z

Do you mean slower than no tuning?

michaelosthege · 2019-12-17T16:49:11Z

Do you mean slower than no tuning?

No the sampling was just slow/inefficient because it's an ODE. Also DifferentialEquation computes sensitivities that DEMetropolis doesn't use.
So I'd have to wait very long for the benchmark results to give a significant answer.

I could also implement a kwarg like DEMetropolis(tune_par=x) with x in {None, 'epsilon', 'lambda'} where None is the default because tuning epsilon is a bit pointless & tuning lambda is not necessarily better than lambda = 2.38 / sqrt(2*ndim).

Maybe the result of my testing is simply that DEMetropolis doesn't need hyperparameter tuning. (Needs warmup/burnin though.)

junpenglao · 2019-12-17T21:04:10Z

I see. Thanks! Feel free to close.

+ tune argument now one of None,scaling,lambda + support for tuning lambda (closes pymc-devs#3720) + added test to check checking of tune setting + both scaling and lambda are recorded in the sampler stats

michaelosthege added enhancements metropolis labels Dec 10, 2019

michaelosthege added a commit to michaelosthege/pymc that referenced this issue Dec 17, 2019

tune lambda instead of epsilon

8f48252

closes pymc-devs#3720

michaelosthege mentioned this issue Dec 19, 2019

Don't tune DEMetropolis by default #3743

Merged

michaelosthege closed this as completed in #3743 Dec 21, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DEMetropolis: tune lambda instead of epsilon #3720

DEMetropolis: tune lambda instead of epsilon #3720

michaelosthege commented Dec 10, 2019

michaelosthege commented Dec 17, 2019 •

edited

Loading

junpenglao commented Dec 17, 2019

michaelosthege commented Dec 17, 2019

junpenglao commented Dec 17, 2019

michaelosthege commented Dec 17, 2019

junpenglao commented Dec 17, 2019

DEMetropolis: tune lambda instead of epsilon #3720

DEMetropolis: tune lambda instead of epsilon #3720

Comments

michaelosthege commented Dec 10, 2019

michaelosthege commented Dec 17, 2019 • edited Loading

junpenglao commented Dec 17, 2019

michaelosthege commented Dec 17, 2019

junpenglao commented Dec 17, 2019

michaelosthege commented Dec 17, 2019

junpenglao commented Dec 17, 2019

michaelosthege commented Dec 17, 2019 •

edited

Loading