[WIP] Adding support for adaptation of a dense mass matrix based on sample covariances #3596

dfm · 2019-08-19T20:58:31Z

This pull request suggests an interface for full mass matrix adaptation based on an online estimate of the sample covariance. This is similar to the existing QuadPotentialDiagAdapt implementation except it updates the full covariance matrix.

I have blogged an example where the performance of the default adaptation routine is much poorer than it should be. While this example might seem a bit contrived, I've found that that dense mass matrix adaptation is absolutely crucial for many of the applications that I've been considering in astronomy (for example: the exoplanet library).

So based on this experience, I took a quick stab at implementing an interface like this directly into pymc3 and I'd love to get your feedback!

I also posted a short demo based on the blog post implemented using this pull request: https://gist.github.com/dfm/da1d0470d6fb54c63e6a913c1ef67a9e

pymc3/step_methods/hmc/quadpotential.py

junpenglao · 2019-08-20T04:56:58Z

Thanks for the pull request! This is a great contribution!!!

pymc3/step_methods/hmc/quadpotential.py

aseyboldt

This looks nice.
I'd love to have a tests that also samples using the new adaptation.
Eventually, we should expose this as adapt='jitter+adapt_dense' in pm.sample, but for now I'd rather hide it a bit, so that we can do some testing. I'd also add a warning that this is experimental for now.
There are a couple of improvements I could think of:

Is it worth it to update the covariance in every step during tuning? It will have to do a cholesky decomposition each time.
We never need the full matrix, but only matrix products and products with a decomposition. So we could just store the (k, n) matrix V of samples (dim n, num samples k) and then do a svd of that. We could compute the matrix product as (I + sigma VV.T), and use the svd to draw samples.
What are good window sizes? This needs testing.

But I wouldn't let either of those stop us from merging this.

pymc3/step_methods/hmc/quadpotential.py

aseyboldt · 2019-08-27T18:33:06Z

@dfm I wrote a very experimental version of mass matrix adaptation that uses low-rank approximations for the mass matrix, so it does not need to store a whole n by n matrix. I'd be interested to hear if this helps for your models.

https://github.com/aseyboldt/covadapt

dfm · 2019-09-02T12:17:11Z

@aseyboldt: Thanks for the feedback! Your covadapt project looks awesome and I'll definitely take it for a test drive.

dfm · 2019-09-02T12:21:25Z

Is it worth it to update the covariance in every step during tuning? It will have to do a cholesky decomposition each time.

In my experience, it tends to be fine to just update the matrix at the end of each adaptation window (that's what Stan does too) but I couldn't figure out how to incorporate that with the current tuning implementation in PyMC3 because the step doesn't seem to know anything about how long tuning is and when it is going to finish. This seemed like a problem because then the last adaptation window might finish after tuning and the covariance estimate might never be updated based on the longest run. Does that make sense and is there something I'm missing?

We never need the full matrix, but only matrix products and products with a decomposition. So we could just store the (k, n) matrix V of samples (dim n, num samples k) and then do a svd of that. We could compute the matrix product as (I + sigma VV.T), and use the svd to draw samples.

This is a good suggestion. Would you rather try to merge or point people to covadapt instead of something like this pull request?

What are good window sizes? This needs testing.

Agreed!

codecov · 2019-12-03T18:23:38Z

Codecov Report

Merging #3596 into master will increase coverage by 0.04%.
The diff coverage is 94.82%.

@@            Coverage Diff             @@
##           master    #3596      +/-   ##
==========================================
+ Coverage    89.9%   89.94%   +0.04%     
==========================================
  Files         134      134              
  Lines       20269    20430     +161     
==========================================
+ Hits        18222    18376     +154     
- Misses       2047     2054       +7

Impacted Files	Coverage Δ
pymc3/step_methods/hmc/quadpotential.py	`73.75% <92.39%> (+5.41%)`	⬆️
pymc3/tests/test_quadpotential.py	`95.45% <97.56%> (+3.53%)`	⬆️

dfm · 2019-12-03T18:57:47Z

@aseyboldt:

I added an option that allows the factorization to be recomputed on a different interval (e.g. only every 10 tuning steps) which isn't perfect, but I don't see an obvious better method in the current sampling implementation.
I added a few more tests to make sure that things work as expected and that pm.sample executes.
There is now a UserWarning with an "experimental" warning when a QuadPotentialFullAdapt is instantiated.
I've found that the default window sizes work well for all the problems I've worked on (we've been using it in exoplanet for a few versions now) but I don't have any sense of what kinds of tests you'd want in general.

Let me know if there's something you'd like to see added/removed or if you'd rather skip this and encourage people to use covadapt instead. Thanks!

eigenfoo

A minor nitpick and a comment - thanks for the PR @dfm!

pymc3/step_methods/hmc/quadpotential.py

Co-Authored-By: George Ho <19851673+eigenfoo@users.noreply.github.com>

dfm added 5 commits November 29, 2018 08:01

switching travis badges to svg

063eea6

Merge branch 'master' of https://github.com/pymc-devs/pymc3

22c8695

Merge branch 'master' of https://github.com/pymc-devs/pymc3

2099c9b

Adding adaptation for dense mass matrix

01fd6d9

Moving shared code to QuadPotentialFull base

2af9820

junpenglao requested a review from aseyboldt August 20, 2019 04:49

junpenglao reviewed Aug 20, 2019

View reviewed changes

pymc3/step_methods/hmc/quadpotential.py Outdated Show resolved Hide resolved

adding references to Welford's algorithm

ad20b57

dfm commented Aug 20, 2019

View reviewed changes

pymc3/step_methods/hmc/quadpotential.py Outdated Show resolved Hide resolved

aseyboldt reviewed Aug 21, 2019

View reviewed changes

pymc3/step_methods/hmc/quadpotential.py Outdated Show resolved Hide resolved

dfm added 3 commits October 29, 2019 10:32

Merge branch 'master' of https://github.com/pymc-devs/pymc3

e4d7963

Merge branch 'master' of https://github.com/pymc-devs/pymc3

06ac6b9

Merge branch 'master' into dense-quadpotential

1ac67e7

dfm added 2 commits December 3, 2019 10:33

adding a few more tests for full_adapt

db39838

add test for sampling using QuadPotentialFullAdapt

225a394

eigenfoo approved these changes Dec 4, 2019

View reviewed changes

pymc3/step_methods/hmc/quadpotential.py Outdated Show resolved Hide resolved

pymc3/step_methods/hmc/quadpotential.py Show resolved Hide resolved

eigenfoo reviewed Dec 4, 2019

View reviewed changes

pymc3/step_methods/hmc/quadpotential.py Outdated Show resolved Hide resolved

Update pymc3/step_methods/hmc/quadpotential.py

d97c330

Co-Authored-By: George Ho <19851673+eigenfoo@users.noreply.github.com>

eigenfoo merged commit c412736 into pymc-devs:master Dec 4, 2019

eigenfoo mentioned this pull request Dec 4, 2019

Add multiplier for adaptation window sizes #3705

Merged

eigenfoo mentioned this pull request Mar 20, 2020

Add full adaptation as option to pm.sample #3845

Closed

jdehning mentioned this pull request Apr 17, 2020

Add sampling with dense mass matrix Priesemann-Group/covid19_inference#9

Closed

bmorris3 mentioned this pull request Sep 2, 2020

Use dense mass matrix when sampling bmorris3/dot#32

Open

dfm mentioned this pull request Nov 2, 2020

PR to PyMC3? exoplanet-dev/pymc-ext#4

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Adding support for adaptation of a dense mass matrix based on sample covariances #3596

[WIP] Adding support for adaptation of a dense mass matrix based on sample covariances #3596

dfm commented Aug 19, 2019

junpenglao commented Aug 20, 2019 •

edited

Loading

aseyboldt left a comment

aseyboldt commented Aug 27, 2019

dfm commented Sep 2, 2019

dfm commented Sep 2, 2019

codecov bot commented Dec 3, 2019 •

edited

Loading

dfm commented Dec 3, 2019

eigenfoo left a comment

[WIP] Adding support for adaptation of a dense mass matrix based on sample covariances #3596

[WIP] Adding support for adaptation of a dense mass matrix based on sample covariances #3596

Conversation

dfm commented Aug 19, 2019

junpenglao commented Aug 20, 2019 • edited Loading

aseyboldt left a comment

Choose a reason for hiding this comment

aseyboldt commented Aug 27, 2019

dfm commented Sep 2, 2019

dfm commented Sep 2, 2019

codecov bot commented Dec 3, 2019 • edited Loading

Codecov Report

dfm commented Dec 3, 2019

eigenfoo left a comment

Choose a reason for hiding this comment

junpenglao commented Aug 20, 2019 •

edited

Loading

codecov bot commented Dec 3, 2019 •

edited

Loading