Feature/issue 2670 warmup stepsize factor #2675

Tuxonomics · 2018-10-19T20:07:11Z

Submission Checklist

Run unit tests: ./runTests.py src/test/unit
Run cpplint: make cpplint
Declare copyright holder and open-source license: see below

Summary

As in issue 2670.
The stepsize factor is exposed in the services via function overload of the existing algorithms.

Intended Effect

How to Verify

Side Effects

Documentation

Copyright and Licensing

Please list the copyright holder for the work you are submitting (this will be you or your assignee, such as a university or company): Jonas Kose

By submitting this pull request, the copyright holder is agreeing to license the submitted work under the following licenses:

Code: BSD 3-clause (https://opensource.org/licenses/BSD-3-Clause)
Documentation: CC-BY 4.0 (https://creativecommons.org/licenses/by/4.0/)

Tuxonomics · 2018-10-22T18:57:57Z

@bob-carpenter I added a unit test for the getter and setter of the new member in the stepsize_adaptation class. Regarding the services, the current unit tests are testing also the changes to the code.

bob-carpenter · 2018-10-22T20:33:47Z

I'm not sure why you're pinging me. @betanalpha would be better to review this one.

bob-carpenter · 2018-10-22T20:34:20Z

Also, we can't take anythnig without copyright assignment. You need to declare that you (or your assignee, such as a university), own the code and license it under the BSD.

Tuxonomics · 2018-10-23T07:19:40Z

Sorry, I didn't know who to ping. I pinged you because you accepted the issue, no more thought went into it.

@betanalpha Could you have a look at this? (PR regarding https://discourse.mc-stan.org/t/issue-with-dual-averaging/5995/27)

bob-carpenter · 2018-10-24T17:21:18Z

I asked @betanalpha who's busy most of this week but can get to it next week. If that doesn't work, I can review it next week.

Also, just a heads up that this PR shouldn't change any adaptation strategies. If we change existing behavior, we'll want to test them thoroughly for performance and include them in a separate PR.

Tuxonomics · 2018-10-24T18:58:03Z

@bob-carpenter This PR keeps all the defaults. It only allows to set the coefficient in the assignment mu = log( mu_c * epsilon ). By default the coefficient is still 10.0.

I don't mind waiting for @betanalpha, but I would postpone working on an RStan PR until then.

bob-carpenter · 2018-10-24T23:05:12Z

You should be able to build on top of the branch. We want to expose this, so we'll make sure we can get it working.

Tuxonomics · 2018-10-25T07:11:29Z

Yes, I know. But I would like to avoid unnecessary refactoring. Once this PR here is cleared, the RStan changes should be easy too.

betanalpha · 2018-10-26T21:10:22Z

A few comments.

I think that the name mu_c should be changed. mu_c isn't on the same scale as mu, as it's the logarithm of mu_c that actually translates mu. Indeed it seems like expanding the log would be more interpretable, in which case we'd write

this->stepsize_adaptation_.set_mu(  log(this->nom_epsilon_) 
                                                           + this->stepsize_adaptation_.get_delta_mu());

with delta_mu defaulting to log(10).

We could keep the current logic but then any meaningful naming would be a bit unwieldy. Maybe log_mu_eps_scale?

The other big thing is that we shouldn't overload the service routes with a version that sets this new parameter and one that doesn't. There are already too many service routes to maintain and no where else do we allow parameters to be optionally set. We'll need to keep the expanded routes and then update all of the interfaces at the same time. @seantalts, do we have a well-established procedure for this yet, or would this be optimal after the move to the monorepo?

bob-carpenter · 2018-10-26T21:40:33Z

This is why we need to move to a builder-like pattern here with defaults for config so changes like this can be made without disrupting the interfaces. I'll have a design for that soon.

@seantalts, do we have a well-established procedure for this yet, or would this be optimal after the move to the monorepo?

The preferred way forward is to implement both, deprecate the old one, and then when the interfaces catch up, remove the deprecated one. It's a bit of a pain, but it's easier than manually finagling the continuous integration testing---we just don't have multiple platforms, so it's disruptive.

I'm afraid the monorepo won't help as that's only going to be math/stan/cmdstan, not rstan and pystan. The good news is that it looks like that's about to happen.

Tuxonomics · 2018-10-27T09:01:05Z

We could keep the current logic but then any meaningful naming would be a bit unwieldy. Maybe log_mu_eps_scale?

I will change the name.

The other big thing is that we shouldn't overload the service routes with a version that sets this new parameter and one that doesn't.

I agree that an overload is not a good solution. I used it because it was already in place.

The preferred way forward is to implement both, deprecate the old one, and then when the interfaces catch up, remove the deprecated one.

What does that mean for this PR now, also regarding the deprecation? Do I just change the variable name and then come back in a few months to clean up the services?

bob-carpenter · 2018-10-27T23:42:19Z

I think overloading is OK here to maintain backward compatibility. And yes, we'll clean up the ones that aren't used in Stan 3 at the latest, but we can also do it before then if PyStan and RStan and CmdStan all catch up to the new service code.

betanalpha · 2018-10-29T15:16:24Z

What we typically do in this circumstance is create corresponding forks of the three main interfaces (PyStan, RStan, and CmdStan) whose submodules point to the forked Stan repository. Once those forked interfaces are ready then all four can be merged to the corresponding develop branches in quick succession. It’s a pain, but something we occasionally have to do so we have some precedent procedure for the dev ops once the interface forks have been reviewed. It also avoids adding a bunch of unused routes that clutter up the Stan API and make adding similar features in the future even more of a pain.

…

On Oct 27, 2018, at 7:42 PM, Bob Carpenter ***@***.***> wrote: I think overloading is OK here to maintain backward compatibility. And yes, we'll clean up the ones that aren't used in Stan 3 at the latest, but we can also do it before then if PyStan and RStan and CmdStan all catch up to the new service code. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#2675 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ABdNlr7CCf3MVfLA4kRMMcyd7k3rYymlks5upO9cgaJpZM4XxWFs>.

bob-carpenter · 2018-10-29T16:37:04Z

Modifying Stan will cause the upstream tests to fail unless any new arguments get defaults (which forces them to be the last arguments). Typically, the dev ops needs to be finagled to connect the appropriate upstream version to the downstream tests. The bigger question is whether we should allow changes at the Stan level that are *not* reflected with matching PRs in PyStan, RStan, and CmdStan. Requiring four PRs across four repos in three languages is a bit of a hurdle for most devs (including me).

Tuxonomics · 2018-10-30T16:34:09Z

I renamed the variable. The name speaks for itself though I think it is a bit too verbose. It makes the code less readable.

Now I would just like to know how to proceed.

In general, the current function overloads will not break the interfaces. The users will just not be able to set this new parameter. Overall I'd also prefer a more structured approach, but should that now mean that I wait for the new builder, or can we proceed with this PR here?

rok-cesnovar · 2021-03-15T11:35:00Z

Closing stale stan-dev/stan pull requests. This one has been stale for 2+ years and has a lot of unresolved conflicts. Feel free to reopen if you wish to continue on this.

Jonas Kose added 6 commits October 19, 2018 17:13

stepsize coefficient in stepsize_adaptation

f5cbe88

call stepsize factor in hmc algorithms

b67e53e

added stepsize factor to services

ab64da7

small refactor

a956429

stepsize adaptation test mu_c

360d8c1

typo corrected

9ea4ef4

Tuxonomics changed the title ~~[WIP] Feature/issue 2670 warmup stepsize factor~~ Feature/issue 2670 warmup stepsize factor Oct 22, 2018

reorder stepsize_adaptation members

eed472a

renamed variable

331a49d

serban-nicusor-toptal added this to the 2.21.0 milestone Oct 18, 2019

serban-nicusor-toptal modified the milestones: 2.22.0++, 2.23.0++ Apr 22, 2020

serban-nicusor-toptal modified the milestones: 2.23.0++, 2.24.0++ Jul 28, 2020

serban-nicusor-toptal removed this from the 2.24.0++ milestone Nov 3, 2020

serban-nicusor-toptal added this to the 2.25.0++ milestone Nov 3, 2020

rok-cesnovar closed this Mar 15, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature/issue 2670 warmup stepsize factor #2675

Feature/issue 2670 warmup stepsize factor #2675

Tuxonomics commented Oct 19, 2018 •

edited

Loading

Tuxonomics commented Oct 22, 2018 •

edited

Loading

bob-carpenter commented Oct 22, 2018

bob-carpenter commented Oct 22, 2018

Tuxonomics commented Oct 23, 2018

bob-carpenter commented Oct 24, 2018

Tuxonomics commented Oct 24, 2018

bob-carpenter commented Oct 24, 2018 via email

Tuxonomics commented Oct 25, 2018

betanalpha commented Oct 26, 2018

bob-carpenter commented Oct 26, 2018 •

edited

Loading

Tuxonomics commented Oct 27, 2018

bob-carpenter commented Oct 27, 2018

betanalpha commented Oct 29, 2018 via email

bob-carpenter commented Oct 29, 2018 via email

Tuxonomics commented Oct 30, 2018

rok-cesnovar commented Mar 15, 2021

Feature/issue 2670 warmup stepsize factor #2675

Feature/issue 2670 warmup stepsize factor #2675

Conversation

Tuxonomics commented Oct 19, 2018 • edited Loading

Submission Checklist

Summary

Intended Effect

How to Verify

Side Effects

Documentation

Copyright and Licensing

Tuxonomics commented Oct 22, 2018 • edited Loading

bob-carpenter commented Oct 22, 2018

bob-carpenter commented Oct 22, 2018

Tuxonomics commented Oct 23, 2018

bob-carpenter commented Oct 24, 2018

Tuxonomics commented Oct 24, 2018

bob-carpenter commented Oct 24, 2018 via email

Tuxonomics commented Oct 25, 2018

betanalpha commented Oct 26, 2018

bob-carpenter commented Oct 26, 2018 • edited Loading

Tuxonomics commented Oct 27, 2018

bob-carpenter commented Oct 27, 2018

betanalpha commented Oct 29, 2018 via email

bob-carpenter commented Oct 29, 2018 via email

Tuxonomics commented Oct 30, 2018

rok-cesnovar commented Mar 15, 2021

Tuxonomics commented Oct 19, 2018 •

edited

Loading

Tuxonomics commented Oct 22, 2018 •

edited

Loading

bob-carpenter commented Oct 26, 2018 •

edited

Loading