How to use momentum and weightdecay #106

baggepinnen · 2017-11-13T11:59:38Z

I have been trying to figure out how to use the weightdecay function and the Momentum optimizer.
It seems weightdecay should be passed to the optimiser function, e.g., when an optimizer is created. None of the optimizers do however have an option to pass in a weight decay parameter. Am I supposed to create a separate optimizer for weight decay and momentum that is called before the optimizer that calls descent?

The text was updated successfully, but these errors were encountered:

MikeInnes · 2017-12-13T16:29:27Z

The optimiser APIs are pretty clumsy right now; they are supposed to be composable but it doesn't quite work. I'll take a look at redesigning this soon.

baggepinnen · 2017-12-13T17:01:31Z

I did manage to compose an optimizer with weight decay now when a Vector of optimizers is accepted by train!, but the code did not look pretty...

opt  = [ADAM(params(m), 0.01, decay=0.001); [weightdecay(Param(p), 0.002) for p in params(m) if isa(p, AbstractMatrix)]]

MikeInnes · 2019-03-26T16:28:29Z

Hopefully this is easier since #379.

MikeInnes mentioned this issue Apr 14, 2018

Optimisers make me sad #234

Closed

MikeInnes closed this as completed Mar 26, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to use momentum and weightdecay #106

How to use momentum and weightdecay #106

baggepinnen commented Nov 13, 2017

MikeInnes commented Dec 13, 2017

baggepinnen commented Dec 13, 2017

MikeInnes commented Mar 26, 2019

How to use momentum and weightdecay #106

How to use momentum and weightdecay #106

Comments

baggepinnen commented Nov 13, 2017

MikeInnes commented Dec 13, 2017

baggepinnen commented Dec 13, 2017

MikeInnes commented Mar 26, 2019