It would be great to allow the Flux optimizers to have parameters that can be tweaked from the outside. As an example, I would like to have an SGD whose learning rate I can tune over time. I could do this by destroying the optimizer and recreating it every epoch (which is fine for SGD), but that hurts optimizers that keep running estimates of gradient magnitude, such as ADAM or RMSProp, because recreating them discards those estimates.
I am thinking of treating the optimizers as structs rather than functions. For example, ADAM would be a struct with the learning rate, momentum, velocity, etc. as its fields, and there would be an update function that takes this struct and performs the step, updating the momentum and velocity in place. That way we can mutate the learning rate at any point without losing the accumulated state. A rough sketch of what I mean is below.
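Here is a minimal sketch of the struct-based idea. The names `ADAM` and `update!` and the exact field layout are illustrative assumptions, not Flux's actual API:

```julia
# Sketch only: names and fields are hypothetical, not Flux's eventual interface.
mutable struct ADAM
    eta::Float64                  # learning rate, tweakable between updates
    beta::Tuple{Float64,Float64}  # decay rates for the moment estimates
    eps::Float64
    state::IdDict                 # per-parameter (m, v, t) running estimates
end

ADAM(eta = 0.001, beta = (0.9, 0.999)) = ADAM(eta, beta, 1e-8, IdDict())

# One Adam step for parameter array `p` with gradient `g`; the moment
# estimates survive across calls because they live in the struct.
function update!(opt::ADAM, p::AbstractArray, g::AbstractArray)
    b1, b2 = opt.beta
    m, v, t = get!(opt.state, p, (zero(p), zero(p), 0))
    t += 1
    @. m = b1 * m + (1 - b1) * g
    @. v = b2 * v + (1 - b2) * g ^ 2
    @. p -= opt.eta * (m / (1 - b1 ^ t)) / (sqrt(v / (1 - b2 ^ t)) + opt.eps)
    opt.state[p] = (m, v, t)
    return p
end
```

With this, a learning-rate schedule is just a field mutation between epochs, e.g. `opt = ADAM(0.01)` and then `opt.eta *= 0.95` after each epoch, while `opt.state` keeps the running moment estimates intact.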