Warmup updates bug for LR < 1 #4384

emilydinan · 2022-02-28T20:32:12Z

Patch description
#4242 introduced a bug in which _is_warming_up returns True if the last LR is < 1. This will introduce a bug for all initial LRs != 1. In particular, for models with initial learning rate < 1, the model will be "warming up" forever, and the LR will remain constant after the warmup updates have finished rather than starting to decay according to the provided schedule.

Testing steps

parlai tm -t convai2 -m transformer/generator --lr-scheduler linear --warmup-updates 10 -lstep 1 -vstep 10000000 --max-lr-steps 100 --skip-generation True --warmup-rate 0.01 -lr 0.00001 --dict-file /tmp/test123.dict -mf /tmp/test1234dsfsdf5

I had to relax restrictions to get tests to pass. If we change self._number_training_updates < self.warmup_updates -->, self._number_training_updates <= self.warmup_updates, we hit the exact max LR, but don't quite anneal to zero. Will leave it to follow up PR (Jude) to test this more robustly

stephenroller

Ty

emilydinan · 2022-02-28T22:42:54Z

Tests pass locally

stephenroller · 2022-03-01T16:17:18Z

@klshuster and @jxmsML confirm this does NOT affect reduceonplataeu

stephenroller · 2022-03-01T16:18:36Z

@spencerp found this did NOT affect invsqrt

revert bug

de19c56

facebook-github-bot added the CLA Signed label Feb 28, 2022

emilydinan requested a review from stephenroller February 28, 2022 20:32

relax restrictions

db46fa0

stephenroller approved these changes Feb 28, 2022

View reviewed changes

even more relaxed :/

2f3aa1b

emilydinan merged commit 9d77adb into main Feb 28, 2022

emilydinan deleted the lrschedbug branch February 28, 2022 22:43

juderoque mentioned this pull request Mar 3, 2022

Lrsched missing step #4392

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Warmup updates bug for LR < 1 #4384

Warmup updates bug for LR < 1 #4384

emilydinan commented Feb 28, 2022 •

edited

Loading

stephenroller left a comment

emilydinan commented Feb 28, 2022

stephenroller commented Mar 1, 2022

stephenroller commented Mar 1, 2022

Warmup updates bug for LR < 1 #4384

Warmup updates bug for LR < 1 #4384

Conversation

emilydinan commented Feb 28, 2022 • edited Loading

stephenroller left a comment

Choose a reason for hiding this comment

emilydinan commented Feb 28, 2022

stephenroller commented Mar 1, 2022

stephenroller commented Mar 1, 2022

emilydinan commented Feb 28, 2022 •

edited

Loading