Describe the bug
I was working on a fix for #524 and found that early stopping starts to kick in at epoch 3 despite min_epochs = 1.

To Reproduce
Run basic_examples/gpu_template.py and log the callback calls every epoch.

Expected behavior
When setting min_epochs=n (counting from 1), early stopping should be evaluated at the end of epoch n.

Proposed fix:
I propose to change this line in the training loop:
met_min_epochs = epoch > self.min_epochs
to
met_min_epochs = epoch >= self.min_epochs - 1
Why the "-1"? The epoch variable in the training loop starts at 0, but the Trainer argument min_epochs starts counting at 1.
Why the ">="? The early-stop check is done at the end of each epoch, so the epoch counter equals min_epochs - 1 once min_epochs epochs have passed, and the check must already fire at that value.
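The off-by-one can be demonstrated with a minimal sketch. The loop structure and the helper name below are hypothetical (not from the Lightning codebase); only the two gate conditions are taken from the issue:

```python
# Hypothetical 0-indexed training loop illustrating the off-by-one:
# `epoch` starts at 0, while the user-facing `min_epochs` counts from 1.

def first_check_epoch(min_epochs, condition, max_epochs=10):
    """Return the 1-based epoch at which early stopping is first evaluated."""
    for epoch in range(max_epochs):          # epoch = 0, 1, 2, ...
        # The early-stop check runs at the end of each epoch.
        if condition(epoch, min_epochs):
            return epoch + 1                 # report in 1-based terms
    return None

# Buggy gate: with min_epochs=1, the check first fires in the 3rd epoch,
# matching the reported behavior.
buggy = first_check_epoch(1, lambda e, m: e > m)
# Proposed gate: the check fires at the end of the 1st epoch, as expected.
fixed = first_check_epoch(1, lambda e, m: e >= m - 1)
print(buggy, fixed)  # 3 1
```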
Submit the PR for this?
yep. was waiting for an approval.