Describe the bug
When I check the loss plot on TensorBoard, I see that the validation curve extends to much higher step counts than the training curve. See the screenshot below. Why does training logging end 1062 steps before the validation logging? What is the logic behind this?
Minimum Reproducible Example
A short code snippet which reproduces the exception
Expected behavior
A clear and concise description of what you expected to happen.
Additional context
Add any other context about the problem here.
So, we explicitly run validation on the final model regardless of the val_interval. This is for the purpose of keep_best_model; otherwise we could accidentally waste the final set of steps. For the training loss we just follow the default logging schedule. If you want to track loss values near the end of training, you can change your val_interval so that a final value lands near the end of the run (see the sketch below).
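As a rough illustration (this is not the project's config API; `total_steps`, `val_interval`, and the helper name are hypothetical), you can pick an interval that divides the total number of training steps evenly, so the last scheduled validation/log coincides with the final step:

```python
# Minimal sketch, assuming your run length and validation interval are plain
# integers you set in a config. Pick the largest interval <= your target that
# divides total_steps, so the last scheduled point lands on the final step.

def pick_val_interval(total_steps: int, target_interval: int) -> int:
    """Return the largest interval <= target_interval that divides total_steps."""
    for interval in range(min(target_interval, total_steps), 0, -1):
        if total_steps % interval == 0:
            return interval
    return 1

total_steps = 10_000  # hypothetical run length
val_interval = pick_val_interval(total_steps, target_interval=1062)
print(val_interval)   # 1000 -> points at 1000, 2000, ..., 10000 (the final step)
```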
I'm going to rename this issue to track the feature request of adding a final-step loss log for training.