
BatchNorm guide #2536

Merged 4 commits into google:main on Nov 28, 2022

Conversation

@cgarciae (Collaborator) commented Oct 15, 2022

What does this PR do?

Add a "Using BatchNorm" guide with the following content:

  • How to define a model that uses BatchNorm
  • How to handle the batch_stats collection
  • How to configure apply
  • How to modify TrainState, train_step, and eval_step to handle a model with BatchNorm.

Our codediff Sphinx directive is used throughout to highlight the changes needed to add BatchNorm support to existing code.

Live preview: https://flax--2536.org.readthedocs.build/en/2536/guides/batch_norm.html
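Since the guide itself lives behind the preview link, here is a minimal sketch of the pattern it covers, assuming a hypothetical `MLP` model and `image`/`label` batch fields (the guide's exact code may differ; see the live preview above):

```python
from typing import Any

import jax
import jax.numpy as jnp
import optax
import flax.linen as nn
from flax.training import train_state


class MLP(nn.Module):  # hypothetical model, for illustration only
  @nn.compact
  def __call__(self, x, train: bool):
    x = nn.Dense(features=128)(x)
    # BatchNorm normalizes with batch statistics during training and
    # with stored running averages (the `batch_stats` collection) at eval.
    x = nn.BatchNorm(use_running_average=not train)(x)
    x = nn.relu(x)
    return nn.Dense(features=10)(x)


model = MLP()
# init now returns two collections: 'params' and 'batch_stats'.
variables = model.init(jax.random.PRNGKey(0), jnp.ones((1, 64)), train=False)


class TrainState(train_state.TrainState):
  batch_stats: Any  # carry the running statistics alongside the params


state = TrainState.create(
    apply_fn=model.apply,
    params=variables['params'],
    batch_stats=variables['batch_stats'],
    tx=optax.adam(1e-3),
)


@jax.jit
def train_step(state, batch):
  def loss_fn(params):
    # mutable=['batch_stats'] makes apply also return the updated statistics.
    logits, updates = state.apply_fn(
        {'params': params, 'batch_stats': state.batch_stats},
        batch['image'], train=True, mutable=['batch_stats'])
    loss = optax.softmax_cross_entropy_with_integer_labels(
        logits=logits, labels=batch['label']).mean()
    return loss, updates

  (loss, updates), grads = jax.value_and_grad(loss_fn, has_aux=True)(state.params)
  state = state.apply_gradients(grads=grads)
  state = state.replace(batch_stats=updates['batch_stats'])
  return state, loss


@jax.jit
def eval_step(state, batch):
  # No mutable collections at eval time: read the stored running averages.
  return state.apply_fn(
      {'params': state.params, 'batch_stats': state.batch_stats},
      batch['image'], train=False)
```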

@review-notebook-app (bot)

Check out this pull request on ReviewNB to see visual diffs & provide feedback on Jupyter Notebooks.

@zaxtax (Collaborator) commented Oct 19, 2022

I think this is good, but it would help if the code could be fully copy-pasted, so maybe include all relevant import statements.

I wouldn't use the word "just": even though it's a small change, the introduction of the train flag is confusing and surprising. As you say, BatchNorm is often the first layer most users encounter that behaves very differently at train vs. test time. I think that's worth dwelling on (see the sketch below).

Minor:
"BatchNorm add an" -> "BatchNorm adds an"

@codecov-commenter commented Nov 3, 2022

Codecov Report

Merging #2536 (de58b2a) into main (478cffe) will increase coverage by 1.33%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##             main    #2536      +/-   ##
==========================================
+ Coverage   79.44%   80.78%   +1.33%     
==========================================
  Files          49       50       +1     
  Lines        5211     5365     +154     
==========================================
+ Hits         4140     4334     +194     
+ Misses       1071     1031      -40     
Impacted Files Coverage Δ
flax/training/checkpoints.py 65.86% <0.00%> (-0.51%) ⬇️
flax/io.py 85.26% <0.00%> (ø)
flax/linen/module.py 92.64% <0.00%> (+0.01%) ⬆️
flax/core/scope.py 89.84% <0.00%> (+0.04%) ⬆️
flax/metrics/tensorboard.py 92.85% <0.00%> (+0.08%) ⬆️
flax/errors.py 85.14% <0.00%> (+0.45%) ⬆️
flax/config.py 92.85% <0.00%> (+15.93%) ⬆️
flax/struct.py 96.92% <0.00%> (+18.46%) ⬆️
flax/serialization.py 94.44% <0.00%> (+25.35%) ⬆️
flax/training/train_state.py 72.22% <0.00%> (+72.22%) ⬆️


@cgarciae cgarciae marked this pull request as ready for review November 4, 2022 17:04
@cgarciae cgarciae requested review from 8bitmp3 and levskaya and removed request for 8bitmp3 November 4, 2022 17:05
@cgarciae (Collaborator, Author) commented Nov 4, 2022

Hey @zaxtax, thanks for the feedback! What did you mean by:

I think this is good, but it could help if the code could be fully copy-pasted.

Like a "copy to clipboard" widget?

@8bitmp3 8bitmp3 self-requested a review November 4, 2022 21:38
@8bitmp3 8bitmp3 self-assigned this Nov 4, 2022
@zaxtax (Collaborator) commented Nov 4, 2022 via email

@cgarciae (Collaborator, Author) commented Nov 5, 2022

Makes sense! Once the content for this guide is finalized, the idea is to copy it into a notebook and add an "Open in Colab" link.

@levskaya (Collaborator) left a comment:

This is really awesome, I don't have any useful critiques at the moment. lgtm!!!!

@marcvanzee (Collaborator) left a comment:

Just some small nits, but overall it looks really great! This is a super useful tutorial: BatchNorm is quite tricky, and this guide explains exactly what needs to be changed.

LGTM, assuming you incorporate my changes.

(10 resolved review threads on docs/guides/batch_norm.rst, now outdated)
@cgarciae (Collaborator, Author) commented:

Hey @marcvanzee! Is this ready to go? 🚀

@marcvanzee (Collaborator) commented:

Sure. I thought @8bitmp3 might want to have a look, but I think he has plenty of other tasks, and given that I already looked at it in quite some detail, I think we can merge it.

@marcvanzee (Collaborator) commented:

@8bitmp3 if you still have comments, feel free to create a new PR with suggested modifications.

@8bitmp3 (Collaborator) left a comment:

Reviewed and updated. Thank you 👍

@marcvanzee (Collaborator) commented:

Thanks @8bitmp3, great improvements!

@copybara-service copybara-service bot merged commit d291af9 into google:main Nov 28, 2022