Added RMS normalization layer #2881

chiamp · 2023-02-16T03:56:47Z

Resolves #2849.

Added an optional argument use_mean in the _compute_stats function in flax/linen/normalization.py, which will compute the mean and variance if set to True, and will set the mean to 0 and compute the variance without subtracting the mean if set to False. The latter mode is useful as square rooting this "variance" value (which is done in the _normalize function) will give you the RMS.

codecov-commenter · 2023-02-16T04:05:44Z

Codecov Report

Merging #2881 (9cff780) into main (5d4040a) will increase coverage by 0.02%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##             main    #2881      +/-   ##
==========================================
+ Coverage   81.45%   81.47%   +0.02%     
==========================================
  Files          55       55              
  Lines        5779     5798      +19     
==========================================
+ Hits         4707     4724      +17     
- Misses       1072     1074       +2

Impacted Files	Coverage Δ
flax/linen/__init__.py	`100.00% <ø> (ø)`
flax/linen/normalization.py	`97.41% <100.00%> (+0.29%)`	⬆️
flax/core/scope.py	`89.91% <0.00%> (-0.22%)`	⬇️
flax/linen/module.py	`92.37% <0.00%> (-0.13%)`	⬇️
flax/configurations.py	`85.00% <0.00%> (+0.78%)`	⬆️

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.

marcvanzee · 2023-02-16T08:51:01Z

flax/linen/normalization.py

@@ -335,6 +343,70 @@ def __call__(self, x):
        self.bias_init, self.scale_init)


+class RMSNorm(Module):


Could you please add an example of how to use the layer? I think we should starting doing this for every layer, like @cgarciae does in his RNN PR: https://github.com/google/flax/pull/2604/files#r1107264719.

levskaya

Looks good! Thanks!! -- I second Marc's ask to add a small usage example in the docstring.

chiamp · 2023-02-17T08:49:43Z

@marcvanzee @levskaya I added a docstring, let me know if this works!

levskaya · 2023-02-19T07:22:14Z

@chiamp - I added a exception for the deprecation warning, your tests all seem to pass now!

chiamp · 2023-02-19T07:25:30Z

@chiamp - I added a exception for the deprecation warning, your tests all seem to pass now!

Thanks @levskaya!

levskaya · 2023-02-19T07:31:39Z

flax/linen/normalization.py

+  epsilon: float = 1e-6
+  dtype: Optional[Dtype] = None
+  param_dtype: Dtype = jnp.float32
+  use_bias: bool = True


sorry I just noticed this - we probably don't want use_bias and bias_init here since we're never adjusting the offset?

chiamp self-assigned this Feb 16, 2023

chiamp requested a review from levskaya February 16, 2023 04:21

marcvanzee reviewed Feb 16, 2023

View reviewed changes

chiamp force-pushed the rmslayernorm branch 2 times, most recently from 4a05ad4 to 42ba933 Compare February 17, 2023 03:55

levskaya approved these changes Feb 17, 2023

View reviewed changes

levskaya approved these changes Feb 19, 2023

View reviewed changes

chiamp added the pull ready label Feb 19, 2023

levskaya reviewed Feb 19, 2023

View reviewed changes

Added RMS normalization layer

9cff780

chiamp force-pushed the rmslayernorm branch from 42ba933 to 9cff780 Compare February 19, 2023 07:44

copybara-service bot merged commit 5f0ac50 into google:main Feb 19, 2023

chiamp deleted the rmslayernorm branch February 23, 2023 01:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added RMS normalization layer #2881

Added RMS normalization layer #2881

chiamp commented Feb 16, 2023 •

edited

Loading

codecov-commenter commented Feb 16, 2023 •

edited

Loading

marcvanzee Feb 16, 2023

levskaya left a comment

chiamp commented Feb 17, 2023

levskaya commented Feb 19, 2023

chiamp commented Feb 19, 2023

levskaya Feb 19, 2023

		@@ -335,6 +343,70 @@ def __call__(self, x):
		self.bias_init, self.scale_init)


		class RMSNorm(Module):

Added RMS normalization layer #2881

Added RMS normalization layer #2881

Conversation

chiamp commented Feb 16, 2023 • edited Loading

codecov-commenter commented Feb 16, 2023 • edited Loading

Codecov Report

marcvanzee Feb 16, 2023

Choose a reason for hiding this comment

levskaya left a comment

Choose a reason for hiding this comment

chiamp commented Feb 17, 2023

levskaya commented Feb 19, 2023

chiamp commented Feb 19, 2023

levskaya Feb 19, 2023

Choose a reason for hiding this comment

chiamp commented Feb 16, 2023 •

edited

Loading

codecov-commenter commented Feb 16, 2023 •

edited

Loading