[NNX] Add LoRA and LoRALinear to NNX #3929
Conversation
Codecov Report
Attention: Patch coverage is

Additional details and impacted files (Coverage Diff, main vs. #3929):
- Coverage: 60.43% → 0.00% (-60.44%)
- Files: 105 → 102 (-3)
- Lines: 13263 → 13160 (-103)
- Hits: 8015 → 0 (-8015)
- Misses: 5248 → 13160 (+7912)

☔ View full report in Codecov by Sentry.
flax/experimental/nnx/nnx/nn/lora.py
Outdated
def __call__(self, x: jax.Array):
  out = x @ self.lora_a @ self.lora_b
  if self.base_module is not None:
    assert callable(self.base_module), "`base_module` must be callable."
Let's raise an error here instead.
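For instance, a sketch of the requested check inside `__call__` (the line applying `base_module` is assumed from context; the final error type and message may differ):

```python
def __call__(self, x: jax.Array):
  out = x @ self.lora_a @ self.lora_b
  if self.base_module is not None:
    if not callable(self.base_module):
      raise ValueError('`self.base_module` must be callable.')
    out += self.base_module(x)  # assumed: the base output is added to the LoRA update
  return out
```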
Done.
  return out


class LoRALinear(nnx.Linear):
It might be cleaner to inherit from LoRA and create a Linear in __init__ which is passed as the base_module to super().__init__(). This way you can remove __call__.
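Roughly, the suggested shape would be something like this (a sketch only; the keyword names and the LoRA constructor accepting `base_module` and `rngs` are assumptions, not the final API):

```python
from flax.experimental import nnx

class LoRALinear(nnx.LoRA):
  """Sketch: reuse LoRA's __call__ by passing a Linear as the base module."""

  def __init__(self, in_features: int, out_features: int, lora_rank: int, *, rngs: nnx.Rngs):
    # Create the wrapped Linear and hand it to LoRA as `base_module`;
    # LoRA's __call__ is assumed to add its low-rank update to base_module(x),
    # so no __call__ override is needed here.
    linear = nnx.Linear(in_features, out_features, rngs=rngs)
    super().__init__(
      in_features=in_features,
      out_features=out_features,
      lora_rank=lora_rank,
      base_module=linear,
      rngs=rngs,
    )
```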
The reason I provided LoRALinear is that it assumes the exact same param structure and API as Linear. This is also how PyTorch provides their LoRALinear.

If it's just a LoRA instance, the original linear weights will be one level below, inside base_module. I think users can easily create something like that on their own; there is no need for a LoRALinear shortcut for it.
Makes sense.
flax/experimental/nnx/nnx/nn/lora.py
Outdated
in_features: int,
out_features: int,
lora_rank: int,
Maybe this ordering is a bit more intuitive?
Suggested change:
- in_features: int,
- out_features: int,
- lora_rank: int,
+ in_features: int,
+ lora_rank: int,
+ out_features: int,
Sure! LoRA is a generic layer and this ordering is indeed more intuitive. Note that the PyTorch LoRALinear implementation is ordered (in, out, rank). To avoid confusion, I will make lora_rank a keyword argument in our LoRALinear.
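So the two signatures would look roughly like this (a sketch of the intent, not the final API):

```python
from flax.experimental import nnx

class LoRA(nnx.Module):
  # Generic layer: the rank sits between the in/out dims, per the suggestion above.
  def __init__(self, in_features: int, lora_rank: int, out_features: int, *, rngs: nnx.Rngs):
    ...

class LoRALinear(nnx.Linear):
  # Same positional order as Linear (and PyTorch's LoRALinear); the rank is
  # keyword-only so the two orderings can't be mixed up.
  def __init__(self, in_features: int, out_features: int, *, lora_rank: int, rngs: nnx.Rngs):
    ...
```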
flax/experimental/nnx/nnx/nn/lora.py
Outdated
self.lora_a = param_type(
  kernel_init(rngs.params(), (in_features, lora_rank), param_dtype)
)
self.lora_b = param_type(
I'm wondering if there are more informative variable names we can use, like self.lora_in and self.lora_out, or are a and b the standard naming convention for LoRA?
Yeah, unfortunately AFAIK a and b are the convention here...
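For reference, a and b come from the LoRA paper's low-rank factorization of the weight update; in the x @ lora_a @ lora_b form used here, the shapes work out as in this quick sketch (shapes taken from the snippet above; the variable names are only illustrative):

```python
import jax
import jax.numpy as jnp

in_features, lora_rank, out_features = 8, 2, 4
key_a, key_b, key_x = jax.random.split(jax.random.PRNGKey(0), 3)

lora_a = jax.random.normal(key_a, (in_features, lora_rank))   # down-projection to rank r
lora_b = jax.random.normal(key_b, (lora_rank, out_features))  # up-projection back to out_features

x = jax.random.normal(key_x, (3, in_features))
out = x @ lora_a @ lora_b        # low-rank update, shape (3, out_features)
delta_w = lora_a @ lora_b        # equivalent rank-`lora_rank` weight update, (in, out)
assert jnp.allclose(out, x @ delta_w, atol=1e-5)
```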
flax/experimental/nnx/nnx/nn/lora.py
Outdated
precision: numerical precision of the computation see `jax.lax.Precision`
  for details.
kernel_init: initializer function for the weight matrices.
use_lora_param_type: if yes, LoRA params will be of different Param type.
Suggested change:
- use_lora_param_type: if yes, LoRA params will be of different Param type.
+ use_lora_param_type: if ``True``, LoRA params will be of different Param type.
I am thinking maybe it's better to make this argument an explicit type instead of a boolean... more customizability!
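Something along these lines, for example (the argument name, default, and the LoRAParam subclass are assumptions used only to illustrate the idea):

```python
import typing as tp
from flax.experimental import nnx

class LoRAParam(nnx.Param):
  """Marker Param subclass so LoRA weights can be filtered or trained separately."""

# Instead of `use_lora_param_type: bool`, the layer could accept the type itself,
# e.g. `lora_param_type: tp.Type[nnx.Param] = LoRAParam`, and then store:
#   self.lora_a = lora_param_type(
#     kernel_init(rngs.params(), (in_features, lora_rank), param_dtype)
#   )
```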
flax/experimental/nnx/nnx/nn/lora.py
Outdated
precision: numerical precision of the computation see `jax.lax.Precision`
  for details.
kernel_init: initializer function for the weight matrices.
use_lora_param_type: if yes, LoRA params will be of different Param type.
Suggested change:
- use_lora_param_type: if yes, LoRA params will be of different Param type.
+ use_lora_param_type: if ``True``, LoRA params will be of different Param type.
Same here
Should we also add this to Linen?
Provided here are two ways to add LoRA to any layer:

1. nnx.LoRA, with a base_module arg that can take any layer instance. Easy for model surgery on modules, but the resulting state no longer matches the original layer's.
2. nnx.LoRALinear, which subclasses nnx.Linear and attaches a simple nnx.LoRA module along the way. The param structure matches nnx.Linear, but it needs a bit more surgery at runtime. This technique can be applied to any other NNX module.

Not yet sure which approach is best, so both are provided at this moment.
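A rough usage sketch of the two options (constructor signatures follow the ordering discussed above and should be treated as assumptions until the PR settles):

```python
import jax.numpy as jnp
from flax.experimental import nnx

rngs = nnx.Rngs(0)
x = jnp.ones((1, 3))

# 1. nnx.LoRA wrapping an arbitrary base module: easy surgery on any layer,
#    but the wrapped layer's own params end up nested under `base_module`.
lora = nnx.LoRA(3, 2, 4, base_module=nnx.Linear(3, 4, rngs=rngs), rngs=rngs)
y1 = lora(x)

# 2. nnx.LoRALinear: a subclass of nnx.Linear whose param structure matches a
#    plain Linear (kernel, bias), plus the lora_a / lora_b factors.
lora_linear = nnx.LoRALinear(3, 4, lora_rank=2, rngs=rngs)
y2 = lora_linear(x)
```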