add Moslora #9331
Conversation
Thanks for your contribution!
Codecov Report
All modified and coverable lines are covered by tests ✅

Additional details and impacted files

@@            Coverage Diff             @@
##           develop    #9331      +/-   ##
===========================================
+ Coverage    52.91%   53.10%   +0.19%
===========================================
  Files          679      685       +6
  Lines       108433   108855     +422
===========================================
+ Hits         57378    57810     +432
+ Misses       51055    51045      -10

☔ View full report in Codecov by Sentry.
llm/tools/merge_lora_params.py
Outdated
@@ -86,16 +86,25 @@ def lora_process(name, lora_config, state_dict, device, lora_state_dict=None):
        return

    weight = state_dict.pop(name + ".weight")
    lora_use_mixer = (lora_state_dict is not None and name + ".lora_AB" in lora_state_dict) or (
Suggestion: read lora_use_mixer directly from lora_config instead of inferring it from state dict keys.
resolved
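A rough sketch of what that suggestion amounts to inside lora_process (the field name comes from the config change later in this diff; the exact merged code may differ):

# Before (key probing):
#   lora_use_mixer = lora_state_dict is not None and name + ".lora_AB" in lora_state_dict ...
# After (config-driven), as the reviewer suggests:
lora_use_mixer = lora_config.lora_use_mixer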
llm/tools/merge_lora_params.py
Outdated
        lora_AB = lora_AB.astype("float32")
        out = (weight + lora_A @ lora_AB @ lora_B * scaling).astype("bfloat16")
    else:
        out = (weight + lora_A @ lora_B * scaling).astype("bfloat16")
Change the astype here to use the dtype from lora_config; the original hard-coded cast was wrong.
resolved
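A hedged sketch of the requested fix: cast the merged weight to the dtype carried in lora_config rather than a hard-coded bfloat16 (the field name lora_config.dtype is an assumption; the merged PR may read it differently):

# Sketch only: use the configured dtype for the merged weight.
target_dtype = lora_config.dtype  # assumed field name for the checkpoint dtype
if lora_use_mixer:
    lora_AB = lora_AB.astype("float32")
    # MosLoRA merge: W' = W + A @ AB @ B * scaling, where AB is the r x r mixer.
    out = (weight + lora_A @ lora_AB @ lora_B * scaling).astype(target_dtype)
else:
    # Plain LoRA merge: W' = W + A @ B * scaling.
    out = (weight + lora_A @ lora_B * scaling).astype(target_dtype)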
@@ -209,6 +209,9 @@ class ModelArgument:
    rslora: bool = field(default=False, metadata={"help": "Whether to use RsLoRA"})
    lora_plus_scale: float = field(default=1.0, metadata={"help": "Lora B scale in LoRA+ technique"})
    pissa: bool = field(default=False, metadata={"help": "Whether to use Pissa: https://arxiv.org/pdf/2404.02948.pdf"})
    lora_use_mixer: bool = field(
        default=False, metadata={"help": "Whether to use MosLoRA: https://arxiv.org/pdf/2406.11909"}
Please also update the documentation accordingly.
resolved
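For the documentation update, a minimal usage sketch; it assumes PaddleNLP's LoRAConfig and LoRAModel accept the new lora_use_mixer flag after this PR, and the other argument values are purely illustrative:

from paddlenlp.peft import LoRAConfig, LoRAModel

# Sketch only: enable MosLoRA by switching on the new mixer flag.
lora_config = LoRAConfig(
    target_modules=[".*q_proj.*", ".*v_proj.*"],  # illustrative target pattern
    r=8,
    lora_alpha=16,
    lora_use_mixer=True,  # MosLoRA switch added in this PR
)
# model = LoRAModel(base_model, lora_config)  # wrap an already-loaded base model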
@@ -443,6 +443,7 @@ def _find_and_replace_module(self, model, module_name, lora_config, enable_lora)
                pissa=lora_config.pissa,
                bias_attr=False if module.bias is None else None,
                use_quick_lora=lora_config.use_quick_lora,
                lora_use_mixer=lora_config.lora_use_mixer,
In LoRAModel's __init__, add a check like: if (tensor_parallel_degree > 1 or pipeline_parallel_degree > 1) and lora_config.lora_use_mixer: raise NotImplementedError("xxx")
resolved
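A minimal sketch of the guard described above, placed in LoRAModel.__init__ (variable names follow the review comment; the exact message and how the parallel degrees are obtained are assumptions):

# Sketch only: MosLoRA is not implemented for tensor/pipeline parallelism,
# so fail fast instead of silently producing a wrong configuration.
if (tensor_parallel_degree > 1 or pipeline_parallel_degree > 1) and lora_config.lora_use_mixer:
    raise NotImplementedError(
        "MosLoRA (lora_use_mixer) does not support tensor or pipeline parallelism yet."
    )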
LGTM
PR types
New features
PR changes
Add MosLoRA support under peft/lora
Description
Add the MosLoRA method (https://arxiv.org/pdf/2406.11909), which inserts a learnable rank-r mixer matrix (lora_AB) between lora_A and lora_B.
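For context, a self-contained sketch of the MosLoRA idea: a learnable r x r mixer matrix lora_AB sits between lora_A and lora_B, so the low-rank update becomes A @ AB @ B instead of A @ B. This is an illustration in Paddle, not the PR's actual LoRALinear code; shapes and initialization are simplified.

import paddle
import paddle.nn as nn

class MosLoRALinearSketch(nn.Layer):
    """Illustrative MosLoRA linear layer; not the PR implementation."""

    def __init__(self, in_features, out_features, r=8, lora_alpha=16):
        super().__init__()
        self.weight = self.create_parameter(shape=[in_features, out_features])
        self.lora_A = self.create_parameter(shape=[in_features, r])
        # MosLoRA addition: an r x r mixer that fuses the rank-r subspaces.
        self.lora_AB = self.create_parameter(shape=[r, r])
        # lora_B starts at zero so the initial delta is zero, as in plain LoRA.
        self.lora_B = self.create_parameter(
            shape=[r, out_features], default_initializer=nn.initializer.Constant(0.0)
        )
        self.scaling = lora_alpha / r

    def forward(self, x):
        base = paddle.matmul(x, self.weight)
        # Delta: x @ A @ AB @ B * scaling (plain LoRA would skip the AB mixer).
        delta = x @ self.lora_A @ self.lora_AB @ self.lora_B
        return base + delta * self.scaling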