
Conversation

@SaminYeasar

• Added merging methods:

  • SparseMerge
  • SLERP (a minimal interpolation sketch follows this list)
  • LERP
  • TiesMerge: corrected the implementation
  • TaskArithmetic
  • ModelBreadcrumbs
  • UniformMerge

• Added different scoring functions:

  • grow and drop
  • layer drop + sparse
  • model-wise sparse
  • gradient-magnitude-based sparse
  • weight-magnitude-based sparse
  • added a backward hook that masks gradients during backprop (see the sketch after this list)
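For reference, SLERP interpolates along the great circle between the two flattened weight vectors, while LERP takes the straight chord between them. A minimal sketch, not this PR's implementation; the function signature and the LERP fallback threshold are assumptions:

```python
import torch

def slerp(w_a: torch.Tensor, w_b: torch.Tensor, t: float = 0.5) -> torch.Tensor:
    """Spherical linear interpolation between two weight tensors of equal shape."""
    a, b = w_a.flatten().float(), w_b.flatten().float()
    cos_omega = torch.clamp((a / a.norm()) @ (b / b.norm()), -1.0, 1.0)
    omega = torch.acos(cos_omega)
    if omega < 1e-4:
        # Near-parallel vectors: SLERP is numerically unstable, fall back to LERP.
        merged = (1 - t) * a + t * b
    else:
        sin_omega = torch.sin(omega)
        merged = (torch.sin((1 - t) * omega) * a + torch.sin(t * omega) * b) / sin_omega
    return merged.view_as(w_a)
```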
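The backward hook plausibly follows PyTorch's tensor-level `register_hook` pattern; the closure's tail matches the `return grads * keep_mask` fragment quoted later in this review, and everything beyond that fragment is an assumption:

```python
import torch

def get_mask_gradient_hook(keep_mask: torch.Tensor):
    # The hook runs during backprop and zeroes gradients outside the mask,
    # so masked-out weights are never updated by the optimizer.
    def hook(grads: torch.Tensor) -> torch.Tensor:
        return grads * keep_mask
    return hook

# Hypothetical usage on a sparse layer's weight:
weight = torch.nn.Parameter(torch.randn(4, 4))
keep_mask = (torch.rand_like(weight) > 0.5).float()
weight.register_hook(get_mask_gradient_hook(keep_mask))
```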

@SaminYeasar
Author

@microsoft-github-policy-service agree company="Microsoft"

```python
        return grads * keep_mask
    return hook

def load_mask(f_name):
```
Contributor

Is there a way we can avoid needing these functions `save_mask` and `load_mask`?

`weight_mask` should be included in `state_dict` by now, so it can be reloaded from checkpoints?
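For context, registering the mask as a buffer is the standard way to get it into `state_dict`; a minimal sketch with hypothetical module and attribute names, not the project's actual class:

```python
import torch
import torch.nn as nn

class SparseLayer(nn.Module):
    def __init__(self, in_features: int, out_features: int):
        super().__init__()
        self.weight = nn.Parameter(torch.randn(out_features, in_features))
        # Buffers are written to state_dict and restored by load_state_dict,
        # so checkpointing covers the mask without save_mask / load_mask helpers.
        self.register_buffer("weight_mask", torch.ones_like(self.weight))
```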

```python
    keep_masks = torch.zeros_like(m.sparse_layer.weight)
    m.revert_weight_grad_and_update_mask(keep_masks)
# based on gradient-magnitude
elif parameter_selection_procedure == 'gradient_magnitude':
```
Contributor

Can you add tests for the new permutations of these parameters in `test_sparse_mask.py`?
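A sketch of how the new permutations could be parametrized in `test_sparse_mask.py`, assuming pytest; the factory below is a stand-in for however the project actually builds a sparse adapter:

```python
import pytest
import torch

def build_sparse_adapter(sparse_cat, parameter_selection_procedure):
    # Stand-in for the project's real constructor; the actual test would
    # build the sparse adapter with these options instead.
    class _Adapter:
        weight = torch.randn(8, 8)
        weight_mask = (torch.rand(8, 8) > 0.5).float()
    return _Adapter()

@pytest.mark.parametrize("sparse_cat", ["block_sparse", "regular_sparse"])
@pytest.mark.parametrize(
    "parameter_selection_procedure",
    ["gradient_magnitude", "weight_magnitude"],
)
def test_mask_shape_and_values(sparse_cat, parameter_selection_procedure):
    adapter = build_sparse_adapter(sparse_cat, parameter_selection_procedure)
    # Every combination should yield a binary mask matching the weight shape.
    assert adapter.weight_mask.shape == adapter.weight.shape
    assert ((adapter.weight_mask == 0) | (adapter.weight_mask == 1)).all()
```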

```python
        self.sparse_layer.forward = types.MethodType(mod_forward, self.sparse_layer)

    @torch.no_grad()
    def convert_sparse_weight_to_1D(self):
```
Contributor

Is this ever used? Ctrl+F doesn't find anything for me.

```python
if m.sparse_cat == 'block_sparse':
    keep_masks = get_block_mask(m)
elif m.sparse_cat == "regular_sparse":
    # check: sample noise-block-idx
```
Contributor

Remove the commented-out checks when ready; optionally add logging instead.
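If those checks are worth keeping visible, the standard-library route is enough for the optional logging; the message text here is assumed:

```python
import logging

logger = logging.getLogger(__name__)

# Instead of the inline "# check: sample noise-block-idx" comment:
logger.debug("regular_sparse: sampling noise-block indices")
```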

moved merging models to a separate folder; updated essential functions and arguments for sparse-adapter training
@SaminYeasar
Author

Will make a new PR.
