
Feature request: Add "AND gate" merge method #72

Open
JilekJosef opened this issue May 4, 2023 · 7 comments

@JilekJosef

JilekJosef commented May 4, 2023

Basically it's the exact opposite of the add difference method. Instead of checking whether values (in models B and C) are different enough, you check whether they are similar enough, and then you place those values into model A.
Purpose?

  1. Extracting the concepts that two models have in common into a third model
  2. Fixing model errors / merging without transferring the error values (well, this is probably more of a LoRA thing, since you don't need the third model here; you just preserve the similar vectors and trash the rest)

I was able to successfully test the second point with 2 LoRA epochs. I used torch.norm(A-B) to determine how different they are, which was a bit tricky to configure correctly (I didn't standardise the vectors first, which was probably part of the issue). There is probably a better method than the basic norm that I don't know about, since this was the first time I was doing something like this (I am mostly used to web development in Java). But in the end I believe it proved the concept. As for the first point, I don't know how well it will work, but I believe it shouldn't be hard for you to test, since it probably requires nothing more than a slight modification of the add difference method.
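
A minimal sketch of what such an "AND gate" merge could look like (the element-wise absolute-difference test, the averaging of B and C where they agree, and the threshold value are assumptions for illustration, not something the comment specifies):

import torch

def and_gate_merge(a, b, c, threshold=0.01):
    # "AND gate": take a value from B/C only where B and C are similar enough;
    # everywhere else fall back to the corresponding value from A.
    similar = (b - c).abs() <= threshold         # element-wise similarity mask
    return torch.where(similar, (b + c) / 2, a)  # average B and C where they agree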

@hako-mikan
Owner

Very interesting suggestion. I will think about the implementation.

@zethfoxster

Wouldn't this basically spit out SD 1.5 with only the most common concepts the two models have? What exactly would be a practical use of this?

@JilekJosef
Author

Wouldn't this basically spit out SD 1.5 with only the most common concepts the two models have? What exactly would be a practical use of this?

Basically, concept extraction: in the models case, A + (B AND C) replaces weights in A with the similar weights from B and C. In the LoRA case, when you have multiple epochs, you can just do C = A AND B, which would purify those LoRAs of the redundant things that differ between A and B.
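
In terms of the hypothetical and_gate_merge sketch above, the two cases could look roughly like this (model_a, model_b, model_c, epoch_a, epoch_b and the thresholds are placeholders):

# Models: A + (B AND C) -- replace weights in A where B and C agree.
merged_model = and_gate_merge(model_a, model_b, model_c, threshold=0.01)

# LoRA epochs: C = A AND B -- keep only what both epochs agree on, zero out the rest.
purified_lora = and_gate_merge(torch.zeros_like(epoch_a), epoch_a, epoch_b, threshold=0.01)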

@le-khang

This is the exact idea I have in mind for LoRA merging. I noticed that when training 3 LoRAs (for the same person) using 3 different models and then merging them together, the new LoRA becomes very stable and flexible. I'm not saying it's the best, but it's something like:

  • LoRA A can produce good results, around 9/10, if used with Model A, but with other models it can be a bit random (between 4 and 9/10).
  • The merged LoRA ABC can produce good results across many models (around 7-8/10).
  • I tried merging more LoRAs to see what would happen, but in my experience, with anywhere from more than 4 up to 10 LoRAs, the results average out to around 7-7.5/10.

I think that if we can extract the exact concept without it being polluted by other elements, then we can freely increase its strength to improve the quality & flexibility while also reducing the file size.

@Deathawaits4

Is there any news on this one? I think this could massively increase LoRA usability and put it back up on par with Dreambooth again.

@JilekJosef
Author

I have created this: https://github.com/JilekJosef/loli-diffusion-merger It's sort of a fork of supermerger. However, I have implemented the AND gate for models only, and the calculation method works on a single-value-vs-single-value basis; I believe at least tensor-level comparison should be implemented to make it more usable. @Deathawaits4
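
For reference, a tensor-level comparison could look roughly like this (a hypothetical sketch; the cosine-similarity criterion and the 0.9 cutoff are assumptions, not what loli-diffusion-merger implements):

import torch

def and_gate_merge_tensorwise(a, b, c, min_cos_sim=0.9):
    # Compare B and C per tensor rather than per value: if the flattened tensors
    # point in a similar enough direction, take their average, otherwise keep A.
    cos_sim = torch.nn.functional.cosine_similarity(b.flatten(), c.flatten(), dim=0)
    return (b + c) / 2 if cos_sim >= min_cos_sim else a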

@ljleb

ljleb commented Feb 19, 2024

This suggestion is similar to a weighted geometric average:

import torch

def multiply_difference(a, b, c, alpha):
    # Work on the differences from C in the complex plane so that negative
    # values can be raised to fractional powers.
    a = torch.complex(a - c, torch.zeros_like(a))
    b = torch.complex(b - c, torch.zeros_like(b))
    # Weighted geometric average of the two differences, added back onto C.
    res = a**(1 - alpha) * b**alpha
    return c + res.real

If any parameter difference (A - C or B - C) is 0, then the corresponding difference in the output will be 0, i.e. that parameter falls back to C. If both parameters are close, the output doesn't change much.
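
A quick illustration on toy tensors (hypothetical values, not from the original comment):

a = torch.tensor([0.11, 0.6, -0.1])
b = torch.tensor([0.30, 0.7,  0.4])
c = torch.tensor([0.10, 0.5,  0.0])
print(multiply_difference(a, b, c, alpha=0.5))
# First slot: A barely differs from C, so the merged difference stays small.
# Last slot: the differences have opposite signs, so their geometric average is
# essentially purely imaginary and the real part added back onto C is ~0.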
