Feature request
The paper Controlling Text-to-Image Diffusion by Orthogonal Finetuning proposes a new method for fine-tuning text-to-image diffusion models by applying learned orthogonal transformations to the layers of the pretrained model.
This preserves the hyperspherical energy of the pretrained model (the sum of pairwise hyperspherical similarities, e.g. cosine similarity, between all neurons in the same layer), which leads to better generalization, more stable training, and faster convergence. The transformation is essentially a rotation of the neurons.
In theory, OFT can be applied to any layer and has some interesting interpretations for convolutional layers. For a fair comparison, the original paper only applies OFT to the same layers as LoRA.
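To illustrate the core idea, here is a rough sketch (not the paper's reference implementation; the class name OFTLinear and the skew_params attribute are made up for this example): a frozen nn.Linear whose output neurons are rotated by a learned orthogonal matrix, kept orthogonal via the Cayley transform of a skew-symmetric parameter.

```python
import torch
import torch.nn as nn


class OFTLinear(nn.Module):
    """Hypothetical sketch of an OFT-style adapter around a frozen nn.Linear."""

    def __init__(self, base_layer: nn.Linear):
        super().__init__()
        self.base_layer = base_layer
        for p in self.base_layer.parameters():
            p.requires_grad = False  # the pretrained weights stay frozen

        out_features = base_layer.out_features
        # Free parameters of a skew-symmetric matrix; initialized to zero
        # so that R = I and training starts exactly at the pretrained model.
        self.skew_params = nn.Parameter(torch.zeros(out_features, out_features))

    def orthogonal_matrix(self) -> torch.Tensor:
        # Cayley transform: for skew-symmetric S, R = (I - S)^{-1} (I + S) is orthogonal.
        s = self.skew_params - self.skew_params.T  # enforce skew-symmetry
        eye = torch.eye(s.shape[0], device=s.device, dtype=s.dtype)
        return torch.linalg.solve(eye - s, eye + s)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Rotating the rows of W with an orthogonal R preserves all pairwise
        # angles (cosine similarities) between neurons, i.e. the hyperspherical
        # energy of the pretrained layer. The bias is left untouched for simplicity.
        rotated_weight = self.orthogonal_matrix() @ self.base_layer.weight
        return nn.functional.linear(x, rotated_weight, self.base_layer.bias)


# Quick shape check of the hypothetical adapter.
layer = OFTLinear(nn.Linear(16, 8))
print(layer(torch.randn(4, 16)).shape)  # torch.Size([4, 8])
```

If I remember correctly, the paper additionally constrains the orthogonal matrix to be block-diagonal to keep the number of trainable parameters small; the sketch above uses a full matrix only for simplicity.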
The corresponding repository gained 200 stars in 5 months, so there is definitely interest in the method. I think what is lacking is an easy-to-use implementation.
Motivation
I stumbled upon this paper and think it is a really unique approach to fine-tuning diffusion models that clearly works well in practice. I believe an easy-to-use implementation is all that is missing for this method to benefit a lot of users.
Your contribution
I would love to contribute a full implementation of this method to peft, if there is no reason not to include it in the main branch.