Gradient-free textual inversion for personalized text-to-image generation. We introduce a gradient-free method that uses the OpenAI evolution strategy to optimize the pseudo-word embeddings. Our implementation is fully compatible with diffusers and the Stable Diffusion model.
Evolution process for textual embeddings.
Current personalized text-to-image approaches, which learn to bind a unique identifier to specific subjects or styles in a few given images, usually introduce a special word and tune its embedding parameters through gradient descent. It is natural to ask whether we can optimize the textual inversion by accessing only the inference of the model: requiring only forward computation to determine the textual inversion retains the benefits of efficient computation and safe deployment.
To this end, we introduce a gradient-free framework to optimize the continuous textual inversion in personalized text-to-image generation.
Specifically, we first initialize the textual inversion with parameter-free cross-attention to anchor it in the latent embedding space.
Then, instead of optimizing in the original high-dimensional embedding space, which is intractable for derivative-free optimization, we perform optimization in a decomposed subspace obtained with (i) PCA and (ii) prior normalization, through an iterative evolution strategy.
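The subspace decomposition described above can be sketched roughly as follows. This is a minimal NumPy illustration, not the repository's exact implementation: the vocabulary matrix, embedding dimension, and subspace size are all illustrative assumptions.

```python
import numpy as np

# Hypothetical setup: rows of `vocab` stand in for the text encoder's existing
# token embeddings; we use them to build a low-dimensional search subspace.
rng = np.random.default_rng(0)
vocab = rng.normal(size=(1000, 768))      # assumed vocabulary embeddings
mu, sigma = vocab.mean(0), vocab.std(0)   # prior statistics over the vocabulary

# (ii) prior normalization: whiten embeddings with the vocabulary statistics
normed = (vocab - mu) / sigma

# (i) PCA: keep the top-k principal directions as the optimization subspace
k = 32
_, _, vt = np.linalg.svd(normed, full_matrices=False)
basis = vt[:k]                            # (k, 768) projection matrix

def to_embedding(z):
    """Map a low-dimensional search point z back to a full token embedding."""
    return (z @ basis) * sigma + mu

z = np.zeros(k)                           # the ES only ever optimizes this k-dim vector
emb = to_embedding(z)                     # full-size embedding fed to the model
```

The derivative-free optimizer then searches over `z` (32 dimensions here) rather than the full 768-dimensional embedding, which keeps the black-box search tractable.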
Overview of the proposed gradient-free textual inversion framework.
Some cases generated by standard textual inversion and gradient-free inversion, based on the Stable Diffusion model.
Cases for the personalized text-to-image generation.
To initialize the textual inversion with cross-attention automatically, run:
python initialize_inversion.py
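One plausible sketch of a parameter-free cross-attention initialization is below; the image features, vocabulary table, and weighting scheme are illustrative assumptions, not what the script necessarily does.

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

# Hypothetical inputs: per-image feature vectors and the token embedding table
rng = np.random.default_rng(0)
img_feats = rng.normal(size=(4, 768))     # assumed features of the given images
vocab = rng.normal(size=(1000, 768))      # assumed token embedding table

# Parameter-free cross-attention: attend from image features to vocabulary
# embeddings and average the attended results into one starting embedding.
scores = img_feats @ vocab.T / np.sqrt(768.0)   # (4, 1000) attention logits
attn = np.stack([softmax(s) for s in scores])   # each row sums to 1
init_embedding = (attn @ vocab).mean(axis=0)    # (768,) initial inversion
```

Because no weights are learned here, the initialization needs only forward computation, matching the gradient-free setting.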
Then, to iteratively optimize the textual inversion with the gradient-free evolution strategy, run:
python train_inversion.py
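The training step applies an OpenAI-style evolution strategy, which estimates a descent direction from forward evaluations alone. A minimal sketch on a toy objective follows; the loss function, population size, and step sizes are illustrative assumptions, not the repository's settings.

```python
import numpy as np

def es_step(z, loss_fn, rng, pop=50, sigma=0.1, lr=0.05):
    """One OpenAI-ES update: sample Gaussian perturbations, score each one
    with a forward evaluation only, and step along the estimated gradient."""
    eps = rng.normal(size=(pop, z.size))
    losses = np.array([loss_fn(z + sigma * e) for e in eps])
    centered = losses - losses.mean()     # baseline subtraction for variance reduction
    grad_est = (centered[:, None] * eps).mean(axis=0) / sigma
    return z - lr * grad_est

# Toy usage: minimize a quadratic standing in for the black-box diffusion loss
rng = np.random.default_rng(0)
target = rng.normal(size=16)
loss = lambda z: float(((z - target) ** 2).sum())
z = np.zeros(16)
for _ in range(300):
    z = es_step(z, loss, rng)
# z should now be close to `target`, found without any gradient access
```

In the actual pipeline, `loss_fn` would be a forward pass of the diffusion model on the low-dimensional search point mapped back to a full embedding.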
Finally, with the trained textual inversion, you can generate personalized images with the infer_inversion.py script.
This repository builds on diffusers and the textual inversion script. Thanks for their clear code.