GitHub - genforce/freecontrol: Official implementation of CVPR 2024 paper: "FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition"

FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition

[Paper] [Project Page]

Sicheng Mo^1*, Fangzhou Mu^2*, Kuan Heng Lin¹, Yanli Liu³, Bochen Guan³, Yin Li², Bolei Zhou¹
¹ UCLA, ² University of Wisconsin-Madison, ³ Innopeak Technology, Inc
^* Equal contribution
Computer Vision and Pattern Recognition (CVPR), 2024

Overview

This is the official implementation of FreeControl, a Generative AI algorithm for controllable text-to-image generation using pre-trained Diffusion Models.

Changelog

10/21/2024: Added SDXL pipeline (thanks to @shirleyzhu233).
02/19/2024: Initial code release. The paper is accepted to CVPR 2024.

Getting Started

Environment Setup

We provide a conda env file for environment setup.

conda env create -f environment.yml
conda activate freecontrol
pip install -U diffusers 
pip install -U gradio

Sample Semantic Bases

We provide three sample scripts in the scripts folder (one for each base model) to showcase how to compute target semantic bases.
You may also download pre-computed bases from google drive. Put them in the dataset folder and launch the gradio demo.

Gradio demo

We provide a graphical user interface (GUI) for users to try out FreeControl. Run the following command to start the demo.

python gradio_app.py

Galley:

We are building a gallery of images generated with FreeControl. You are welcome to share your generated images with us.

Contact

Sicheng Mo (smo3@cs.ucla.edu)

Reference

@article{mo2023freecontrol,
  title={FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition},
  author={Mo, Sicheng and Mu, Fangzhou and Lin, Kuan Heng and Liu, Yanli and Guan, Bochen and Li, Yin and Zhou, Bolei},
  journal={arXiv preprint arXiv:2312.07536},
  year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
config		config
dataset		dataset
docs		docs
libs		libs
scripts		scripts
.gitignore		.gitignore
README.md		README.md
environment.yml		environment.yml
gradio_app.py		gradio_app.py
sample_semantic_bases.py		sample_semantic_bases.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition

[Paper] [Project Page]

Overview

Changelog

Getting Started

Galley:

Contact

Reference

About

Contributors 3

Languages

genforce/freecontrol

Folders and files

Latest commit

History

Repository files navigation

FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition

[Paper] [Project Page]

Overview

Changelog

Getting Started

Galley:

Contact

Reference

About

Resources

Stars

Watchers

Forks

Contributors 3

Languages