Zhentao Tan, Menglei Chai, Dongdong Chen, Jing Liao, Qi Chu, Lu Yuan, Sergey Tulyakov, Nenghai Yu
Despite the recent success of face image generation with GANs, conditional hair editing remains challenging due to the under-explored complexity of its geometry and appearance. In this paper, we present MichiGAN (Multi-Input-Conditioned Hair Image GAN), a novel conditional image generation method for interactive portrait hair manipulation. To provide user control over every major hair visual factor, we explicitly disentangle hair into four orthogonal attributes, including shape, structure, appearance, and background. For each of them, we design a corresponding condition module to represent, process, and convert user inputs, and modulate the image generation pipeline in ways that respect the natures of different visual attributes. All these condition modules are integrated with the backbone generator to form the final end-to-end network, which allows fully-conditioned hair generation from multiple user inputs. Building upon it, we also develop an interactive portrait hair editing system that enables straightforward manipulation of hair by projecting intuitive and high-level user inputs such as painted masks, guiding strokes, or reference photos to well-defined condition representations. Through extensive experiments and evaluations, we demonstrate the superiority of our method regarding both result quality and user controllability.
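As a purely illustrative sketch of what fully-conditioned generation looks like, the function below takes one tensor per disentangled attribute; the names and shapes are hypothetical and do not reflect the repository's actual API.

```python
import torch

# Purely illustrative: one input per disentangled attribute (hypothetical API).
def generate_hair(generator,
                  shape_mask,       # [1, 1, H, W] binary hair mask (shape)
                  orientation_map,  # [1, 2, H, W] dense orientation (structure)
                  appearance_ref,   # [1, 3, H, W] reference photo (appearance)
                  background_img):  # [1, 3, H, W] non-hair region (background)
    with torch.no_grad():
        return generator(shape_mask, orientation_map, appearance_ref, background_img)
```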
Clone this repo.
git clone https://github.com/tzt101/MichiGAN.git
cd MichiGAN/
This code requires PyTorch 1.0 and Python 3+. Please install the dependencies with
pip install -r requirements.txt
Please download the Synchronized-BatchNorm-PyTorch repository.
cd models/networks/
git clone https://github.com/vacancy/Synchronized-BatchNorm-PyTorch
cp -rf Synchronized-BatchNorm-PyTorch/sync_batchnorm .
cd ../../
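After the copy step, the module should be importable from the repository root; the quick sanity check below (assuming you run it from the MichiGAN/ directory) verifies that.

```python
# Run from the repository root after copying sync_batchnorm.
from models.networks.sync_batchnorm import SynchronizedBatchNorm2d
print(SynchronizedBatchNorm2d)  # should print the class instead of raising ImportError
```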
The FFHQ dataset can be downloaded from Baidu Netdisk with the extraction code ichc, or from OneDrive (RAR) or OneDrive (ZIP). You should specify the dataset root through --data_dir. Please follow the license when you use the FFHQ dataset.
Once the dataset is ready, the result images can be generated using pretrained models.
- Download the pretrained models from the Google Drive Folder and save them in 'checkpoints/MichiGAN/'. You can also download the pretrained models with the following commands:
cd checkpoints/MichiGAN/
bash download_model_G.sh
bash download_model_IG.sh
bash download_model_SIG.sh
- Generate a single image using the pretrained model.
python inference.py --name MichiGAN --gpu_ids 0 --inference_ref_name 67172 --inference_tag_name 67172 --inference_orient_name 67172 --netG spadeb --which_epoch 50 --use_encoder --noise_background --expand_mask_be --expand_th 5 --use_ig --load_size 512 --crop_size 512 --add_feat_zeros --data_dir [path_to_dataset]
- The output images are stored in ./inference_samples/ by default. If you just want to test the single sample image without downloading the whole dataset, please set --data_dir ./datasets/FFHQ_single/. We provide a sample image (67172) there.
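To run inference over several FFHQ image IDs in one go, a small wrapper script (not part of the repository, shown only as a convenience) can reuse the same flags:

```python
# Convenience wrapper (not part of the repo): loops inference.py over image IDs.
import subprocess

def run_inference(image_ids, data_dir):
    for img_id in image_ids:
        subprocess.run([
            'python', 'inference.py', '--name', 'MichiGAN', '--gpu_ids', '0',
            '--inference_ref_name', img_id, '--inference_tag_name', img_id,
            '--inference_orient_name', img_id, '--netG', 'spadeb',
            '--which_epoch', '50', '--use_encoder', '--noise_background',
            '--expand_mask_be', '--expand_th', '5', '--use_ig',
            '--load_size', '512', '--crop_size', '512', '--add_feat_zeros',
            '--data_dir', data_dir,
        ], check=True)

# Example: run_inference(['67172'], './datasets/FFHQ_single/')
# Outputs are written to ./inference_samples/ by default.
```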
New models can be trained with the following command.
python train.py --name [name_experiment] --batchSize 8 --no_confidence_loss --gpu_ids 0,1,2,3,4,5,6,7 --no_style_loss --no_rgb_loss --no_content_loss --use_encoder --wide_edge 2 --no_background_loss --noise_background --random_expand_mask --use_ig --load_size 568 --crop_size 512 --data_dir [path_to_dataset] --checkpoints_dir ./checkpoints
[name_experiment] is the directory name of the saved checkpoint files. If you want to train the model with the orientation inpainting model (with the option --use_ig), please first download the pretrained inpainting model from the Google Drive Folder and save it in ./checkpoints/[name_experiment]/.
You can directly run demo.py to use the interactive system. This UI code borrows from MaskGAN.
Once the image and the corresponding hair mask are provided, you can use the following command to extract the dense hair orientation map.
python cal_orientation.py --image_path [your image path] --hairmask_path [you hair mask path] --orientation_root [save root]
For ease of use, we have rewritten the original C++ code in Python. The results differ slightly from the C++ version, but this does not affect usage.
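For context, dense hair orientation maps of this kind are commonly estimated by filtering the image with a bank of oriented Gabor filters and taking a per-pixel argmax over the responses. The sketch below illustrates that general idea with OpenCV; it is not the implementation in cal_orientation.py, and the kernel sizes and parameters are assumptions.

```python
# Illustrative Gabor-filter-bank orientation estimation (not cal_orientation.py;
# kernel parameters below are assumptions chosen for thin, strand-like textures).
import cv2
import numpy as np

def dense_orientation(image_path, hair_mask_path, num_orients=32):
    gray = cv2.imread(image_path, cv2.IMREAD_GRAYSCALE).astype(np.float32)
    mask = cv2.imread(hair_mask_path, cv2.IMREAD_GRAYSCALE) > 0

    responses = []
    for k in range(num_orients):
        theta = np.pi * k / num_orients
        kern = cv2.getGaborKernel(ksize=(17, 17), sigma=4.0, theta=theta,
                                  lambd=8.0, gamma=0.5, psi=0.0)
        responses.append(np.abs(cv2.filter2D(gray, cv2.CV_32F, kern)))

    responses = np.stack(responses, axis=0)                  # [num_orients, H, W]
    best = np.argmax(responses, axis=0).astype(np.float32)   # strongest filter index
    orient = best * np.pi / num_orients                      # angle in [0, pi)
    orient[~mask] = 0.0                                      # zero outside the hair mask
    return orient

# orient_map = dense_orientation('67172.jpg', '67172_mask.png')
```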
- train.py, inference.py: the entry points for training and inference.
- trainers/pix2pix_trainer.py: harnesses and reports the progress of training.
- models/pix2pix_model.py: creates the networks and computes the losses.
- models/networks/: defines the architectures of all models.
- options/: creates option lists using the argparse package. More options are dynamically added in other files as well. Please see the section below.
- data/: defines the classes for loading data.
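As an illustration of how options get added dynamically (the SPADE-style pattern this codebase follows; the exact hook names and flags here are assumptions), a model or dataset class typically extends the shared parser through a static method:

```python
# Illustrative only: SPADE-style dynamic option registration.
# The actual method names and flags in this repository may differ.
import argparse

class ExampleModel:
    @staticmethod
    def modify_commandline_options(parser, is_train):
        # Each model/dataset contributes its own flags to the shared parser.
        parser.add_argument('--example_flag', type=int, default=0,
                            help='hypothetical option added by the model')
        return parser

parser = ExampleModel.modify_commandline_options(argparse.ArgumentParser(), is_train=True)
```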
If you use this code for your research, please cite our paper.
@article{tan2020michigan,
title={MichiGAN: Multi-Input-Conditioned Hair Image Generation for Portrait Editing},
author={Zhentao Tan and Menglei Chai and Dongdong Chen and Jing Liao and Qi Chu and Lu Yuan and Sergey Tulyakov and Nenghai Yu},
journal={ACM Transactions on Graphics (TOG)},
volume={39},
number={4},
pages={1--13},
year={2020},
publisher={ACM New York, NY, USA}
}
This code borrows heavily from SPADE. We thank Jiayuan Mao for his Synchronized Batch Normalization code.