awesome image-to-image translation papers

A collection of resources on image-to-image translation (constantly updating).

This repository is organized by category. Another version, organized chronologically by conference, is also available.

Contributing

If you think I have missed something or have any suggestions (papers, implementations, or other resources), feel free to open a pull request.

Feedback and contributions are welcome!

Tutorials

Unpaired Image-to-Image Translation. CVPR Tutorial on GANs (2018)

On Image-to-Image Translation. Stanford, MIT, Facebook, CUHK, SNU (2017)

Supervised

pix2pix: Image-to-Image Translation with Conditional Adversarial Networks.
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros.
CVPR 2017. [PDF] [Github]

BicycleGAN: Toward Multimodal Image-to-Image Translation.
Jun-Yan Zhu, Richard Zhang, Deepak Pathak, Trevor Darrell, Alexei A. Efros, Oliver Wang, Eli Shechtman.
NeurIPS 2017. [PDF] [Github]

PI-REC: Progressive Image Reconstruction Network With Edge and Color Domain.
Sheng You, Ning You, Minxue Pan.
arxiv, 25 Mar 2019. [PDF] [Github]

Unsupervised

Truly Unsupervised

Diverse Image Generation via Self-Conditioned GANs.
Steven Liu, Tongzhou Wang, David Bau, Jun-Yan Zhu, Antonio Torralba.
CVPR 2020. [PDF] [Project] [Github]

TUNIT: Rethinking the Truly Unsupervised Image-to-Image Translation.
Kyungjune Baek, Yunjey Choi, Youngjung Uh, Jaejun Yoo, Hyunjung Shim.
arxiv 2020. [PDF] [Github]

General

CycleGAN: Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks.
Jun-Yan Zhu, Taesung Park, Phillip Isola, Alexei A. Efros.
ICCV 2017. [PDF] [Github]

DiscoGAN: Learning to Discover Cross-Domain Relations with Generative Adversarial Networks.
Taeksoo Kim, Moonsu Cha, Hyunsoo Kim, Jung Kwon Lee, Jiwon Kim.
ICML 2017. [PDF] [Github]

DualGAN: Unsupervised Dual Learning for Image-to-Image Translation.
Zili Yi, Hao Zhang, Ping Tan, Minglun Gong.
ICCV 2017. [PDF] [Github]

DTN: Unsupervised Cross-Domain Image Generation.
Yaniv Taigman, Adam Polyak, Lior Wolf.
ICLR 2017. [PDF] [Github]

UNIT: Unsupervised image-to-image translation networks.
Ming-Yu Liu, Thomas Breuel, Jan Kautz.
NeurIPS 2017. [PDF] [Github]

DistanceGAN: One-Sided Unsupervised Domain Mapping.
Sagie Benaim, Lior Wolf.
NeurIPS 2017. [PDF] [Github]

TriangleGAN: Triangle Generative Adversarial Networks.
Zhe Gan, Liqun Chen, Weiyao Wang, Yunchen Pu, Yizhe Zhang, Hao Liu, Chunyuan Li, Lawrence Carin.
NeurIPS 2017. [PDF] [Github]

NAM: Non-Adversarial Unsupervised Domain Mapping.
Yedid Hoshen, Lior Wolf.
ECCV 2018. [PDF] [Github]

SCAN: Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks.
Minjun Li, Haozhi Huang, Lin Ma, Wei Liu, Tong Zhang, Yu-Gang Jiang.
ECCV 2018. [PDF]

GANimorph: Improved Shape Deformation in Unsupervised Image to Image Translation.
Aaron Gokaslan, Vivek Ramanujan, Daniel Ritchie, Kwang In Kim, James Tompkin.
ECCV 2018. [PDF] [Github]

OT-CycleGAN: Guiding the One-to-one Mapping in CycleGAN via Optimal Transport.
Guansong Lu, Zhiming Zhou, Yuxuan Song, Kan Ren, Yong Yu.
AAAI 2019. [PDF]

HarmonicGAN: Harmonic Unpaired Image-to-image Translation.
Rui Zhang, Tomas Pfister, Jia Li.
ICLR 2019. [PDF]

SDIT: Scalable and Diverse Cross-Domain Image Translation.
Yaxing Wang, Abel Gonzalez-Garcia, Joost van de Weijer, Luis Herranz.
ACM MM 2019. [PDF] [Github]

CrossNet: Latent Cross-Consistency for Unpaired Image Translation.
Omry Sendik, Dani Lischinski, Daniel Cohen-Or.
WACV 2020. [PDF]

Cross-Domain Cascaded Deep Feature Translation.
Oren Katzir, Dani Lischinski, Daniel Cohen-Or.
arxiv, 4 Jun 2019. [PDF]

Implicit Pairs for Boosting Unpaired Image-to-Image Translation.
Yiftach Ginger, Dov Danon, Hadar Averbuch-Elor, Daniel Cohen-Or.
arxiv, 15 Apr 2019. [PDF]

Unsupervised Shape Transformer for Image Translation and Cross-Domain Retrieval.
Kaili Wang, Liqian Ma, Jose Oramas M., Luc Van Gool, Tinne Tuytelaars.
arxiv, 5 Dec 2018. [PDF]

A Novel BiLevel Paradigm for Image-to-Image Translation.
Liqian Ma, Qianru Sun, Bernt Schiele, Luc Van Gool.
arxiv, 8 Apr 2019. [PDF]

Augmented Cyclic Consistency Regularization for Unpaired Image-to-Image Translation.
Takehiko Ohkawa, Naoto Inoue, Hirokatsu Kataoka, Nakamasa Inoue.
arxiv, 29 Feb 2020. [PDF]

NICE: Reusing Discriminators for Encoding: Towards Unsupervised Image-to-Image Translation.
Runfa Chen, Wenbing Huang, Binghui Huang, Fuchun Sun, Bin Fang.
CVPR 2020. [PDF] [Github]

Unpaired Image-to-Image Translation using Adversarial Consistency Loss.
Yihao Zhao, Ruihai Wu, Hao Dong.
arxiv, 10 Mar 2020. [PDF]

Optimal Unsupervised Domain Translation.
Emmanuel de Bézenac, Ibrahim Ayed, Patrick Gallinari.
ICML 2020 Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit Likelihood Models (INNF+). [PDF]

Towards Lifelong Self-Supervision For Unpaired Image-to-Image Translation.
Victor Schmidt, Makesh Narsimhan Sreedhar, Mostafa ElAraby, Irina Rish.
arxiv 2020. [PDF] [Github]

TuiGAN: Learning Versatile Image-to-Image Translation with Two Unpaired Images.
Jianxin Lin, Yingxue Pang, Yingce Xia, Zhibo Chen, Jiebo Luo.
arxiv 2020. [PDF]

Unpaired Image Translation via Adaptive Convolution-based Normalization.
Wonwoong Cho, Kangyeol Kim, Eungyeup Kim, Hyunwoo J. Kim, Jaegul Choo.
arxiv 2020. [PDF]

What and Where to Translate: Local Mask-based Image-to-Image Translation.
Wonwoong Cho, Seunghwan Choi, Junwoo Park, David Keetae Park, Tao Qin, Jaegul Choo.
arxiv 2020. [PDF]

Disentanglement

Guided Variational Autoencoder for Disentanglement Learning.
Zheng Ding, Yifan Xu, Weijian Xu, Gaurav Parmar, Yang Yang, Max Welling, Zhuowen Tu.
CVPR 2020. [PDF]

XGAN: Unsupervised Image-to-Image Translation for Many-to-Many Mappings.
Amélie Royer, Konstantinos Bousmalis, Stephan Gouws, Fred Bertsch, Inbar Mosseri, Forrester Cole, Kevin Murphy.
ICML 2018. [PDF] [Dataset]

ELEGANT: Exchanging Latent Encodings with GAN for Transferring Multiple Face Attributes.
Taihong Xiao, Jiapeng Hong, Jinwen Ma.
ECCV 2018. [PDF] [Github]

MUNIT: Multimodal Unsupervised Image-to-Image Translation.
Xun Huang, Ming-Yu Liu, Serge Belongie, Jan Kautz.
ECCV 2018. [PDF] [Github]

Image-to-Image Translation for Cross-Domain Disentanglement.
Abel Gonzalez-Garcia, Joost van de Weijer, Yoshua Bengio.
NeurIPS 2018. [PDF]

Conditional Image-to-Image Translation.
Jianxin Lin, Yingce Xia, Tao Qin, Zhibo Chen, Tie-Yan Liu.
CVPR 2018. [PDF]

EGSC-IT: Exemplar Guided Unsupervised Image-to-Image Translation with Semantic Consistency.
Liqian Ma, Xu Jia, Stamatios Georgoulis, Tinne Tuytelaars, Luc Van Gool.
ICLR 2019. [PDF] [Github]

PairedCycleGAN: Asymmetric Style Transfer for Applying and Removing Makeup.
Huiwen Chang, Jingwan Lu, Fisher Yu, Adam Finkelstein.
CVPR 2018. [PDF]

DRIT: Diverse Image-to-Image Translation via Disentangled Representations.
Hsin-Ying Lee, Hung-Yu Tseng, Jia-Bin Huang, Maneesh Kumar Singh, Ming-Hsuan Yang.
ECCV 2018. [PDF] [Github]

UFDN: A Unified Feature Disentangler for Multi-Domain Image Translation and Manipulation.
Alexander H. Liu, Yen-Cheng Liu, Yu-Ying Yeh, Yu-Chiang Frank Wang.
NeurIPS 2018. [PDF] [Github]

GDWTC: Image-to-Image Translation via Group-wise Deep Whitening and Coloring Transformation.
Wonwoong Cho, Sungha Choi, David Keetae Park, Inkyu Shin, Jaegul Choo.
CVPR 2019. [PDF] [Github]

DRIT++: Diverse Image-to-Image Translation via Disentangled Representations.
Hsin-Ying Lee, Hung-Yu Tseng, Qi Mao, Jia-Bin Huang, Yu-Ding Lu, Maneesh Singh, Ming-Hsuan Yang.
IJCV 2019. [PDF] [Project] [Github]

Multi-mapping Image-to-Image Translation via Learning Disentanglement.
Xiaoming Yu, Yuanqi Chen, Thomas Li, Shan Liu, Ge Li.
NeurIPS 2019. [PDF] [Github]

Flow-based Image-to-Image Translation with Feature Disentanglement.
Ruho Kondo, Keisuke Kawano, Satoshi Koide, Takuro Kutsuna.
NeurIPS 2019. [PDF]

DosGAN: Exploring Explicit Domain Supervision for Latent Space Disentanglement in Unpaired Image-to-Image Translation.
Jianxin Lin, Zhibo Chen, Yingce Xia, Sen Liu, Tao Qin, Jiebo Luo.
TPAMI 2019. [PDF] [Github]

Multi-Domain and Multi-Modal

Unsupervised multi-modal Styled Content Generation.
Omry Sendik, Dani Lischinski, Daniel Cohen-Or.
SIGGRAPH 2020. [PDF]

Multimodal Image Synthesis with Conditional Implicit Maximum Likelihood Estimation.
Ke Li, Shichong Peng, Tianhao Zhang, Jitendra Malik.
IJCV 2020. [PDF]

MSGAN: Mode Seeking Generative Adversarial Networks for Diverse Image Synthesis.
Qi Mao, Hsin-Ying Lee, Hung-Yu Tseng, Siwei Ma, Ming-Hsuan Yang.
CVPR 2019. [PDF] [Github]

GMM-UNIT: Unsupervised Multi-Domain and Multi-Modal Image-to-Image Translation via Attribute Gaussian Mixture Modeling.
Yahui Liu, Marco De Nadai, Jian Yao, Nicu Sebe, Bruno Lepri, Xavier Alameda-Pineda.
arxiv, 2020. [PDF]

StarGAN v2: Diverse Image Synthesis for Multiple Domains.
Yunjey Choi, Youngjung Uh, Jaejun Yoo, Jung-Woo Ha. Clova AI Research, NAVER Corp.
CVPR 2020. [PDF] [GitHub]

Attribute-Guided Face Generation Using Conditional CycleGAN.
Yongyi Lu, Yu-Wing Tai, Chi-Keung Tang.
ECCV 2018. [PDF]

StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation.
Yunjey Choi, Minje Choi, Munyoung Kim, Jung-Woo Ha, Sunghun Kim, Jaegul Choo.
CVPR 2018. [PDF] [Github]

AttGAN: Facial Attribute Editing by Only Changing What You Want.
Zhenliang He, Wangmeng Zuo, Meina Kan, Shiguang Shan, Xilin Chen.
TIP 2019. [PDF] [Github]

ComboGAN: Unrestrained Scalability for Image Domain Translation.
Asha Anoosheh, Eirikur Agustsson, Radu Timofte, Luc Van Gool.
CVPRW 2018. [PDF] [Github]

Augmented CycleGAN: Learning Many-to-Many Mappings from Unpaired Data.
Amjad Almahairi, Sai Rajeswar, Alessandro Sordoni, Philip Bachman, Aaron Courville.
ICML 2018. [PDF] [Github]

ModularGAN: Modular Generative Adversarial Networks.
Bo Zhao, Bo Chang, Zequn Jie, Leonid Sigal.
ECCV 2018. [PDF]

SG-GAN: Sparsely Grouped Multi-task Generative Adversarial Networks for Facial Attribute Manipulation.
Jichao Zhang, Yezhi Shu, Songhua Xu, Gongze Cao, Fan Zhong, Xueying Qin.
ACM MM 2018. [PDF] [Github]

SingleGAN: Image-to-Image Translation by a Single-Generator Network using Multiple Generative Adversarial Learning.
Xiaoming Yu, Xing Cai, Zhenqiang Ying, Thomas Li, Ge Li.
ACCV 2018. [PDF] [Github]

SMIT: Stochastic Multi-Label Image-to-Image Translation.
Andrés Romero, Pablo Arbeláez, Luc Van Gool, Radu Timofte.
ICCV Workshops 2019. [PDF] [Github]

Image-to-Image Translation with Multi-Path Consistency Regularization.
Jianxin Lin, Yingce Xia, Yijun Wang, Tao Qin, Zhibo Chen.
IJCAI 2019. [PDF]

RelGAN: Multi-Domain Image-to-Image Translation via Relative Attributes.
Po-Wei Wu, Yu-Jing Lin, Che-Han Chang, Edward Y. Chang, Shih-Wei Liao.
ICCV 2019. [PDF]

DMIT: Multi-mapping Image-to-Image Translation via Learning Disentanglement.
Xiaoming Yu, Yuanqi Chen, Thomas Li, Shan Liu, Ge Li.
NeurIPS 2019. [PDF] [Github]

ADSPM: Attribute-Driven Spontaneous Motion in Unpaired Image Translation.
Ruizheng Wu, Xin Tao, Xiaodong Gu, Xiaoyong Shen, Jiaya Jia.
ICCV 2019. [PDF] [Github]

CartoonRenderer: An Instance-based Multi-Style Cartoon Image Translator.
Yugang Chen, Muchun Chen, Chaoyue Song, Bingbing Ni.
International Conference on Multimedia Modeling (MMM 2020). [PDF]

injectionGAN: Toward Learning a Unified Many-to-Many Mapping for Diverse Image Translation.
Wenju Xu, Shawn Keshmiri, Guanghui Wang.
arxiv 2019. [PDF]

DCMIT: Unsupervised Multi-Domain Multimodal Image-to-Image Translation with Explicit Domain-Constrained Disentanglement.
Weihao Xia, Yujiu Yang, Jing-Hao Xue.
Neural Networks 2020. [PDF]

Guided Image-to-image Translation

Scene Graphs Guided

Semantic Image Manipulation Using Scene Graphs.
Helisa Dhamo, Azade Farshad, Iro Laina, Nassir Navab, Gregory D. Hager, Federico Tombari, Christian Rupprecht.
CVPR 2020. [PDF]

Texture Guided

TextureGAN: Controlling Deep Image Synthesis with Texture Patches.
Wenqi Xian, Patsorn Sangkloy, Varun Agrawal, Amit Raj, Jingwan Lu, Chen Fang, Fisher Yu, James Hays.
CVPR 2018. [PDF] [Github]

Guided Image-to-Image Translation with Bi-Directional Feature Transformation.
Badour AlBahar, Jia-Bin Huang.
ArXiv 2019. [PDF] [Github]

Amodal Map Guided

Self-Supervised Scene De-occlusion.
Xiaohang Zhan, Xingang Pan, Bo Dai, Ziwei Liu, Dahua Lin, and Chen Change Loy.
CVPR 2020. [PDF] [Github] [Project] [Demo]

Segmentation or Label Map Guided

World-Consistent Video-to-Video Synthesis.
Arun Mallya, Ting-Chun Wang, Karan Sapra, Ming-Yu Liu.
ECCV 2020. [PDF] [Github]

Example-Guided Image Synthesis across Arbitrary Scenes using Masked Spatial-Channel Attention and Self-Supervision.
Haitian Zheng, Haofu Liao, Lele Chen, Wei Xiong, Tianlang Chen, Jiebo Luo.
arxiv 2020. [PDF]

Attentive Normalization for Conditional Image Generation.
Yi Wang, Ying-Cong Chen, Xiangyu Zhang, Jian Sun, Jiaya Jia.
CVPR 2020. [PDF]

Edge Guided GANs with Semantic Preserving for Semantic Image Synthesis.
Hao Tang, Xiaojuan Qi, Dan Xu, Philip H. S. Torr, Nicu Sebe.
arxiv 2020. [PDF] [Github]

Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation.
Hao Tang, Dan Xu, Yan Yan, Philip H. S. Torr, Nicu Sebe.
CVPR 2020. [PDF] [Github]

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs.
Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, Bryan Catanzaro.
CVPR 2018. [PDF] [Github]

SPADE: Semantic Image Synthesis with Spatially-Adaptive Normalization.
Taesung Park, Ming-Yu Liu, Ting-Chun Wang, Jun-Yan Zhu.
CVPR 2019. [PDF] [Github]

SelectionGAN: Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation.
Hao Tang, Dan Xu, Nicu Sebe, Yanzhi Wang, Jason J. Corso, Yan Yan.
CVPR 2019. [PDF] [Github]

Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation.
Matteo Tomei, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara.
CVPR 2019. [PDF] [Github]

Mask-Guided Portrait Editing with Conditional GANs.
Shuyang Gu, Jianmin Bao, Hao Yang, Dong Chen, Fang Wen, Lu Yuan.
CVPR 2019. [PDF] [Github]

Semantic Bottleneck Scene Generation.
Samaneh Azadi, Michael Tschannen, Eric Tzeng, Sylvain Gelly, Trevor Darrell, Mario Lucic.
arxiv, 2019. [PDF]

Video Generation from Single Semantic Label Map.
Junting Pan, Chengyu Wang, Xu Jia, Jing Shao, Lu Sheng, Junjie Yan, Xiaogang Wang.
CVPR 2019. [PDF] [Github]

Video-to-Video Synthesis.
Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Guilin Liu, Andrew Tao, Jan Kautz, Bryan Catanzaro.
NeurIPS 2018. [PDF] [Github]

Few-shot Video-to-Video Synthesis.
Ting-Chun Wang, Ming-Yu Liu, Andrew Tao, Guilin Liu, Jan Kautz, Bryan Catanzaro.
NeurIPS 2019. [PDF] [Project]

Text Guided

Text-Guided Manipulation

SISGAN: Semantic Image Synthesis via Adversarial Learning.
Hao Dong, Simiao Yu, Chao Wu, Yike Guo.
ICCV 2017. [PDF] [Github]

Text-Adaptive Generative Adversarial Networks: Manipulating Images with Natural Language.
Seonghyeon Nam, Yunji Kim, Seon Joo Kim.
NeurIPS 2018. [PDF] [Github]

Lightweight Generative Adversarial Networks for Text-Guided Image Manipulation.
Bowen Li, Xiaojuan Qi, Philip Torr, Thomas Lukasiewicz.
NeurIPS 2020. [PDF]

ManiGAN: Text-Guided Image Manipulation.
Bowen Li, Xiaojuan Qi, Thomas Lukasiewicz, Philip H. S. Torr.
CVPR 2020. [PDF] [Github]

Generative Adversarial Text to Image Synthesis.
Scott Reed, Zeynep Akata, Xinchen Yan, Lajanugen Logeswaran, Bernt Schiele, Honglak Lee.
ICML 2016. [PDF] [Github]

Image-to-Image Translation with Text Guidance.
Bowen Li, Xiaojuan Qi, Philip H. S. Torr, Thomas Lukasiewicz.
arxiv, 12 Feb 2020. [PDF]

Neural Image Inpainting Guided with Descriptive Text.
Lisai Zhang, Qingcai Chen, Baotian Hu, Shuoran Jiang.
arxiv 2020. [PDF]

Text-to-Image Generation

ControlGAN: Controllable Text-to-Image Generation.
Bowen Li, Xiaojuan Qi, Thomas Lukasiewicz, Philip H. S. Torr.
NeurIPS 2019. [PDF] [Github]

RiFeGAN: Rich Feature Generation for Text-to-Image Synthesis From Prior Knowledge.
Jun Cheng, Fuxiang Wu, Yanling Tian, Lei Wang, Dapeng Tao.
CVPR 2020. [PDF]

MirrorGAN: Learning Text-to-image Generation by Redescription.
Tingting Qiao, Jing Zhang, Duanqing Xu, Dacheng Tao.
CVPR 2019. [PDF] [Unofficial TensorFlow]

AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks.
Tao Xu, Pengchuan Zhang, Qiuyuan Huang, Han Zhang, Zhe Gan, Xiaolei Huang, Xiaodong He.
CVPR 2018. [PDF] [Github]

DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis.
Minfeng Zhu, Pingbo Pan, Wei Chen, Yi Yang.
CVPR 2019. [PDF] [Github]

StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks.
Han Zhang, Tao Xu, Hongsheng Li, Shaoting Zhang, Xiaogang Wang, Xiaolei Huang, Dimitris Metaxas.
TPAMI 2018. [PDF] [Github]

StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks.
Han Zhang, Tao Xu, Hongsheng Li, Shaoting Zhang, Xiaogang Wang, Xiaolei Huang, Dimitris Metaxas.
ICCV 2017. [PDF] [Github]

CPGAN: Content-Parsing Generative Adversarial Networks for Text-to-Image Synthesis.
Jiadong Liang, Wenjie Pei, Feng Lu.
ECCV 2020. [PDF] [Github]

DF-GAN: Deep Fusion Generative Adversarial Networks for Text-to-Image Synthesis.
Ming Tao, Hao Tang, Songsong Wu, Nicu Sebe, Fei Wu, Xiao-Yuan Jing.
TMM 2020. [PDF]

GeNeVA-GAN: Tell, Draw, and Repeat: Generating and Modifying Images Based on Continual Linguistic Instruction.
Alaaeldin El-Nouby, Shikhar Sharma, Hannes Schulz, Devon Hjelm, Layla El Asri, Samira Ebrahimi Kahou, Yoshua Bengio, Graham W. Taylor.
ICCV 2019. [PDF] [Github]

LeicaGAN: Learn, Imagine and Create: Text-to-Image Generation from Prior Knowledge.
Tingting Qiao, Jing Zhang, Duanqing Xu, Dacheng Tao.
NeurIPS 2019. [PDF] [Github]

HD-GAN: Photographic Text-to-Image Synthesis with a Hierarchically-nested Adversarial Network.
Zizhao Zhang, Yuanpu Xie, Lin Yang.
CVPR 2018. [PDF] [Github]

Adversarial Synthesis of Human Pose from Text.
Yifei Zhang, Rania Briq, Julian Tanke, Juergen Gall.
arxiv 2020. [PDF]

Semantic Object Accuracy for Generative Text-to-Image Synthesis.
Tobias Hinz, Stefan Heinrich, Stefan Wermter.
arxiv 2020. [PDF] [Github]

3DLSN: End-to-End Optimization of Scene Layout.
Andrew Luo, Zhoutong Zhang, Jiajun Wu, Joshua B. Tenenbaum.
CVPR 2020. [PDF] [Project]

CookGAN: Meal Image Synthesis from Ingredients.
Fangda Han, Ricardo Guerrero, Vladimir Pavlovic.
WACV 2020. [PDF]

Object-driven Text-to-Image Synthesis via Adversarial Training.
Wenbo Li, Pengchuan Zhang, Lei Zhang, Qiuyuan Huang, Xiaodong He, Siwei Lyu, Jianfeng Gao.
CVPR 2019. [PDF]

SwapText: Image Based Texts Transfer in Scenes.
Qiangpeng Yang, Hongsheng Jin, Jun Huang, Wei Lin.
CVPR 2020. [PDF]

Local-Global Video-Text Interactions for Temporal Grounding.
Jonghwan Mun, Minsu Cho, Bohyung Han.
CVPR 2020. [PDF] [Github]

Cycle Text-To-Image GAN with BERT.
Trevor Tsue, Samir Sen, Jason Li.
arxiv 2020. [PDF] [Github]

Learning Deep Representations of Fine-grained Visual Descriptions.
Scott Reed, Zeynep Akata, Bernt Schiele, Honglak Lee.
CVPR 2016. [PDF] [Github]

Keypoint or Landmark Guided

ADGAN: Controllable Person Image Synthesis with Attribute-Decomposed GAN.
Yifang Men, Yiming Mao, Yuning Jiang, Wei-Ying Ma, Zhouhui Lian.
CVPR 2020. [PDF] [Project] [Github]

AGUIT: Attribute Guided Unpaired Image-to-Image Translation with Semi-supervised Learning.
Xinyang Li, Jie Hu, Shengchuan Zhang, Xiaopeng Hong, Qixiang Ye, Chenglin Wu, Rongrong Ji.
arxiv, 29 Apr 2019. [PDF] [Github]

Geometry Guided Adversarial Facial Expression Synthesis.
Lingxiao Song, Zhihe Lu, Ran He, Zhenan Sun, Tieniu Tan.
MM 2018. [PDF]

C2-GAN: Cycle In Cycle Generative Adversarial Networks for Keypoint-Guided Image Generation.
Hao Tang, Dan Xu, Gaowen Liu, Wei Wang, Nicu Sebe, Yan Yan.
MM 2019. [PDF] [Github]

Few-Shot Adversarial Learning of Realistic Neural Talking Head Models.
Egor Zakharov, Aliaksandra Shysheya, Egor Burkov, Victor Lempitsky.
ICCV 2019. [PDF] [Github]

Pose and Skeleton Guided

Deep Spatial Transformation for Pose-Guided Person Image Generation and Animation.
Yurui Ren, Ge Li, Shan Liu, Thomas H. Li.
arxiv 2020. [PDF] [Github]

Pose Manipulation with Identity Preservation.
Andrei-Timotei Ardelean, Lucian Mircea Sasu.
International Journal of Computers Communications & Control 2020. [PDF]

Pose Guided Person Image Generation.
Liqian Ma, Xu Jia, Qianru Sun, Bernt Schiele, Tinne Tuytelaars, Luc Van Gool.
NeurIPS 2017. [PDF] [Github]

Deformable GANs for Pose-based Human Image Generation.
Aliaksandr Siarohin, Enver Sangineto, Stephane Lathuiliere, Nicu Sebe.
CVPR 2018. [PDF] [Github]

A Variational U-Net for Conditional Appearance and Shape Generation.
Patrick Esser, Ekaterina Sutter, Björn Ommer.
CVPR 2018. [PDF] [Github]

Progressive Pose Attention Transfer for Person Image Generation.
Zhen Zhu, Tengteng Huang, Baoguang Shi, Miao Yu, Bofei Wang, Xiang Bai.
CVPR 2019. [PDF] [Github]

Everybody Dance Now.
Caroline Chan, Shiry Ginosar, Tinghui Zhou, Alexei A. Efros.
ECCVW 2018. [PDF] [Project]

Disentangled Person Image Generation.
Liqian Ma, Qianru Sun, Stamatios Georgoulis, Luc Van Gool, Bernt Schiele, Mario Fritz.
CVPR 2018. [PDF]

Mask Guided

InstaGAN: Instance-aware image-to-image translation.
Zhiqiang Shen, Mingyang Huang, Jianping Shi, Xiangyang Xue, Thomas Huang.
ICLR 2019. [PDF] [Github]

ContrastGAN: Generative Semantic Manipulation with Mask-Contrasting GAN.
Xiaodan Liang, Hao Zhang, Eric P. Xing.
ECCV 2018. [PDF]

INIT: Towards Instance-level Image-to-Image Translation.
Zhiqiang Shen, Mingyang Huang, Jianping Shi, Xiangyang Xue, Thomas Huang.
CVPR 2019. [PDF] [Project]

Exemplar Guided

Controllable Descendant Face Synthesis.
Yong Zhang, Le Li, Zhilei Liu, Baoyuan Wu, Yanbo Fan, Zhifeng Li.
arxiv 2020. [PDF]

pix2pixSC: Example-Guided Style Consistent Image Synthesis from Semantic Labeling.
Miao Wang, Guo-Ye Yang, Ruilong Li, Run-Ze Liang, Song-Hai Zhang, Peter M. Hall, Shi-Min Hu.
CVPR 2019. [PDF] [Github] [Project]

EGSC-IT: Exemplar Guided Unsupervised Image-to-Image Translation with Semantic Consistency.
Liqian Ma, Xu Jia, Stamatios Georgoulis, Tinne Tuytelaars, Luc Van Gool.
ICLR 2019. [PDF] [Github]

DA-GAN: Instance-level Image Translation by Deep Attention Generative Adversarial Networks.
Shuang Ma, Jianlong Fu, Chang Wen Chen, Tao Mei.
CVPR 2018. [PDF]

Stylizing Video by Example.
Ondřej Jamriška, Šárka Sochorová, Ondřej Texler, Michal Lukáč, Jakub Fišer, Jingwan Lu, Eli Shechtman, Daniel Sýkora.
SIGGRAPH 2019. [PDF]

Attention Guided

Attention-GAN for Object Transfiguration in Wild Images.
Xinyuan Chen, Chang Xu, Xiaokang Yang, Dacheng Tao.
ECCV 2018. [PDF]

Unsupervised Attention-guided Image to Image Translation.
Youssef A. Mejjati, Christian Richardt, James Tompkin, Darren Cosker, Kwang In Kim.
NeurIPS 2018. [PDF] [Github]

Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention.
Chao Yang, Taehwan Kim, Ruizhe Wang, Hao Peng, C.-C. Jay Kuo.
TIP 2019. [PDF]

U-GAT-IT: Unsupervised Generative Attentional Networks with Adaptive Layer-Instance Normalization for Image-to-Image Translation.
Junho Kim, Minjae Kim, Hyeonwoo Kang, Kwanghee Lee.
ICLR 2020. [PDF] [TensorFlow or Pytorch]

SPA-GAN: Spatial Attention GAN for Image-to-Image Translation.
Hajar Emami, Majid Moradi Aliabadi, Ming Dong, Ratna Babu Chinnam.
arxiv 2019. [PDF]

Layout or Structure Guided

Structural-analogy from a Single Image Pair.
Sagie Benaim, Ron Mokady, Amit Bermano, Daniel Cohen-Or, Lior Wolf.
arxiv 2020. [PDF]

BachGAN: High-Resolution Image Synthesis from Salient Object Layout.
Yandong Li, Yu Cheng, Zhe Gan, Licheng Yu, Liqiang Wang, Jingjing Liu.
CVPR 2020. [PDF] [Github]

LayoutGAN: Generating Graphic Layouts with Wireframe Discriminator.
Jianan Li, Jimei Yang, Aaron Hertzmann, Jianming Zhang, Tingfa Xu.
ICLR 2019. [PDF]

Text2Scene: Generating Compositional Scenes from Textual Descriptions.
Fuwen Tan, Song Feng, Vicente Ordonez.
CVPR 2019. [PDF] [Github]

Image Generation from Layout.
Bo Zhao, Lili Meng, Weidong Yin, Leonid Sigal.
CVPR 2019. [PDF]

Generating Multiple Objects at Spatially Distinct Locations.
Tobias Hinz, Stefan Heinrich, Stefan Wermter.
ICLR 2019. [PDF] [Github]

LostGANs: Image Synthesis From Reconfigurable Layout and Style.
Wei Sun, Tianfu Wu.
ICCV 2019. [PDF] [Github]

Learning to Predict Layout-to-image Conditional Convolutions for Semantic Image Synthesis.
Xihui Liu, Guojun Yin, Jing Shao, Xiaogang Wang, Hongsheng Li.
NeurIPS 2019. [PDF]

Dialog Guided

ChatPainter: Improving Text to Image Generation using Dialogue.
Shikhar Sharma, Dendi Suhubdy, Vincent Michalski, Samira Ebrahimi Kahou, Yoshua Bengio.
ICLRW 2018. [PDF]

Keep Drawing It: Iterative language-based image generation and editing.
Alaaeldin El-Nouby, Shikhar Sharma, Hannes Schulz, Devon Hjelm, Layla El Asri, Samira Ebrahimi Kahou, Yoshua Bengio, Graham W. Taylor.
NIPSW 2018. [PDF] [CLEVR dataset]

Tell, Draw, and Repeat: Generating and Modifying Images Based on Continual Linguistic Instruction.
Alaaeldin El-Nouby, Shikhar Sharma, Hannes Schulz, Devon Hjelm, Layla El Asri, Samira Ebrahimi Kahou, Yoshua Bengio, Graham W. Taylor.
ICCV 2019. [PDF]

Sequential Attention GAN for Interactive Image Editing via Dialogue.
Yu Cheng, Zhe Gan, Yitong Li, Jingjing Liu, Jianfeng Gao.
arxiv, 2019. [PDF]

Chat-crowd: A Dialog-based Platform for Visual Layout Composition.
Paola Cascante-Bonilla, Xuwang Yin, Vicente Ordonez, Song Feng.
arxiv, 2018. [PDF] [Github]

CoDraw: Collaborative Drawing as a Testbed for Grounded Goal-driven Communication.
Jin-Hwa Kim, Nikita Kitaev, Xinlei Chen, Marcus Rohrbach, Byoung-Tak Zhang, Yuandong Tian, Dhruv Batra, Devi Parikh.
ACL 2019. [PDF] [CoDraw Dataset]

Audio Guided

X2Face: A Network for Controlling Face Generation by Using Images, Audio and Pose Codes.
Olivia Wiles, A. Sophia Koepke, Andrew Zisserman.
ECCV 2018. [PDF] [Github]

Continuous Change

GANHopper: Multi-Hop GAN for Unsupervised Image-to-Image Translation.
Wallace Lira, Johannes Merz, Daniel Ritchie, Daniel Cohen-Or, Hao Zhang.
ECCV 2020. [PDF]

Homomorphic Latent Space Interpolation for Unpaired Image-To-Image Translation.
Ying-Cong Chen, Xiaogang Xu, Zhuotao Tian, Jiaya Jia.
CVPR 2019. [PDF] [Github]

GANimation: Anatomically-aware Facial Animation from a Single Image.
Albert Pumarola, Antonio Agudo, Aleix M. Martinez, Alberto Sanfeliu, Francesc Moreno-Noguer.
ECCV 2018. [PDF] [Github]

Video

World-Consistent Video-to-Video Synthesis.
Arun Mallya, Ting-Chun Wang, Karan Sapra, Ming-Yu Liu.
ECCV 2020. [PDF] [Github]

Line Art Correlation Matching Network for Automatic Animation Colorization.
Zhang Qian, Wang Bo, Wen Wei, Li Hai, Liu Jun Hui.
arxiv 2020. [PDF]

Unsupervised Multimodal Video-to-Video Translation via Self-Supervised Learning.
Kangning Liu, Shuhang Gu, Andres Romero, Radu Timofte.
arxiv 2020. [PDF]

Unsupervised Video-to-Video Translation via Self-Supervised Learning.
Kangning Liu, Shuhang Gu, Radu Timofte.
2020. [PDF]

Video-to-Video Synthesis.
Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Guilin Liu, Andrew Tao, Jan Kautz, Bryan Catanzaro.
NeurIPS 2018. [PDF] [Github]

Everybody Dance Now.
Caroline Chan, Shiry Ginosar, Tinghui Zhou, Alexei A. Efros.
ECCVW 2018. [PDF] [Project]

Preserving Semantic and Temporal Consistency for Unpaired Video-to-Video Translation.
Kwanyong Park, Sanghyun Woo, Dahun Kim, Donghyeon Cho, In So Kweon.
ACM MM 2019. [PDF]

Mocycle-GAN: Unpaired Video-to-Video Translation.
Yang Chen, Yingwei Pan, Ting Yao, Xinmei Tian, Tao Mei.
ACM MM 2019. [PDF]

Recycle-GAN: Unsupervised Video Retargeting.
Aayush Bansal, Shugao Ma, Deva Ramanan, Yaser Sheikh.
ECCV 2018. [PDF] [Github]

Few-shot Video-to-Video Synthesis.
Ting-Chun Wang, Ming-Yu Liu, Andrew Tao, Guilin Liu, Jan Kautz, Bryan Catanzaro.
NeurIPS 2019. [PDF] [Project] [Github]

Few-Shot

Semi-supervised Learning for Few-shot Image-to-Image Translation.
Yaxing Wang, Salman Khan, Abel Gonzalez-Garcia, Joost van de Weijer, Fahad Shahbaz Khan.
CVPR 2020. [PDF] [Github]

FUNIT: Few-Shot Unsupervised Image-to-Image Translation.
Ming-Yu Liu, Xun Huang, Arun Mallya, Tero Karras, Timo Aila, Jaakko Lehtinen, Jan Kautz.
ICCV 2019. [PDF] [Project] [Github]

Semi Few-Shot Attribute Translation.
Ricard Durall, Franz-Josef Pfreundt, Janis Keuper.
arxiv 2019. [PDF]

ZstGAN: An Adversarial Approach for Unsupervised Zero-Shot Image-to-Image Translation.
Jianxin Lin, Yingce Xia, Sen Liu, Tao Qin, Zhibo Chen.
arxiv 2019. [PDF] [Github]

MetaPix: Few-Shot Video Retargeting.
Jessica Lee, Deva Ramanan, Rohit Girdhar.
ICCV 2019. [PDF] [Project] [Github]

Few-shot Video-to-Video Synthesis.
Ting-Chun Wang, Ming-Yu Liu, Andrew Tao, Guilin Liu, Jan Kautz, Bryan Catanzaro.
NeurIPS 2019. [PDF] [Project]

Applications

Photo Animation

AutoToon: Automatic Geometric Warping for Face Cartoon Generation.
Julia Gong, Yannick Hold-Geoffroy, Jingwan Lu.
WACV 2020. [PDF]

Learning to Cartoonize Using White-box Cartoon Representations.
Xinrui Wang, Jinze Yu.
CVPR 2020. [PDF] [Project] [Github]

AnimeGAN: A Novel Lightweight GAN For Photo Animation.
2020. [Github] [Dataset]

ComixGAN: Generative Adversarial Network For Transferring Images To Comics.
[Github]

CartoonGAN: Generative Adversarial Networks for Photo Cartoonization.
Yang Chen, Yu-Kun Lai, Yong-Jin Liu.
CVPR 2018. [PDF] [Pytorch] [TensorFlow]

Image Restoration

Slide-free MUSE Microscopy to H&E Histology Modality Conversion via Unpaired Image-to-Image Translation GAN Models.
Tanishq Abraham, Andrew Shaw, Daniel O'Connor, Austin Todd, Richard Levenson.
ICMLW 2020. [PDF]

Bringing Old Photos Back to Life.
Ziyu Wan, Bo Zhang, Dongdong Chen, Pan Zhang, Dong Chen, Jing Liao, Fang Wen.
CVPR 2020. [PDF] [Project]

Reference-Based Sketch Image Colorization using Augmented-Self Reference and Dense Semantic Correspondence.
Junsoo Lee, Eungyeup Kim, Yunsung Lee, Dongjun Kim, Jaehyuk Chang, Jaegul Choo.
CVPR 2020. [PDF]

Pose Agnostic Cross-spectral Hallucination via Disentangling Independent Factors.
Boyan Duan, Chaoyou Fu, Yi Li, Xingguang Song, Ran He.
CVPR 2020. [PDF]

Learning Invariant Representation for Unsupervised Image Restoration.
Wenchao Du, Hu Chen, Hongyu Yang.
CVPR 2020. [PDF]

Unsupervised Domain-Specific Deblurring via Disentangled Representations.
Boyu Lu, Jun-Cheng Chen, Rama Chellappa.
CVPR 2019. [PDF] [Github]

Image Synthesis

SEAN: Image Synthesis with Semantic Region-Adaptive Normalization.
Peihao Zhu, Rameen Abdal, Yipeng Qin, Peter Wonka.
CVPR 2020. [PDF] [Video] [Github]

Face-to-Parameter Translation for Game Character Auto-Creation.
Tianyang Shi, Yi Yuan, Changjie Fan, Zhengxia Zou, Zhenwei Shi, Yong Liu.
ICCV 2019. [PDF]

APDrawingGAN: Generating Artistic Portrait Drawings from Face Photos with Hierarchical GANs.
Ran Yi, Yong-Jin Liu, Yu-Kun Lai, Paul L. Rosin.
CVPR 2019. [PDF] [Github] [Online Demo]

Cascaded Generation of High-quality Color Visible Face Images from Thermal Captures.
Naser Damer, Fadi Boutros, Khawla Mallat, Florian Kirchbuchner, Jean-Luc Dugelay, Arjan Kuijper.
arxiv 2019. [PDF]

CartoonGAN: Generative Adversarial Networks for Photo Cartoonization.
Yang Chen, Yu-Kun Lai, Yong-Jin Liu.
CVPR 2018. [PDF] [Github] [unofficial test] [unofficial pytorch]

Retargeting and 3D Vision

Render4Completion: Synthesizing Multi-View Depth Maps for 3D Shape Completion.
Tao Hu, Zhizhong Han, Abhinav Shrivastava, Matthias Zwicker.
ICCV 2019 workshop on Geometry meets Deep Learning. [PDF]

Multi-Garment Net: Learning to Dress 3D People from Images.
Bharat Lal Bhatnagar, Garvita Tiwari, Christian Theobalt, Gerard Pons-Moll.
ICCV 2019. [PDF] [Github]

Tex2Shape: Detailed Full Human Body Geometry From a Single Image.
Thiemo Alldieck, Gerard Pons-Moll, Christian Theobalt, Marcus Magnor.
ICCV 2019. [arxiv] [PDF] [Github]

pix2vertex: Unrestricted facial geometry reconstruction using image-to-image translation.
Matan Sela, Elad Richardson, Ron Kimmel.
arxiv, 2017. [PDF] [Github]

Learning to Reconstruct People in Clothing from a Single RGB Camera.
Thiemo Alldieck, Marcus Magnor, Bharat Lal Bhatnagar, Christian Theobalt, Gerard Pons-Moll.
CVPR 2019. [PDF] [Github]

360-Degree Textures of People in Clothing from a Single Image.
Verica Lazova, Eldar Insafutdinov, Gerard Pons-Moll.
3DV 2019. [PDF] [Project]

SelectionGAN: Multi-Channel Attention Selection GAN with Cascaded Semantic Guidance for Cross-View Image Translation.
Hao Tang, Dan Xu, Nicu Sebe, Yanzhi Wang, Jason J. Corso, Yan Yan.
CVPR 2019. [PDF] [Github]

Multi-Channel Attention Selection GANs for Guided Image-to-Image Translation.
Hao Tang, Dan Xu, Yan Yan, Jason J. Corso, Philip H.S. Torr, Nicu Sebe.
arxiv 2020. [PDF]

VR Facial Animation via Multiview Image Translation.
Shih-En Wei, Jason Saragih, Tomas Simon, Adam W. Harley, Stephen Lombardi, Michal Perdoch, Alexander Hypes, Dawei Wang, Hernan Badino, Yaser Sheikh.
SIGGRAPH 2019. [PDF]

Attribute Editing

Lifespan Age Transformation Synthesis.
Roy Or-El, Soumyadip Sengupta, Ohad Fried, Eli Shechtman, Ira Kemelmacher-Shlizerman.
ECCV 2020. [PDF] [Github] [Project] [FFHQ-Aging-Dataset]

LEED: Label-Free Expression Editing via Disentanglement.
Rongliang Wu, Shijian Lu.
ECCV 2020. [PDF]

PA-GAN: Progressive Attention Generative Adversarial Network for Facial Attribute Editing.
Zhenliang He, Meina Kan, Jichao Zhang, Shiguang Shan.
arxiv 2020. [PDF] [Github]

Facial Expression Editing with Continuous Emotion Labels.
Alexandra Lindt, Pablo Barros, Henrique Siqueira, Stefan Wermter.
FG 2019. [PDF]

Disentangled and Controllable Face Image Generation via 3D Imitative-Contrastive Learning.
Yu Deng, Jiaolong Yang, Dong Chen, Fang Wen, Xin Tong.
CVPR 2020. [PDF]

High Resolution Face Age Editing.
Xu Yao, Gilles Puy, Alasdair Newson, Yann Gousseau, Pierre Hellier.
arxiv 2020. [PDF] [Github]

Intuitive, Interactive Beard and Hair Synthesis with Generative Models.
Kyle Olszewski, Duygu Ceylan, Jun Xing, Jose Echevarria, Zhili Chen, Weikai Chen, Hao Li.
CVPR 2020. [PDF]

IcGAN: Invertible Conditional GANs for image editing.
Guim Perarnau, Joost van de Weijer, Bogdan Raducanu, Jose M. Álvarez.
NeurIPS Workshop 2016. [PDF] [Github]

Smart, Sparse Contours to Represent and Edit Images.
Tali Dekel, Chuang Gan, Dilip Krishnan, Ce Liu, William T. Freeman.
CVPR 2018. [PDF] [Project]

AGUIT: Attribute Guided Unpaired Image-to-Image Translation with Semi-supervised Learning.
Xinyang Li, Jie Hu, Shengchuan Zhang, Xiaopeng Hong, Qixiang Ye, Chenglin Wu, Rongrong Ji.
arxiv, 29 Apr 2019. [PDF] [Github]

BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network.
Tingting Li, Ruihe Qian, Chao Dong, Si Liu, Qiong Yan, Wenwu Zhu, Liang Lin.
ACM MM 2018. [PDF] [Github] [Project]

UFDN: A Unified Feature Disentangler for Multi-Domain Image Translation and Manipulation.
Alexander H. Liu, Yen-Cheng Liu, Yu-Ying Yeh, Yu-Chiang Frank Wang.
NeurIPS 2018. [PDF] [Github]

ELEGANT: Exchanging Latent Encodings with GAN for Transferring Multiple Face Attributes.
Taihong Xiao, Jiapeng Hong, Jinwen Ma.
ECCV 2018. [PDF] [Github]

Biphasic-GAN: Biphasic Learning of GANs for High-Resolution Image-to-Image Translation.
Jie Cao, Huaibo Huang, Yi Li, Jingtuo Liu, Ran He, Zhenan Sun.
arxiv 2019. [PDF]

High Fidelity Face Manipulation with Extreme Pose and Expression.
Chaoyou Fu, Yibo Hu, Xiang Wu, Guoli Wang, Qian Zhang, Ran He.
arxiv 2019. [PDF]

Make a Face: Towards Arbitrary High Fidelity Face Manipulation.
Shengju Qian, Kwan-Yee Lin, Wayne Wu, Yangxiaokang Liu, Quan Wang, Fumin Shen, Chen Qian, Ran He.
ICCV 2019. [PDF]

SliderGAN: Synthesizing Expressive Face Images by Sliding 3D Blendshape Parameters.
Evangelos Ververas, Stefanos Zafeiriou.
arxiv 2019. [PDF]

Generating High-Resolution Fashion Model Images Wearing Custom Outfits.
Gökhan Yildirim, Nikolay Jetchev, Roland Vollgraf, Urs Bergmann.
Workshop on Computer Vision for Fashion, Art and Design, ICCV 2019. [PDF]

Data Augmentation

Generative Image Translation for Data Augmentation in Colorectal Histopathology Images.
NeurIPS 2019 Machine Learning for Health Workshop. [PDF] [Project]

DG-Net: Joint Discriminative and Generative Learning for Person Re-identification.
Zhedong Zheng, Xiaodong Yang, Zhiding Yu, Liang Zheng, Yi Yang, Jan Kautz.
CVPR 2019. [PDF] [Github]

Compressed Sensing using Generative Models.
Ashish Bora, Ajil Jalal, Eric Price, Alexandros G. Dimakis.
arxiv 2017. [PDF] [Github]

Person Transfer GAN to Bridge Domain Gap for Person Re-Identification.
Longhui Wei, Shiliang Zhang, Wen Gao, Qi Tian.
CVPR 2018. [PDF]

Image-Image Domain Adaptation with Preserved Self-Similarity and Domain-Dissimilarity for Person Re-identification.
Weijian Deng, Liang Zheng, Qixiang Ye, Guoliang Kang, Yi Yang, Jianbin Jiao.
CVPR 2018. [PDF]

Model Compression and Pruning

GAN Compression: Efficient Architectures for Interactive Conditional GANs.
Muyang Li, Ji Lin, Yaoyao Ding, Zhijian Liu, Jun-Yan Zhu, Song Han.
CVPR 2020. [PDF] [Demo] [Github]

Co-Evolutionary Compression for Unpaired Image Translation.
Han Shu, Yunhe Wang, Xu Jia, Kai Han, Hanting Chen, Chunjing Xu, Qi Tian, Chang Xu.
ICCV 2019. [PDF] [Github]

Adversarial Examples

Disrupting DeepFakes: Adversarial Attacks Against Conditional Image Translation Networks and Facial Manipulation Systems.
Nataniel Ruiz, Stan Sclaroff.
arxiv, 3 Mar 2020. [PDF]

Adversarial Self-Defense for Cycle-Consistent GANs.
Dina Bashkirova, Ben Usman, Kate Saenko.
NeurIPS 2019. [PDF]

Imbalanced Data

Elastic-InfoGAN: Unsupervised Disentangled Representation Learning in Imbalanced Data.
Utkarsh Ojha, Krishna Kumar Singh, Cho-Jui Hsieh, Yong Jae Lee.
arxiv, 1 Oct 2019. [PDF]

Datasets

Please cite the corresponding papers if you use the data.

pix2pix Datasets

Some datasets can also be downloaded manually from the website or automatically using the following script:

python download-dataset.py datasetname

facades: 400 images from CMP Facades dataset. (31MB)
sketch: http://mmlab.ie.cuhk.edu.hk/archive/cufsf/
oil-chinese: http://www.cs.mun.ca/~yz7241/dataset/
day-night: http://www.cs.mun.ca/~yz7241/dataset/
facades: 400 images from CMP Facades dataset. [Citation]
cityscapes: 2975 images from the Cityscapes training set. [Citation]
maps: 1096 training images scraped from Google Maps.
edges2shoes: 50k training images from UT Zappos50K dataset. Edges are computed by HED edge detector + post-processing. [Citation]
edges2handbags: 137K Amazon Handbag images from iGAN project. Edges are computed by HED edge detector + post-processing. [Citation]
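
The download command above takes a single dataset name. As a minimal sketch of how it might be wrapped, the helper below builds the argument list for a given name and rejects names that are not in the list. The helper name build_download_command is hypothetical, and it is an assumption that the script accepts every name shown here; the source only gives the command form and notes that some datasets are downloaded manually from a website.

```python
from typing import List

# Names copied from the list above. Whether download-dataset.py accepts
# every one of them is an assumption; some entries only link to a website.
PIX2PIX_DATASETS = (
    "facades",
    "sketch",
    "oil-chinese",
    "day-night",
    "cityscapes",
    "maps",
    "edges2shoes",
    "edges2handbags",
)


def build_download_command(name: str) -> List[str]:
    """Return the argv list for `python download-dataset.py <name>`.

    Hypothetical helper: it only validates the name against the list above
    and does not run anything.
    """
    if name not in PIX2PIX_DATASETS:
        raise ValueError(f"unknown dataset name: {name!r}")
    return ["python", "download-dataset.py", name]


print(build_download_command("facades"))
```

The returned list can be passed to subprocess.run(cmd, check=True) from the directory that contains the script.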

CycleGAN Datasets

facades: 400 images from the CMP Facades dataset. [Citation]
cityscapes: 2975 images from the Cityscapes training set. [Citation]
maps: 1096 training images scraped from Google Maps.
horse2zebra: 939 horse images and 1177 zebra images downloaded from ImageNet using keywords wild horse and zebra.
apple2orange: 996 apple images and 1020 orange images downloaded from ImageNet using keywords apple and navel orange.
summer2winter_yosemite: 1273 summer Yosemite images and 854 winter Yosemite images downloaded using the Flickr API. See the CycleGAN paper for more details.
monet2photo, vangogh2photo, ukiyoe2photo, cezanne2photo: The art images were downloaded from Wikiart. The real photos are downloaded from Flickr using the combination of the tags landscape and landscapephotography. The training set size of each class is Monet:1074, Cezanne:584, Van Gogh:401, Ukiyo-e:1433, Photographs:6853.
iphone2dslr_flower: both classes of images were downloaded from Flickr. The training set size of each class is iPhone:1813, DSLR:3316.
KaoKore Dataset: KaoKore is a novel dataset of face images from Japanese illustrations along with multiple labels for each face, derived from the Collection of Facial Expressions. The KaoKore dataset contains 5552 image files, each a color (RGB) image of size 256 x 256, as well as two sets of labels: gender and social status.
Ford Autonomous Vehicle Seasonal Dataset: A multi-agent seasonal dataset collected by a fleet of Ford autonomous vehicles on different days and at different times during 2017-18. The vehicles traversed an average route of 66 km in Michigan that included a mix of driving scenarios: the Detroit Airport, freeways, city centers, a university campus, and suburban neighbourhoods. Each vehicle used in this data collection is a Ford Fusion outfitted with an Applanix POS-LV GNSS system, four HDL-32E Velodyne 3D-lidar scanners, six Point Grey 1.3 MP cameras arranged on the rooftop for 360-degree coverage, and one Point Grey 5 MP camera mounted behind the windshield for the forward field of view. The data captures the seasonal variation in weather, lighting, construction, and traffic conditions experienced in dynamic urban environments. For more details about the Ford AV Dataset, see the paper, the GitHub repository, or the website.
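
The two domains of an unpaired dataset usually differ in size (for example, 939 horse and 1177 zebra images in horse2zebra), so a loader cannot simply zip the two folders together. The sketch below shows one possible way to draw training pairs: walk one domain in order, wrapping around if it is smaller, and sample the partner from the other domain at random. The function name unpaired_pairs and the placeholder file names are hypothetical; only the image counts come from the list above, and this is not taken from the CycleGAN code.

```python
import random
from typing import Iterator, Sequence, Tuple


def unpaired_pairs(
    domain_a: Sequence[str],
    domain_b: Sequence[str],
    seed: int = 0,
) -> Iterator[Tuple[str, str]]:
    """Yield one (a, b) pair per item of the larger domain.

    Domain A is walked in order and wraps around if it is the smaller one.
    The partner from domain B is drawn at random, so no fixed pairing
    between the two domains is implied.
    """
    rng = random.Random(seed)
    epoch_length = max(len(domain_a), len(domain_b))
    for i in range(epoch_length):
        a = domain_a[i % len(domain_a)]
        b = domain_b[rng.randrange(len(domain_b))]
        yield a, b


# Placeholder file names; only the counts (939 and 1177) come from the list.
horses = [f"horse_{i:04d}.jpg" for i in range(939)]
zebras = [f"zebra_{i:04d}.jpg" for i in range(1177)]

pairs = list(unpaired_pairs(horses, zebras))
print(len(pairs))  # one pass covers the larger domain
```

Sampling the second domain at random avoids presenting the same accidental pairing in every pass, which matters because the images in the two domains are not aligned.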

Attribute Editing

CelebA. The CelebFaces Attributes (CelebA) dataset contains 202,599 face images of celebrities, each annotated with 40 binary attributes. Images are 178×218. Example attributes include hair color (black, blond, brown), gender (male/female), and age (young/old). [Onedrive] [BaiduYun]
CelebA-HQ. [Homepage]. There is also a modified h5tool.py that makes it easier to obtain CelebA-HQ.
CelebAMask-HQ. A large-scale face image dataset of 30,000 high-resolution face images selected from the CelebA dataset by following CelebA-HQ. Each image has a segmentation mask of facial attributes corresponding to CelebA. The masks were manually annotated at 512×512 with 19 classes covering all facial components and accessories, such as skin, nose, eyes, eyebrows, ears, mouth, lip, hair, hat, eyeglass, earring, necklace, neck, and cloth. The dataset can be downloaded [here].
RaFD. The Radboud Faces Database (RaFD) consists of 4,824 images collected from 67 participants. Each participant makes eight facial expressions in three different gaze directions, which are captured from three different angles.
AFHQ. Released in StarGAN v2. Animal Faces-HQ (AFHQ) consists of 15,000 high-quality images at 512 × 512 resolution. The images were collected with permissive licenses from the Flickr and Pixabay websites. All images are vertically and horizontally aligned to have the eyes at the center, and low-quality images were discarded manually. The dataset can be downloaded using the provided scripts. For more details, see the Project or Paper.
CMU Multi-PIE Face Database. [Multi-PIE] A large (305GB) database of images for training facial recognition software. It contains 337 subjects captured in 13 poses within ±90 degrees and can be used for face frontalization experiments.
APDrawing Dataset. [APDrawingDB], [Project].
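
The RaFD figures in the list above are mutually consistent, which the short check below makes explicit: 8 expressions, 3 gaze directions, and 3 camera angles give 72 images per participant, and 67 participants give 4,824 images in total. The constant names are illustrative only; the numbers are the ones quoted in the RaFD entry.

```python
from itertools import product

# Figures quoted in the RaFD entry above.
PARTICIPANTS = 67
EXPRESSIONS = 8
GAZE_DIRECTIONS = 3
CAMERA_ANGLES = 3

# Every participant is captured under each combination of
# expression, gaze direction, and camera angle.
conditions = list(
    product(range(EXPRESSIONS), range(GAZE_DIRECTIONS), range(CAMERA_ANGLES))
)

images_per_participant = len(conditions)
total_images = PARTICIPANTS * images_per_participant

print(images_per_participant, total_images)
```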

Others

Makeup Transfer. [Download]
DeepFashion. The In-shop Clothes Retrieval Benchmark evaluates the performance of in-shop clothes retrieval. It is a large subset of DeepFashion with large pose and scale variations, high diversity, and rich annotations, comprising 7,982 clothing items, 52,712 in-shop clothes images, and ~200,000 cross-pose/scale pairs. Each image is annotated with a bounding box, clothing type, and pose type. [Download]
AI-Generated Faces: Free Resource of 100K Faces Without Copyright. [Download]
All-Age-Faces (AAF) Database. Contains 13,322 face images (mostly Asian) distributed across all ages (from 2 to 80), including 7,381 females and 5,941 males. [Github] [Paper]
Celeb-DF. A New Dataset for DeepFake Forensics. [Download]
The Deepfake Detection Challenge (DFDC) Preview Dataset. Facebook AI. [PDF] [Project].
FaceForensics++. Learning to Detect Manipulated Facial Images, 2019.
AI Generated Diverse Photos. [Project]
T-LESS. An RGB-D Dataset for 6D Pose Estimation of Texture-less Objects.

License

This work is licensed under a Creative Commons Attribution 4.0 International License.
