A collection of image-to-image papers.
Papers are ordered in arXiv first version submitting time (if applicable).
Feel free to send a PR or issue.
TOC
Note | Model | Paper | Conference | paper link | code link |
---|---|---|---|---|---|
pix2pix | Image-to-Image Translation with Conditional Adversarial Networks | CVPR 2017 | 1611.07004 | junyanz/pytorch-CycleGAN-and-pix2pix | |
texture guided | TextureGAN | TextureGAN: Controlling Deep Image Synthesis with Texture Patches | CVPR 2018 | 1706.02823 | janesjanes/Pytorch-TextureGAN |
Contextual GAN | Image Generation from Sketch Constraint Using Contextual GAN | ECCV 2018 | 1711.08972 | ||
pix2pix-HD | High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs | CVPR 2018 | 1711.11585 | NVIDIA/pix2pixHD | |
one-to-many | BicycleGAN | Toward Multimodal Image-to-Image Translation | NIPS 2017 | 1711.11586 | junyanz/BicycleGAN |
keypoint guided | G2-GAN | Geometry Guided Adversarial Facial Expression Synthesis | MM 2018 | 1712.03474 | |
contour2im | Smart, Sparse Contours to Represent and Edit Images | CVPR 2018 | 1712.08232 | website | |
disentangle | Cross-domain disentanglement networks | Image-to-image translation for cross-domain disentanglement | NIPS 2018 | 1805.09730 | |
video | vid2vid | Video-to-Video Synthesis | NIPS 2018 | 1808.06601 | NVIDIA/vid2vid |
video | pix2pix-HD + Temporal Smoothing + faceGAN | Everybody Dance Now | ECCVW 2018 | 1808.07371 | website |
Note | Model | Paper | Conference | paper link | code link |
---|---|---|---|---|---|
DTN | Unsupervised Cross-Domain Image Generation | ICLR 2017 | 1611.02200 | yunjey/domain-transfer-network (unofficial) | |
UNIT | Unsupervised image-to-image translation networks | NIPS 2017 | 1703.00848 | mingyuliutw/UNIT | |
DiscoGAN | Learning to Discover Cross-Domain Relations with Generative Adversarial Networks | ICML 2017 | 1703.05192 | SKTBrain/DiscoGAN | |
CycleGAN | Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks | ICCV 2017 | 1703.10593 | junyanz/pytorch-CycleGAN-and-pix2pix | |
DualGAN | DualGAN: Unsupervised Dual Learning for Image-to-Image Translation | ICCV 2017 | 1704.02510 | duxingren14/DualGAN | |
DistanceGAN | One-Sided Unsupervised Domain Mapping | NIPS 2017 | 1706.00826 | sagiebenaim/DistanceGAN | |
semi supervised | Triangle GAN | Triangle Generative Adversarial Networks | NIPS 2017 | 1709.06548 | LiqunChen0606/Triangle-GAN |
CartoonGAN | CartoonGAN: Generative Adversarial Networks for Photo Cartoonization | CVPR 2018 | thecvf | FlyingGoblin/CartoonGAN, unofficial test, unofficial pytorch | |
non-adversarial | NAM | NAM: Non-Adversarial Unsupervised Domain Mapping | ECCV 2018 | 1806.00804 | facebookresearch/nam |
SCAN | Unsupervised Image-to-Image Translation with Stacked Cycle-Consistent Adversarial Networks | ECCV 2018 | 1807.08536 | ||
dilated conv, improve shape deform. | GANimorph | Improved Shape Deformation in Unsupervised Image to Image Translation | ECCV 2018 | 1808.04325 | brownvc/ganimorph |
video | Recycle-GAN | Recycle-GAN: Unsupervised Video Retargeting | ECCV 2018 | 1808.05174 | aayushbansal/Recycle-GAN |
OT-CycleGAN | Guiding the One-to-one Mapping in CycleGAN via Optimal Transport | AAAI 2019 | 1811.06284 | ||
One-shot | OST | One-Shot Unsupervised Cross Domain Translation | NIPS 2018 | 1806.06029 | sagiebenaim/OneShotTranslation |
Note | Model | Paper | Conference | paper link | code link |
---|---|---|---|---|---|
mask | ContrastGAN | Generative Semantic Manipulation with Mask-Contrasting GAN | ECCV 2018 | 1708.00315 | |
attention | DA-GAN | DA-GAN: Instance-level Image Translation by Deep Attention Generative Adversarial Networks | CVPR 2018 | 1802.06454 | |
mask / attention | Attention-GAN | Attention-GAN for Object Transfiguration in Wild Images | ECCV 2018 | 1803.06798 | |
attention | Attention guided GAN | Unsupervised Attention-guided Image to Image Translation | NIPS 2018 | 1806.02311 | AlamiMejjati/Unsupervised-Attention-guided-Image-to-Image-Translation |
attention, one-sided | Show, Attend and Translate: Unsupervised Image Translation with Self-Regularization and Attention | 1806.06195 | |||
instance aware | InstaGAN | Instance-aware image-to-image translation | ICLR 2019 | openreview | sangwoomo/instagan |
Semantic Patch memory bank | Art2Real | Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation | CVPR 2019 | 1811.10666 |
Note | Model | Paper | Conference | paper link | code link |
---|---|---|---|---|---|
IcGAN | Invertible Conditional GANs for image editing | NIPSW 2016 | 1611.06355 | Guim3/IcGAN | |
Conditional CycleGAN | Conditional CycleGAN for Attribute Guided Face Image Generation | ECCV 2018 | 1705.09966 | ||
StarGAN | StarGAN: Unified Generative Adversarial Networks for Multi-Domain Image-to-Image Translation | CVPR 2018 | 1711.09020 | yunjey/StarGAN | |
AttGAN | AttGAN: Facial Attribute Editing by Only Changing What You Want | 1711.10678 | LynnHo/AttGAN-Tensorflow | ||
ComboGAN | ComboGAN: Unrestrained Scalability for Image Domain Translation | CVPRW 2018 | 1712.06909 | AAnoosheh/ComboGAN | |
AugCGAN (Augmented CycleGAN) | Augmented CycleGAN: Learning Many-to-Many Mappings from Unpaired Data | ICML 2018 | 1802.10151 | aalmah/augmented_cyclegan | |
ModularGAN | Modular Generative Adversarial Networks | ECCV 2018 | 1804.03343 | ||
sparsely grouped dataset | SG-GAN | Sparsely Grouped Multi-task Generative Adversarial Networks for Facial Attribute Manipulation | MM 2018 | 1805.07509 | zhangqianhui/Sparsely-Grouped-GAN |
GANimation | GANimation: Anatomically-aware Facial Animation from a Single Image | ECCV 2018 (honorable mention) | 1807.09251 | albertpumarola/GANimation | |
SMIT | SMIT: Stochastic Multi-Label Image-to-Image Translation | 1812.03704 |
Note | Model | Paper | Conference | paper link | code link |
---|---|---|---|---|---|
XGAN | XGAN: Unsupervised Image-to-Image Translation for Many-to-Many Mappings | ICML 2018 | 1711.05139 | dataset | |
ELEGANT | ELEGANT: Exchanging Latent Encodings with GAN for Transferring Multiple Face Attributes | ECCV 2018 | 1803.10562 | Prinsphield/ELEGANT | |
MUNIT | Multimodal Unsupervised Image-to-Image Translation | ECCV 2018 | 1804.04732 | NVlabs/MUNIT | |
cd-GAN (Conditional DualGAN) | Conditional Image-to-Image Translation | CVPR 2018 | 1805.00251 | ||
EG-UNIT | Exemplar Guided Unsupervised Image-to-Image Translation | 1805.11145 | |||
PairedCycleGAN | PairedCycleGAN: Asymmetric Style Transfer for Applying and Removing Makeup | CVPR 2018 | thecvf | ||
DRIT | Diverse Image-to-Image Translation via Disentangled Representations | ECCV 2018 | 1808.00948 | HsinYingLee/DRIT | |
UFDN | A Unified Feature Disentangler for Multi-Domain Image Translation and Manipulation | NIPS 2018 | 1809.01361 | Alexander-H-Liu/UFDN | |
non-disentangle, face makeup guided | BeautyGAN | BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network | MM 2018 | author | |
Style-based generator | A Style-Based Generator Architecture for Generative Adversarial Networks | 1812.04948 | website | ||
GDWTC | Image-to-Image Translation via Group-wise Deep Whitening and Coloring Transformation | 1812.09912 |