Saved in:
Bibliographic Details
Main Authors: Wu, Sidi, Chen, Yizi, Mermet, Samuel, Hurni, Lorenz, Schindler, Konrad, Gonthier, Nicolas, Landrieu, Loic
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2403.20142
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909154884452352
author Wu, Sidi
Chen, Yizi
Mermet, Samuel
Hurni, Lorenz
Schindler, Konrad
Gonthier, Nicolas
Landrieu, Loic
author_facet Wu, Sidi
Chen, Yizi
Mermet, Samuel
Hurni, Lorenz
Schindler, Konrad
Gonthier, Nicolas
Landrieu, Loic
contents Most image-to-image translation models postulate that a unique correspondence exists between the semantic classes of the source and target domains. However, this assumption does not always hold in real-world scenarios due to divergent distributions, different class sets, and asymmetrical information representation. As conventional GANs attempt to generate images that match the distribution of the target domain, they may hallucinate spurious instances of classes absent from the source domain, thereby diminishing the usefulness and reliability of translated images. CycleGAN-based methods are also known to hide the mismatched information in the generated images to bypass cycle consistency objectives, a process known as steganography. In response to the challenge of non-bijective image translation, we introduce StegoGAN, a novel model that leverages steganography to prevent spurious features in generated images. Our approach enhances the semantic consistency of the translated images without requiring additional postprocessing or supervision. Our experimental evaluations demonstrate that StegoGAN outperforms existing GAN-based models across various non-bijective image-to-image translation tasks, both qualitatively and quantitatively. Our code and pretrained models are accessible at https://github.com/sian-wusidi/StegoGAN.
format Preprint
id arxiv_https___arxiv_org_abs_2403_20142
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle StegoGAN: Leveraging Steganography for Non-Bijective Image-to-Image Translation
Wu, Sidi
Chen, Yizi
Mermet, Samuel
Hurni, Lorenz
Schindler, Konrad
Gonthier, Nicolas
Landrieu, Loic
Computer Vision and Pattern Recognition
Image and Video Processing
Most image-to-image translation models postulate that a unique correspondence exists between the semantic classes of the source and target domains. However, this assumption does not always hold in real-world scenarios due to divergent distributions, different class sets, and asymmetrical information representation. As conventional GANs attempt to generate images that match the distribution of the target domain, they may hallucinate spurious instances of classes absent from the source domain, thereby diminishing the usefulness and reliability of translated images. CycleGAN-based methods are also known to hide the mismatched information in the generated images to bypass cycle consistency objectives, a process known as steganography. In response to the challenge of non-bijective image translation, we introduce StegoGAN, a novel model that leverages steganography to prevent spurious features in generated images. Our approach enhances the semantic consistency of the translated images without requiring additional postprocessing or supervision. Our experimental evaluations demonstrate that StegoGAN outperforms existing GAN-based models across various non-bijective image-to-image translation tasks, both qualitatively and quantitatively. Our code and pretrained models are accessible at https://github.com/sian-wusidi/StegoGAN.
title StegoGAN: Leveraging Steganography for Non-Bijective Image-to-Image Translation
topic Computer Vision and Pattern Recognition
Image and Video Processing
url https://arxiv.org/abs/2403.20142