Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wu, Sidi, Chen, Yizi, Mermet, Samuel, Hurni, Lorenz, Schindler, Konrad, Gonthier, Nicolas, Landrieu, Loic
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Image and Video Processing
Online Access:	https://arxiv.org/abs/2403.20142
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909154884452352
author	Wu, Sidi Chen, Yizi Mermet, Samuel Hurni, Lorenz Schindler, Konrad Gonthier, Nicolas Landrieu, Loic
author_facet	Wu, Sidi Chen, Yizi Mermet, Samuel Hurni, Lorenz Schindler, Konrad Gonthier, Nicolas Landrieu, Loic
contents	Most image-to-image translation models postulate that a unique correspondence exists between the semantic classes of the source and target domains. However, this assumption does not always hold in real-world scenarios due to divergent distributions, different class sets, and asymmetrical information representation. As conventional GANs attempt to generate images that match the distribution of the target domain, they may hallucinate spurious instances of classes absent from the source domain, thereby diminishing the usefulness and reliability of translated images. CycleGAN-based methods are also known to hide the mismatched information in the generated images to bypass cycle consistency objectives, a process known as steganography. In response to the challenge of non-bijective image translation, we introduce StegoGAN, a novel model that leverages steganography to prevent spurious features in generated images. Our approach enhances the semantic consistency of the translated images without requiring additional postprocessing or supervision. Our experimental evaluations demonstrate that StegoGAN outperforms existing GAN-based models across various non-bijective image-to-image translation tasks, both qualitatively and quantitatively. Our code and pretrained models are accessible at https://github.com/sian-wusidi/StegoGAN.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_20142
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	StegoGAN: Leveraging Steganography for Non-Bijective Image-to-Image Translation Wu, Sidi Chen, Yizi Mermet, Samuel Hurni, Lorenz Schindler, Konrad Gonthier, Nicolas Landrieu, Loic Computer Vision and Pattern Recognition Image and Video Processing Most image-to-image translation models postulate that a unique correspondence exists between the semantic classes of the source and target domains. However, this assumption does not always hold in real-world scenarios due to divergent distributions, different class sets, and asymmetrical information representation. As conventional GANs attempt to generate images that match the distribution of the target domain, they may hallucinate spurious instances of classes absent from the source domain, thereby diminishing the usefulness and reliability of translated images. CycleGAN-based methods are also known to hide the mismatched information in the generated images to bypass cycle consistency objectives, a process known as steganography. In response to the challenge of non-bijective image translation, we introduce StegoGAN, a novel model that leverages steganography to prevent spurious features in generated images. Our approach enhances the semantic consistency of the translated images without requiring additional postprocessing or supervision. Our experimental evaluations demonstrate that StegoGAN outperforms existing GAN-based models across various non-bijective image-to-image translation tasks, both qualitatively and quantitatively. Our code and pretrained models are accessible at https://github.com/sian-wusidi/StegoGAN.
title	StegoGAN: Leveraging Steganography for Non-Bijective Image-to-Image Translation
topic	Computer Vision and Pattern Recognition Image and Video Processing
url	https://arxiv.org/abs/2403.20142

Similar Items