Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Author:	Zhao, Wanchen
Format:	Preprint
Published:	2024
Subjects:	Image and Video Processing Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2409.16042
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916411546271744
author	Zhao, Wanchen
author_facet	Zhao, Wanchen
contents	Image-to-Image Translation is a vital area of computer vision that focuses on transforming images from one visual domain to another while preserving their core content and structure. However, this field faces two major challenges: first, the data from the two domains are often unpaired, making it difficult to train generative adversarial networks effectively; second, existing methods tend to produce artifacts or hallucinations during image generation, leading to a decline in image quality. To address these issues, this paper proposes an enhanced unsupervised image-to-image translation method based on the Contrastive Unpaired Translation (CUT) model, incorporating Histogram of Oriented Gradients (HOG) features. This novel approach ensures the preservation of the semantic structure of images, even without semantic labels, by minimizing the loss between the HOG features of input and generated images. The method was tested on translating synthetic game environments from GTA5 dataset to realistic urban scenes in cityscapes dataset, demonstrating significant improvements in reducing hallucinations and enhancing image quality.
format	Preprint
id	arxiv_https___arxiv_org_abs_2409_16042
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Enhanced Unsupervised Image-to-Image Translation Using Contrastive Learning and Histogram of Oriented Gradients Zhao, Wanchen Image and Video Processing Computer Vision and Pattern Recognition Image-to-Image Translation is a vital area of computer vision that focuses on transforming images from one visual domain to another while preserving their core content and structure. However, this field faces two major challenges: first, the data from the two domains are often unpaired, making it difficult to train generative adversarial networks effectively; second, existing methods tend to produce artifacts or hallucinations during image generation, leading to a decline in image quality. To address these issues, this paper proposes an enhanced unsupervised image-to-image translation method based on the Contrastive Unpaired Translation (CUT) model, incorporating Histogram of Oriented Gradients (HOG) features. This novel approach ensures the preservation of the semantic structure of images, even without semantic labels, by minimizing the loss between the HOG features of input and generated images. The method was tested on translating synthetic game environments from GTA5 dataset to realistic urban scenes in cityscapes dataset, demonstrating significant improvements in reducing hallucinations and enhancing image quality.
title	Enhanced Unsupervised Image-to-Image Translation Using Contrastive Learning and Histogram of Oriented Gradients
topic	Image and Video Processing Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2409.16042

Similar Items