Staff View: MOGAN: Morphologic-structure-aware Generative Learning from a Single Image :: Library Catalog

Bibliographic Details
Main Authors:	Chen, Jinshu; Xu, Qihui; Kang, Qi; Zhou, MengChu
Format:	Preprint
Published:	2021
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2103.02997

_version_	1866916454694125568
author	Chen, Jinshu; Xu, Qihui; Kang, Qi; Zhou, MengChu
author_facet	Chen, Jinshu; Xu, Qihui; Kang, Qi; Zhou, MengChu
contents	In most interactive image generation tasks, given user-specified regions of interest (ROI), the generated results are expected to be adequately diverse in appearance while maintaining the correct and reasonable structures of the original images. Such tasks become more challenging when only limited data is available. Recently proposed generative models can be trained on only one image. They pay much attention to the monolithic features of the sample while ignoring the actual semantic information of the different objects inside it. As a result, for ROI-based generation tasks, they may produce inappropriate samples with excessive randomness that fail to maintain the correct structures of the related objects. To address this issue, this work introduces a MOrphologic-structure-aware Generative Adversarial Network named MOGAN that produces random samples with diverse appearances and reliable structures based on only one image. To train on the ROI, we propose to use augmented data derived from the original image and introduce a novel module that transforms this augmented data into knowledge containing both structures and appearances, thus enhancing the model's comprehension of the sample. To learn the remaining areas outside the ROI, we employ binary masks to ensure that their generation is isolated from the ROI. Finally, we set up parallel and hierarchical branches of this learning process. Compared with other single-image GAN schemes, our approach focuses on internal features, including the maintenance of rational structures and variation in appearance. Experiments confirm that our model performs better on ROI-based image generation tasks than its competing peers.
format	Preprint
id	arxiv_https___arxiv_org_abs_2103_02997
institution	arXiv
publishDate	2021
record_format	arxiv
title	MOGAN: Morphologic-structure-aware Generative Learning from a Single Image
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2103.02997
