Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Pobitzer, Markus, Janicki, Filip, Rigotti, Mattia, Malossi, Cristiano
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition I.4; I.5
Online Access:	https://arxiv.org/abs/2402.16421
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914692333568000
author	Pobitzer, Markus Janicki, Filip Rigotti, Mattia Malossi, Cristiano
author_facet	Pobitzer, Markus Janicki, Filip Rigotti, Mattia Malossi, Cristiano
contents	Instance segmentation datasets play a crucial role in training accurate and robust computer vision models. However, obtaining accurate mask annotations to produce high-quality segmentation datasets is a costly and labor-intensive process. In this work, we show how this issue can be mitigated by starting with small annotated instance segmentation datasets and augmenting them to effectively obtain a sizeable annotated dataset. We achieve that by creating variations of the available annotated object instances in a way that preserves the provided mask annotations, thereby resulting in new image-mask pairs to be added to the set of annotated images. Specifically, we generate new images using a diffusion-based inpainting model to fill out the masked area with a desired object class by guiding the diffusion through the object outline. We show that the object outline provides a simple, but also reliable and convenient training-free guidance signal for the underlying inpainting model that is often sufficient to fill out the mask with an object of the correct class without further text guidance and preserve the correspondence between generated images and the mask annotations with high precision. Our experimental results reveal that our method successfully generates realistic variations of object instances, preserving their shape characteristics while introducing diversity within the augmented area. We also show that the proposed method can naturally be combined with text guidance and other image augmentation techniques.
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_16421
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Outline-Guided Object Inpainting with Diffusion Models Pobitzer, Markus Janicki, Filip Rigotti, Mattia Malossi, Cristiano Computer Vision and Pattern Recognition I.4; I.5 Instance segmentation datasets play a crucial role in training accurate and robust computer vision models. However, obtaining accurate mask annotations to produce high-quality segmentation datasets is a costly and labor-intensive process. In this work, we show how this issue can be mitigated by starting with small annotated instance segmentation datasets and augmenting them to effectively obtain a sizeable annotated dataset. We achieve that by creating variations of the available annotated object instances in a way that preserves the provided mask annotations, thereby resulting in new image-mask pairs to be added to the set of annotated images. Specifically, we generate new images using a diffusion-based inpainting model to fill out the masked area with a desired object class by guiding the diffusion through the object outline. We show that the object outline provides a simple, but also reliable and convenient training-free guidance signal for the underlying inpainting model that is often sufficient to fill out the mask with an object of the correct class without further text guidance and preserve the correspondence between generated images and the mask annotations with high precision. Our experimental results reveal that our method successfully generates realistic variations of object instances, preserving their shape characteristics while introducing diversity within the augmented area. We also show that the proposed method can naturally be combined with text guidance and other image augmentation techniques.
title	Outline-Guided Object Inpainting with Diffusion Models
topic	Computer Vision and Pattern Recognition I.4; I.5
url	https://arxiv.org/abs/2402.16421

Similar Items