Staff View: Toon3D: Seeing Cartoons from New Perspectives :: Library Catalog

Bibliographic Details
Main Authors:	Weber, Ethan; Peterlinz, Riley; Mathur, Rohan; Warburg, Frederik; Efros, Alexei A.; Kanazawa, Angjoo
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2405.10320

_version_	1866929622368649216
author	Weber, Ethan; Peterlinz, Riley; Mathur, Rohan; Warburg, Frederik; Efros, Alexei A.; Kanazawa, Angjoo
author_facet	Weber, Ethan; Peterlinz, Riley; Mathur, Rohan; Warburg, Frederik; Efros, Alexei A.; Kanazawa, Angjoo
contents	We recover the underlying 3D structure from images of cartoons and anime depicting the same scene. This is an interesting problem domain because images in creative media are often depicted without explicit geometric consistency for storytelling and creative expression; they are only 3D in a qualitative sense. While humans can easily perceive the underlying 3D scene from these images, existing Structure-from-Motion (SfM) methods that assume 3D consistency fail catastrophically. We present Toon3D for reconstructing geometrically inconsistent images. Our key insight is to deform the input images while recovering camera poses and scene geometry, effectively explaining away geometrical inconsistencies to achieve consistency. This process is guided by the structure inferred from monocular depth predictions. We curate a dataset with multi-view imagery from cartoons and anime that we annotate with reliable sparse correspondences using our user-friendly annotation tool. Our recovered point clouds can be plugged into novel-view synthesis methods to experience cartoons from viewpoints never drawn before. We evaluate against classical and recent learning-based SfM methods, where Toon3D is able to obtain more reliable camera poses and scene geometry.
format	Preprint
id	arxiv_https___arxiv_org_abs_2405_10320
institution	arXiv
publishDate	2024
record_format	arxiv
title	Toon3D: Seeing Cartoons from New Perspectives
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2405.10320

Similar Items