Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Cheng, Tianhang, Ma, Wei-Chiu, Guan, Kaiyu, Torralba, Antonio, Wang, Shenlong
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2401.05236
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910293433516032
author	Cheng, Tianhang Ma, Wei-Chiu Guan, Kaiyu Torralba, Antonio Wang, Shenlong
author_facet	Cheng, Tianhang Ma, Wei-Chiu Guan, Kaiyu Torralba, Antonio Wang, Shenlong
contents	Our world is full of identical objects (\emphe.g., cans of coke, cars of same model). These duplicates, when seen together, provide additional and strong cues for us to effectively reason about 3D. Inspired by this observation, we introduce Structure from Duplicates (SfD), a novel inverse graphics framework that reconstructs geometry, material, and illumination from a single image containing multiple identical objects. SfD begins by identifying multiple instances of an object within an image, and then jointly estimates the 6DoF pose for all instances.An inverse graphics pipeline is subsequently employed to jointly reason about the shape, material of the object, and the environment light, while adhering to the shared geometry and material constraint across instances. Our primary contributions involve utilizing object duplicates as a robust prior for single-image inverse graphics and proposing an in-plane rotation-robust Structure from Motion (SfM) formulation for joint 6-DoF object pose estimation. By leveraging multi-view cues from a single image, SfD generates more realistic and detailed 3D reconstructions, significantly outperforming existing single image reconstruction models and multi-view reconstruction approaches with a similar or greater number of observations.
format	Preprint
id	arxiv_https___arxiv_org_abs_2401_05236
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects Cheng, Tianhang Ma, Wei-Chiu Guan, Kaiyu Torralba, Antonio Wang, Shenlong Computer Vision and Pattern Recognition Our world is full of identical objects (\emphe.g., cans of coke, cars of same model). These duplicates, when seen together, provide additional and strong cues for us to effectively reason about 3D. Inspired by this observation, we introduce Structure from Duplicates (SfD), a novel inverse graphics framework that reconstructs geometry, material, and illumination from a single image containing multiple identical objects. SfD begins by identifying multiple instances of an object within an image, and then jointly estimates the 6DoF pose for all instances.An inverse graphics pipeline is subsequently employed to jointly reason about the shape, material of the object, and the environment light, while adhering to the shared geometry and material constraint across instances. Our primary contributions involve utilizing object duplicates as a robust prior for single-image inverse graphics and proposing an in-plane rotation-robust Structure from Motion (SfM) formulation for joint 6-DoF object pose estimation. By leveraging multi-view cues from a single image, SfD generates more realistic and detailed 3D reconstructions, significantly outperforming existing single image reconstruction models and multi-view reconstruction approaches with a similar or greater number of observations.
title	Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2401.05236

Similar Items