Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Shim, Jaehyeok, Joo, Kyungdon
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2403.05005
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916299105370112
author	Shim, Jaehyeok Joo, Kyungdon
author_facet	Shim, Jaehyeok Joo, Kyungdon
contents	We propose a novel concept of dual and integrated latent topologies (DITTO in short) for implicit 3D reconstruction from noisy and sparse point clouds. Most existing methods predominantly focus on single latent type, such as point or grid latents. In contrast, the proposed DITTO leverages both point and grid latents (i.e., dual latent) to enhance their strengths, the stability of grid latents and the detail-rich capability of point latents. Concretely, DITTO consists of dual latent encoder and integrated implicit decoder. In the dual latent encoder, a dual latent layer, which is the key module block composing the encoder, refines both latents in parallel, maintaining their distinct shapes and enabling recursive interaction. Notably, a newly proposed dynamic sparse point transformer within the dual latent layer effectively refines point latents. Then, the integrated implicit decoder systematically combines these refined latents, achieving high-fidelity 3D reconstruction and surpassing previous state-of-the-art methods on object- and scene-level datasets, especially in thin and detailed structures.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_05005
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction Shim, Jaehyeok Joo, Kyungdon Computer Vision and Pattern Recognition We propose a novel concept of dual and integrated latent topologies (DITTO in short) for implicit 3D reconstruction from noisy and sparse point clouds. Most existing methods predominantly focus on single latent type, such as point or grid latents. In contrast, the proposed DITTO leverages both point and grid latents (i.e., dual latent) to enhance their strengths, the stability of grid latents and the detail-rich capability of point latents. Concretely, DITTO consists of dual latent encoder and integrated implicit decoder. In the dual latent encoder, a dual latent layer, which is the key module block composing the encoder, refines both latents in parallel, maintaining their distinct shapes and enabling recursive interaction. Notably, a newly proposed dynamic sparse point transformer within the dual latent layer effectively refines point latents. Then, the integrated implicit decoder systematically combines these refined latents, achieving high-fidelity 3D reconstruction and surpassing previous state-of-the-art methods on object- and scene-level datasets, especially in thin and detailed structures.
title	DITTO: Dual and Integrated Latent Topologies for Implicit 3D Reconstruction
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2403.05005

Similar Items