Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Sanvito, Alessandro, Ramazzina, Andrea, Walz, Stefanie, Bijelic, Mario, Heide, Felix
Format: Preprint
Veröffentlicht: 2024
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2405.19712
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866911894207463424
author Sanvito, Alessandro
Ramazzina, Andrea
Walz, Stefanie
Bijelic, Mario
Heide, Felix
author_facet Sanvito, Alessandro
Ramazzina, Andrea
Walz, Stefanie
Bijelic, Mario
Heide, Felix
contents No augmented application is possible without animated humanoid avatars. At the same time, generating human replicas from real-world monocular hand-held or robotic sensor setups is challenging due to the limited availability of views. Previous work showed the feasibility of virtual avatars but required the presence of 360 degree views of the targeted subject. To address this issue, we propose HINT, a NeRF-based algorithm able to learn a detailed and complete human model from limited viewing angles. We achieve this by introducing a symmetry prior, regularization constraints, and training cues from large human datasets. In particular, we introduce a sagittal plane symmetry prior to the appearance of the human, directly supervise the density function of the human model using explicit 3D body modeling, and leverage a co-learned human digitization network as additional supervision for the unseen angles. As a result, our method can reconstruct complete humans even from a few viewing angles, increasing performance by more than 15% PSNR compared to previous state-of-the-art algorithms.
format Preprint
id arxiv_https___arxiv_org_abs_2405_19712
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle HINT: Learning Complete Human Neural Representations from Limited Viewpoints
Sanvito, Alessandro
Ramazzina, Andrea
Walz, Stefanie
Bijelic, Mario
Heide, Felix
Computer Vision and Pattern Recognition
No augmented application is possible without animated humanoid avatars. At the same time, generating human replicas from real-world monocular hand-held or robotic sensor setups is challenging due to the limited availability of views. Previous work showed the feasibility of virtual avatars but required the presence of 360 degree views of the targeted subject. To address this issue, we propose HINT, a NeRF-based algorithm able to learn a detailed and complete human model from limited viewing angles. We achieve this by introducing a symmetry prior, regularization constraints, and training cues from large human datasets. In particular, we introduce a sagittal plane symmetry prior to the appearance of the human, directly supervise the density function of the human model using explicit 3D body modeling, and leverage a co-learned human digitization network as additional supervision for the unseen angles. As a result, our method can reconstruct complete humans even from a few viewing angles, increasing performance by more than 15% PSNR compared to previous state-of-the-art algorithms.
title HINT: Learning Complete Human Neural Representations from Limited Viewpoints
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2405.19712