Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Paudel, Pramish, Khanal, Anubhav, Chhatkuli, Ajad, Paudel, Danda Pani, Tandukar, Jyoti
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2407.11174
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929421909229568
author	Paudel, Pramish Khanal, Anubhav Chhatkuli, Ajad Paudel, Danda Pani Tandukar, Jyoti
author_facet	Paudel, Pramish Khanal, Anubhav Chhatkuli, Ajad Paudel, Danda Pani Tandukar, Jyoti
contents	Personalized 3D avatars require an animatable representation of digital humans. Doing so instantly from monocular videos offers scalability to broad class of users and wide-scale applications. In this paper, we present a fast, simple, yet effective method for creating animatable 3D digital humans from monocular videos. Our method utilizes the efficiency of Gaussian splatting to model both 3D geometry and appearance. However, we observed that naively optimizing Gaussian splats results in inaccurate geometry, thereby leading to poor animations. This work achieves and illustrates the need of accurate 3D mesh-type modelling of the human body for animatable digitization through Gaussian splats. This is achieved by developing a novel pipeline that benefits from three key aspects: (a) implicit modelling of surface's displacements and the color's spherical harmonics; (b) binding of 3D Gaussians to the respective triangular faces of the body template; (c) a novel technique to render normals followed by their auxiliary supervision. Our exhaustive experiments on three different benchmark datasets demonstrates the state-of-the-art results of our method, in limited time settings. In fact, our method is faster by an order of magnitude (in terms of training time) than its closest competitor. At the same time, we achieve superior rendering and 3D reconstruction performance under the change of poses.
format	Preprint
id	arxiv_https___arxiv_org_abs_2407_11174
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	iHuman: Instant Animatable Digital Humans From Monocular Videos Paudel, Pramish Khanal, Anubhav Chhatkuli, Ajad Paudel, Danda Pani Tandukar, Jyoti Computer Vision and Pattern Recognition Artificial Intelligence Personalized 3D avatars require an animatable representation of digital humans. Doing so instantly from monocular videos offers scalability to broad class of users and wide-scale applications. In this paper, we present a fast, simple, yet effective method for creating animatable 3D digital humans from monocular videos. Our method utilizes the efficiency of Gaussian splatting to model both 3D geometry and appearance. However, we observed that naively optimizing Gaussian splats results in inaccurate geometry, thereby leading to poor animations. This work achieves and illustrates the need of accurate 3D mesh-type modelling of the human body for animatable digitization through Gaussian splats. This is achieved by developing a novel pipeline that benefits from three key aspects: (a) implicit modelling of surface's displacements and the color's spherical harmonics; (b) binding of 3D Gaussians to the respective triangular faces of the body template; (c) a novel technique to render normals followed by their auxiliary supervision. Our exhaustive experiments on three different benchmark datasets demonstrates the state-of-the-art results of our method, in limited time settings. In fact, our method is faster by an order of magnitude (in terms of training time) than its closest competitor. At the same time, we achieve superior rendering and 3D reconstruction performance under the change of poses.
title	iHuman: Instant Animatable Digital Humans From Monocular Videos
topic	Computer Vision and Pattern Recognition Artificial Intelligence
url	https://arxiv.org/abs/2407.11174

Similar Items