Staff View :: Library Catalog

Bibliographic Details
Main Authors:	Xu, Haoxin; Zhao, Zezheng; Cao, Yuxin; Chen, Chunyu; Ge, Hao; Liu, Ziyao
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2403.05218

_version_	1866917623400235008
author	Xu, Haoxin; Zhao, Zezheng; Cao, Yuxin; Chen, Chunyu; Ge, Hao; Liu, Ziyao
author_facet	Xu, Haoxin; Zhao, Zezheng; Cao, Yuxin; Chen, Chunyu; Ge, Hao; Liu, Ziyao
contents	Monocular 3D face reconstruction plays a crucial role in avatar generation, with significant demand in web-related applications such as generating virtual financial advisors in FinTech. Current reconstruction methods predominantly rely on deep learning techniques and employ 2D self-supervision as a means to guide model learning. However, these methods encounter challenges in capturing the comprehensive 3D structural information of the face due to the utilization of 2D images for model training purposes. To overcome this limitation and enhance the reconstruction of 3D structural features, we propose an innovative approach that integrates existing 2D features with 3D features to guide the model learning process. Specifically, we introduce the 3D-ID Loss, which leverages the high-dimensional structure features extracted from a Spectral-Based Graph Convolution Encoder applied to the facial mesh. This approach surpasses the sole reliance on the 3D information provided by the facial mesh vertices coordinates. Our model is trained using 2D-3D data pairs from a combination of datasets and achieves state-of-the-art performance on the NoW benchmark.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_05218
institution	arXiv
publishDate	2024
record_format	arxiv
title	3D Face Reconstruction Using A Spectral-Based Graph Convolution Encoder
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2403.05218

Similar Items