Guardado en:
Detalles Bibliográficos
Autores principales: Xu, Haoxin, Zhao, Zezheng, Cao, Yuxin, Chen, Chunyu, Ge, Hao, Liu, Ziyao
Formato: Preprint
Publicado: 2024
Materias:
Acceso en línea:https://arxiv.org/abs/2403.05218
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
_version_ 1866917623400235008
author Xu, Haoxin
Zhao, Zezheng
Cao, Yuxin
Chen, Chunyu
Ge, Hao
Liu, Ziyao
author_facet Xu, Haoxin
Zhao, Zezheng
Cao, Yuxin
Chen, Chunyu
Ge, Hao
Liu, Ziyao
contents Monocular 3D face reconstruction plays a crucial role in avatar generation, with significant demand in web-related applications such as generating virtual financial advisors in FinTech. Current reconstruction methods predominantly rely on deep learning techniques and employ 2D self-supervision as a means to guide model learning. However, these methods encounter challenges in capturing the comprehensive 3D structural information of the face due to the utilization of 2D images for model training purposes. To overcome this limitation and enhance the reconstruction of 3D structural features, we propose an innovative approach that integrates existing 2D features with 3D features to guide the model learning process. Specifically, we introduce the 3D-ID Loss, which leverages the high-dimensional structure features extracted from a Spectral-Based Graph Convolution Encoder applied to the facial mesh. This approach surpasses the sole reliance on the 3D information provided by the facial mesh vertices coordinates. Our model is trained using 2D-3D data pairs from a combination of datasets and achieves state-of-the-art performance on the NoW benchmark.
format Preprint
id arxiv_https___arxiv_org_abs_2403_05218
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle 3D Face Reconstruction Using A Spectral-Based Graph Convolution Encoder
Xu, Haoxin
Zhao, Zezheng
Cao, Yuxin
Chen, Chunyu
Ge, Hao
Liu, Ziyao
Computer Vision and Pattern Recognition
Monocular 3D face reconstruction plays a crucial role in avatar generation, with significant demand in web-related applications such as generating virtual financial advisors in FinTech. Current reconstruction methods predominantly rely on deep learning techniques and employ 2D self-supervision as a means to guide model learning. However, these methods encounter challenges in capturing the comprehensive 3D structural information of the face due to the utilization of 2D images for model training purposes. To overcome this limitation and enhance the reconstruction of 3D structural features, we propose an innovative approach that integrates existing 2D features with 3D features to guide the model learning process. Specifically, we introduce the 3D-ID Loss, which leverages the high-dimensional structure features extracted from a Spectral-Based Graph Convolution Encoder applied to the facial mesh. This approach surpasses the sole reliance on the 3D information provided by the facial mesh vertices coordinates. Our model is trained using 2D-3D data pairs from a combination of datasets and achieves state-of-the-art performance on the NoW benchmark.
title 3D Face Reconstruction Using A Spectral-Based Graph Convolution Encoder
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2403.05218