Affichage MARC: :: Library Catalog

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Jiao, Zixun, Wang, Xihan, Xia, Zhaoqiang, Shao, Lianhe, Gao, Quanli
Format:	Preprint
Publié:	2024
Sujets:	Computer Vision and Pattern Recognition Artificial Intelligence Human-Computer Interaction
Accès en ligne:	https://arxiv.org/abs/2405.01066
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

_version_	1866908406535684096
author	Jiao, Zixun Wang, Xihan Xia, Zhaoqiang Shao, Lianhe Gao, Quanli
author_facet	Jiao, Zixun Wang, Xihan Xia, Zhaoqiang Shao, Lianhe Gao, Quanli
contents	Reconstructing the hand mesh from one single RGB image is a challenging task because hands are often occluded by other objects. Most previous works attempt to explore more additional information and adopt attention mechanisms for improving 3D reconstruction performance, while it would increase computational complexity simultaneously. To achieve a performance-reserving architecture with high computational efficiency, in this work, we propose a simple but effective 3D hand mesh reconstruction network (i.e., HandS3C), which is the first time to incorporate state space model into the task of hand mesh reconstruction. In the network, we design a novel state-space spatial-channel attention module that extends the effective receptive field, extracts hand features in the spatial dimension, and enhances regional features of hands in the channel dimension. This helps to reconstruct a complete and detailed hand mesh. Extensive experiments conducted on well-known datasets facing heavy occlusions (such as FREIHAND, DEXYCB, and HO3D) demonstrate that our proposed HandS3C achieves state-of-the-art performance while maintaining a minimal parameters.
format	Preprint
id	arxiv_https___arxiv_org_abs_2405_01066
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	HandS3C: 3D Hand Mesh Reconstruction with State Space Spatial Channel Attention from RGB images Jiao, Zixun Wang, Xihan Xia, Zhaoqiang Shao, Lianhe Gao, Quanli Computer Vision and Pattern Recognition Artificial Intelligence Human-Computer Interaction Reconstructing the hand mesh from one single RGB image is a challenging task because hands are often occluded by other objects. Most previous works attempt to explore more additional information and adopt attention mechanisms for improving 3D reconstruction performance, while it would increase computational complexity simultaneously. To achieve a performance-reserving architecture with high computational efficiency, in this work, we propose a simple but effective 3D hand mesh reconstruction network (i.e., HandS3C), which is the first time to incorporate state space model into the task of hand mesh reconstruction. In the network, we design a novel state-space spatial-channel attention module that extends the effective receptive field, extracts hand features in the spatial dimension, and enhances regional features of hands in the channel dimension. This helps to reconstruct a complete and detailed hand mesh. Extensive experiments conducted on well-known datasets facing heavy occlusions (such as FREIHAND, DEXYCB, and HO3D) demonstrate that our proposed HandS3C achieves state-of-the-art performance while maintaining a minimal parameters.
title	HandS3C: 3D Hand Mesh Reconstruction with State Space Spatial Channel Attention from RGB images
topic	Computer Vision and Pattern Recognition Artificial Intelligence Human-Computer Interaction
url	https://arxiv.org/abs/2405.01066

Documents similaires