Guardado en:
Detalles Bibliográficos
Autores principales: Xu, Miao, Zhu, Xiangyu, Liang, Xusheng, Wang, Zidu, Wu, Jinlin, Lei, Zhen
Formato: Preprint
Publicado: 2025
Materias:
Acceso en línea:https://arxiv.org/abs/2509.03114
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
_version_ 1866909768492253184
author Xu, Miao
Zhu, Xiangyu
Liang, Xusheng
Wang, Zidu
Wu, Jinlin
Lei, Zhen
author_facet Xu, Miao
Zhu, Xiangyu
Liang, Xusheng
Wang, Zidu
Wu, Jinlin
Lei, Zhen
contents Existing reconstruction or hand-object pose estimation methods are capable of producing coarse interaction states. However, due to the complex and diverse geometry of both human hands and objects, these approaches often suffer from interpenetration or leave noticeable gaps in regions that are supposed to be in contact. Moreover, the surface of a real human hand undergoes non-negligible deformations during interaction, which are difficult to capture and represent with previous methods. To tackle these challenges, we formulate hand-object interaction as an attraction-driven process and propose a Gravity-Field Based Diffusion Bridge (GravityDB) to simulate interactions between a deformable hand surface and rigid objects. Our approach effectively resolves the aforementioned issues by generating physically plausible interactions that are free of interpenetration, ensure stable grasping, and capture realistic hand deformations. Furthermore, we incorporate semantic information from textual descriptions to guide the construction of the gravitational field, enabling more semantically meaningful interaction regions. Extensive qualitative and quantitative experiments on multiple datasets demonstrate the effectiveness of our method.
format Preprint
id arxiv_https___arxiv_org_abs_2509_03114
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Towards Realistic Hand-Object Interaction with Gravity-Field Based Diffusion Bridge
Xu, Miao
Zhu, Xiangyu
Liang, Xusheng
Wang, Zidu
Wu, Jinlin
Lei, Zhen
Computer Vision and Pattern Recognition
Existing reconstruction or hand-object pose estimation methods are capable of producing coarse interaction states. However, due to the complex and diverse geometry of both human hands and objects, these approaches often suffer from interpenetration or leave noticeable gaps in regions that are supposed to be in contact. Moreover, the surface of a real human hand undergoes non-negligible deformations during interaction, which are difficult to capture and represent with previous methods. To tackle these challenges, we formulate hand-object interaction as an attraction-driven process and propose a Gravity-Field Based Diffusion Bridge (GravityDB) to simulate interactions between a deformable hand surface and rigid objects. Our approach effectively resolves the aforementioned issues by generating physically plausible interactions that are free of interpenetration, ensure stable grasping, and capture realistic hand deformations. Furthermore, we incorporate semantic information from textual descriptions to guide the construction of the gravitational field, enabling more semantically meaningful interaction regions. Extensive qualitative and quantitative experiments on multiple datasets demonstrate the effectiveness of our method.
title Towards Realistic Hand-Object Interaction with Gravity-Field Based Diffusion Bridge
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2509.03114