Saved in:
Bibliographic Details
Main Authors: Li, Runfa Blark, Kim, David, Liu, Xinshuang, Suzuki, Keito, Bhatt, Dwait, Raicevic, Nikola, Lin, Xin, Lee, Ki Myung Brian, Atanasov, Nikolay, Nguyen, Truong
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.01436
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910037659615232
author Li, Runfa Blark
Kim, David
Liu, Xinshuang
Suzuki, Keito
Bhatt, Dwait
Raicevic, Nikola
Lin, Xin
Lee, Ki Myung Brian
Atanasov, Nikolay
Nguyen, Truong
author_facet Li, Runfa Blark
Kim, David
Liu, Xinshuang
Suzuki, Keito
Bhatt, Dwait
Raicevic, Nikola
Lin, Xin
Lee, Ki Myung Brian
Atanasov, Nikolay
Nguyen, Truong
contents Bimanual dexterous manipulation for tool use remains a formidable challenge in robotics due to the high-dimensional state space and complicated contact dynamics. Existing methods naively represent the entire system state as a single configuration vector, disregarding the rich structural and topological information inherent to articulated hands. We present PhysGraph, a physically-grounded graph transformer policy designed explicitly for challenging bimanual hand-tool-object manipulation. Unlike prior works, we represent the bimanual system as a kinematic graph and introduce per-link tokenization to preserve fine-grained local state information. We propose a physically-grounded bias generator that injects structural priors directly into the attention mechanism, including kinematic spatial distance, dynamic contact states, geometric proximity, and anatomical properties. This allows the policy to explicitly reason about physical interactions rather than learning them implicitly from sparse rewards. Extensive experiments show that PhysGraph significantly outperforms baseline - ManipTrans in manipulation precision and task success rates while using only 51% of the parameters of ManipTrans. Furthermore, the inherent topological flexibility of our architecture shows qualitative zero-shot transfer to unseen tool/object geometries, and is sufficiently general to be trained on three robotic hands (Shadow, Allegro, Inspire).
format Preprint
id arxiv_https___arxiv_org_abs_2603_01436
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle PhysGraph: Physically-Grounded Graph-Transformer Policies for Bimanual Dexterous Hand-Tool-Object Manipulation
Li, Runfa Blark
Kim, David
Liu, Xinshuang
Suzuki, Keito
Bhatt, Dwait
Raicevic, Nikola
Lin, Xin
Lee, Ki Myung Brian
Atanasov, Nikolay
Nguyen, Truong
Robotics
Bimanual dexterous manipulation for tool use remains a formidable challenge in robotics due to the high-dimensional state space and complicated contact dynamics. Existing methods naively represent the entire system state as a single configuration vector, disregarding the rich structural and topological information inherent to articulated hands. We present PhysGraph, a physically-grounded graph transformer policy designed explicitly for challenging bimanual hand-tool-object manipulation. Unlike prior works, we represent the bimanual system as a kinematic graph and introduce per-link tokenization to preserve fine-grained local state information. We propose a physically-grounded bias generator that injects structural priors directly into the attention mechanism, including kinematic spatial distance, dynamic contact states, geometric proximity, and anatomical properties. This allows the policy to explicitly reason about physical interactions rather than learning them implicitly from sparse rewards. Extensive experiments show that PhysGraph significantly outperforms baseline - ManipTrans in manipulation precision and task success rates while using only 51% of the parameters of ManipTrans. Furthermore, the inherent topological flexibility of our architecture shows qualitative zero-shot transfer to unseen tool/object geometries, and is sufficiently general to be trained on three robotic hands (Shadow, Allegro, Inspire).
title PhysGraph: Physically-Grounded Graph-Transformer Policies for Bimanual Dexterous Hand-Tool-Object Manipulation
topic Robotics
url https://arxiv.org/abs/2603.01436