Saved in:
Bibliographic Details
Main Authors: Zhou, Peng, Li, Zhongxuan, Wu, Jinsong, Qi, Jiaming, Hu, Jun, Navarro-Alarcon, David, Pan, Jia, Xie, Lihua, Zhang, Shiyao, Zhang, Zeqing
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.01092
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Teleoperation of high-precision manipulation is con-strained by tight success tolerances and complex contact dy-namics, which make impending failures difficult for human operators to anticipate under partial observability. This paper proposes a value-guided, failure-aware framework for bimanual teleoperation that provides compliant haptic assistance while pre-serving continuous human authority. The framework is trained entirely from heterogeneous offline teleoperation data containing both successful and failed executions. Task feasibility is mod-eled as a conservative success score learned via Conservative Value Learning, yielding a risk-sensitive estimate that remains reliable under distribution shift. During online operation, the learned success score regulates the level of assistance, while a learned actor provides a corrective motion direction. Both are integrated through a joint-space impedance interface on the master side, yielding continuous guidance that steers the operator away from failure-prone actions without overriding intent. Experimental results on contact-rich manipulation tasks demonstrate improved task success rates and reduced operator workload compared to conventional teleoperation and shared-autonomy baselines, indicating that conservative value learning provides an effective mechanism for embedding failure awareness into bilateral teleoperation. Experimental videos are available at https://www.youtube.com/watch?v=XDTsvzEkDRE