MARC View :: Library Catalog

Bibliographic Details
Main Authors:	Zhang, Shuoqin; Xiong, Yixin; Gao, Xiru; Liu, Kai; Wang, Ke; Zhou, Xichuan; Hu, Zhe
Format:	Preprint
Published:	2026
Subjects:	Robotics
Online Access:	https://arxiv.org/abs/2605.19924

_version_	1866916028231974912
author	Zhang, Shuoqin; Xiong, Yixin; Gao, Xiru; Liu, Kai; Wang, Ke; Zhou, Xichuan; Hu, Zhe
author_facet	Zhang, Shuoqin; Xiong, Yixin; Gao, Xiru; Liu, Kai; Wang, Ke; Zhou, Xichuan; Hu, Zhe
contents	Human-in-the-loop reinforcement learning systems achieve near-perfect success on the workstation where they are trained, but collapse when the same robot is moved to a workstation a few meters away due to shifts in the visual input distribution caused by new lamp positions and window light. Re-collecting demonstrations and re-running HIL on every workstation is incompatible with deployment, and naively fine-tuning on shifted-light data triggers catastrophic forgetting of the source workstation. To close this cross-domain gap, we present RoHIL, an offline fine-tuning framework that uses no extra real-robot interaction. RoHIL combines (i) a world-model-based image relighter that re-synthesises the visual stream of source-workstation trajectories under multiple virtual HDRI environments, leaving actions and rewards real; (ii) Illumination-Retention Replay (IRR), a data-level anti-forgetting mechanism that interleaves relit adaptation transitions with original-light retention transitions to preserve source-workstation Bellman coverage; and (iii) an anchored Bellman-actor regulariser that constrains representation and policy drift from the original source-workstation policy. Across four real-robot manipulation tasks under significant cross-workstation illumination variations, RoHIL substantially improves shifted-light performance where standard HIL-RL collapses, while preserving source-workstation performance, eliminating the need to re-collect data and retrain for every new workstation and environment. Project page: https://anonymous4365.github.io/RoHIL/
format	Preprint
id	arxiv_https___arxiv_org_abs_2605_19924
institution	arXiv
publishDate	2026
record_format	arxiv
title	RoHIL: Robust Human-in-the-Loop Robotic Reinforcement Learning Against Illumination Variations
topic	Robotics
url	https://arxiv.org/abs/2605.19924