Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Han, Tyler, Shen, Siyang, Baijal, Rohan, Ravichandiran, Harine, Nemekhbold, Bat, Huang, Kevin, Jung, Sanghun, Boots, Byron
Format:	Preprint
Published:	2026
Subjects:	Robotics
Online Access:	https://arxiv.org/abs/2602.24121
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910035346456576
author	Han, Tyler Shen, Siyang Baijal, Rohan Ravichandiran, Harine Nemekhbold, Bat Huang, Kevin Jung, Sanghun Boots, Byron
author_facet	Han, Tyler Shen, Siyang Baijal, Rohan Ravichandiran, Harine Nemekhbold, Bat Huang, Kevin Jung, Sanghun Boots, Byron
contents	Observational learning requires an agent to learn to perform a task by referencing only observations of the performed task. This work investigates the equivalent setting in real-world robot learning where access to hand-designed rewards and demonstrator actions are not assumed. To address this data-constrained setting, this work presents a planning-based Inverse Reinforcement Learning (IRL) algorithm for world modeling from observation and interaction alone. Experiments conducted entirely in the real-world demonstrate that this paradigm is effective for learning image-based manipulation tasks from scratch in under an hour, without assuming prior knowledge, pre-training, or data of any kind beyond task observations. Moreover, this work demonstrates that the learned world model representation is capable of online transfer learning in the real-world from scratch. In comparison to existing approaches, including IRL, RL, and Behavior Cloning (BC), which have more restrictive assumptions, the proposed approach demonstrates significantly greater sample efficiency and success rates, enabling a practical path forward for online world modeling and planning from observation and interaction. Videos and more at: https://uwrobotlearning.github.io/mpail2/.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_24121
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Planning from Observation and Interaction Han, Tyler Shen, Siyang Baijal, Rohan Ravichandiran, Harine Nemekhbold, Bat Huang, Kevin Jung, Sanghun Boots, Byron Robotics Observational learning requires an agent to learn to perform a task by referencing only observations of the performed task. This work investigates the equivalent setting in real-world robot learning where access to hand-designed rewards and demonstrator actions are not assumed. To address this data-constrained setting, this work presents a planning-based Inverse Reinforcement Learning (IRL) algorithm for world modeling from observation and interaction alone. Experiments conducted entirely in the real-world demonstrate that this paradigm is effective for learning image-based manipulation tasks from scratch in under an hour, without assuming prior knowledge, pre-training, or data of any kind beyond task observations. Moreover, this work demonstrates that the learned world model representation is capable of online transfer learning in the real-world from scratch. In comparison to existing approaches, including IRL, RL, and Behavior Cloning (BC), which have more restrictive assumptions, the proposed approach demonstrates significantly greater sample efficiency and success rates, enabling a practical path forward for online world modeling and planning from observation and interaction. Videos and more at: https://uwrobotlearning.github.io/mpail2/.
title	Planning from Observation and Interaction
topic	Robotics
url	https://arxiv.org/abs/2602.24121

Similar Items