Saved in:
Bibliographic Details
Main Authors: Van der Merwe, Mark, Jha, Devesh
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2508.15021
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909746513051648
author Van der Merwe, Mark
Jha, Devesh
author_facet Van der Merwe, Mark
Jha, Devesh
contents Attention-based architectures trained on internet-scale language data have demonstrated state of the art reasoning ability for various language-based tasks, such as logic problems and textual reasoning. Additionally, these Large Language Models (LLMs) have exhibited the ability to perform few-shot prediction via in-context learning, in which input-output examples provided in the prompt are generalized to new inputs. This ability furthermore extends beyond standard language tasks, enabling few-shot learning for general patterns. In this work, we consider the application of in-context learning with pre-trained language models for dynamic manipulation. Dynamic manipulation introduces several crucial challenges, including increased dimensionality, complex dynamics, and partial observability. To address this, we take an iterative approach, and formulate our in-context learning problem to predict adjustments to a parametric policy based on previous interactions. We show across several tasks in simulation and on a physical robot that utilizing in-context learning outperforms alternative methods in the low data regime. Video summary of this work and experiments can be found https://youtu.be/2inxpdrq74U?si=dAdDYsUEr25nZvRn.
format Preprint
id arxiv_https___arxiv_org_abs_2508_15021
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle In-Context Iterative Policy Improvement for Dynamic Manipulation
Van der Merwe, Mark
Jha, Devesh
Robotics
Attention-based architectures trained on internet-scale language data have demonstrated state of the art reasoning ability for various language-based tasks, such as logic problems and textual reasoning. Additionally, these Large Language Models (LLMs) have exhibited the ability to perform few-shot prediction via in-context learning, in which input-output examples provided in the prompt are generalized to new inputs. This ability furthermore extends beyond standard language tasks, enabling few-shot learning for general patterns. In this work, we consider the application of in-context learning with pre-trained language models for dynamic manipulation. Dynamic manipulation introduces several crucial challenges, including increased dimensionality, complex dynamics, and partial observability. To address this, we take an iterative approach, and formulate our in-context learning problem to predict adjustments to a parametric policy based on previous interactions. We show across several tasks in simulation and on a physical robot that utilizing in-context learning outperforms alternative methods in the low data regime. Video summary of this work and experiments can be found https://youtu.be/2inxpdrq74U?si=dAdDYsUEr25nZvRn.
title In-Context Iterative Policy Improvement for Dynamic Manipulation
topic Robotics
url https://arxiv.org/abs/2508.15021