Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Malato, Federico, Hautamaki, Ville
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2406.04913
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913381503467520
author	Malato, Federico Hautamaki, Ville
author_facet	Malato, Federico Hautamaki, Ville
contents	Imitation learning enables autonomous agents to learn from human examples, without the need for a reward signal. Still, if the provided dataset does not encapsulate the task correctly, or when the task is too complex to be modeled, such agents fail to reproduce the expert policy. We propose to recover from these failures through online adaptation. Our approach combines the action proposal coming from a pre-trained policy with relevant experience recorded by an expert. The combination results in an adapted action that closely follows the expert. Our experiments show that an adapted agent performs better than its pure imitation learning counterpart. Notably, adapted agents can achieve reasonable performance even when the base, non-adapted policy catastrophically fails.
format	Preprint
id	arxiv_https___arxiv_org_abs_2406_04913
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Online Adaptation for Enhancing Imitation Learning Policies Malato, Federico Hautamaki, Ville Artificial Intelligence Machine Learning Imitation learning enables autonomous agents to learn from human examples, without the need for a reward signal. Still, if the provided dataset does not encapsulate the task correctly, or when the task is too complex to be modeled, such agents fail to reproduce the expert policy. We propose to recover from these failures through online adaptation. Our approach combines the action proposal coming from a pre-trained policy with relevant experience recorded by an expert. The combination results in an adapted action that closely follows the expert. Our experiments show that an adapted agent performs better than its pure imitation learning counterpart. Notably, adapted agents can achieve reasonable performance even when the base, non-adapted policy catastrophically fails.
title	Online Adaptation for Enhancing Imitation Learning Policies
topic	Artificial Intelligence Machine Learning
url	https://arxiv.org/abs/2406.04913

Similar Items