Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Balloch, Jonathan C., Bhagat, Rishav, Zollicoffer, Geigh, Jia, Ruoran, Kim, Julia, Riedl, Mark O.
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2404.02235
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929301546336256
author	Balloch, Jonathan C. Bhagat, Rishav Zollicoffer, Geigh Jia, Ruoran Kim, Julia Riedl, Mark O.
author_facet	Balloch, Jonathan C. Bhagat, Rishav Zollicoffer, Geigh Jia, Ruoran Kim, Julia Riedl, Mark O.
contents	In deep reinforcement learning (RL) research, there has been a concerted effort to design more efficient and productive exploration methods while solving sparse-reward problems. These exploration methods often share common principles (e.g., improving diversity) and implementation details (e.g., intrinsic reward). Prior work found that non-stationary Markov decision processes (MDPs) require exploration to efficiently adapt to changes in the environment with online transfer learning. However, the relationship between specific exploration characteristics and effective transfer learning in deep RL has not been characterized. In this work, we seek to understand the relationships between salient exploration characteristics and improved performance and efficiency in transfer learning. We test eleven popular exploration algorithms on a variety of transfer types -- or ``novelties'' -- to identify the characteristics that positively affect online transfer learning. Our analysis shows that some characteristics correlate with improved performance and efficiency across a wide range of transfer tasks, while others only improve transfer performance with respect to specific environment changes. From our analysis, make recommendations about which exploration algorithm characteristics are best suited to specific transfer situations.
format	Preprint
id	arxiv_https___arxiv_org_abs_2404_02235
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning Balloch, Jonathan C. Bhagat, Rishav Zollicoffer, Geigh Jia, Ruoran Kim, Julia Riedl, Mark O. Machine Learning Artificial Intelligence In deep reinforcement learning (RL) research, there has been a concerted effort to design more efficient and productive exploration methods while solving sparse-reward problems. These exploration methods often share common principles (e.g., improving diversity) and implementation details (e.g., intrinsic reward). Prior work found that non-stationary Markov decision processes (MDPs) require exploration to efficiently adapt to changes in the environment with online transfer learning. However, the relationship between specific exploration characteristics and effective transfer learning in deep RL has not been characterized. In this work, we seek to understand the relationships between salient exploration characteristics and improved performance and efficiency in transfer learning. We test eleven popular exploration algorithms on a variety of transfer types -- or ``novelties'' -- to identify the characteristics that positively affect online transfer learning. Our analysis shows that some characteristics correlate with improved performance and efficiency across a wide range of transfer tasks, while others only improve transfer performance with respect to specific environment changes. From our analysis, make recommendations about which exploration algorithm characteristics are best suited to specific transfer situations.
title	Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning
topic	Machine Learning Artificial Intelligence
url	https://arxiv.org/abs/2404.02235

Similar Items