Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Lee, Joongkyu, Park, Seung Joon, Tang, Yunhao, Oh, Min-hwan
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2402.05439
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911773540483072
author	Lee, Joongkyu Park, Seung Joon Tang, Yunhao Oh, Min-hwan
author_facet	Lee, Joongkyu Park, Seung Joon Tang, Yunhao Oh, Min-hwan
contents	In reinforcement learning, temporal abstraction in the action space, exemplified by action repetition, is a technique to facilitate policy learning through extended actions. However, a primary limitation in previous studies of action repetition is its potential to degrade performance, particularly when sub-optimal actions are repeated. This issue often negates the advantages of action repetition. To address this, we propose a novel algorithm named Uncertainty-aware Temporal Extension (UTE). UTE employs ensemble methods to accurately measure uncertainty during action extension. This feature allows policies to strategically choose between emphasizing exploration or adopting an uncertainty-averse approach, tailored to their specific needs. We demonstrate the effectiveness of UTE through experiments in Gridworld and Atari 2600 environments. Our findings show that UTE outperforms existing action repetition algorithms, effectively mitigating their inherent limitations and significantly enhancing policy learning efficiency.
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_05439
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Learning Uncertainty-Aware Temporally-Extended Actions Lee, Joongkyu Park, Seung Joon Tang, Yunhao Oh, Min-hwan Machine Learning In reinforcement learning, temporal abstraction in the action space, exemplified by action repetition, is a technique to facilitate policy learning through extended actions. However, a primary limitation in previous studies of action repetition is its potential to degrade performance, particularly when sub-optimal actions are repeated. This issue often negates the advantages of action repetition. To address this, we propose a novel algorithm named Uncertainty-aware Temporal Extension (UTE). UTE employs ensemble methods to accurately measure uncertainty during action extension. This feature allows policies to strategically choose between emphasizing exploration or adopting an uncertainty-averse approach, tailored to their specific needs. We demonstrate the effectiveness of UTE through experiments in Gridworld and Atari 2600 environments. Our findings show that UTE outperforms existing action repetition algorithms, effectively mitigating their inherent limitations and significantly enhancing policy learning efficiency.
title	Learning Uncertainty-Aware Temporally-Extended Actions
topic	Machine Learning
url	https://arxiv.org/abs/2402.05439

Similar Items