Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Ghriss, Ayoub, Sugiyama, Masashi, Lazaric, Alessandro
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Robotics
Online Access:	https://arxiv.org/abs/2403.10855
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911811522002944
author	Ghriss, Ayoub Sugiyama, Masashi Lazaric, Alessandro
author_facet	Ghriss, Ayoub Sugiyama, Masashi Lazaric, Alessandro
contents	The current thesis aims to explore the reinforcement learning field and build on existing methods to produce improved ones to tackle the problem of learning in high-dimensional and complex environments. It addresses such goals by decomposing learning tasks in a hierarchical fashion known as Hierarchical Reinforcement Learning. We start in the first chapter by getting familiar with the Markov Decision Process framework and presenting some of its recent techniques that the following chapters use. We then proceed to build our Hierarchical Policy learning as an answer to the limitations of a single primitive policy. The hierarchy is composed of a manager agent at the top and employee agents at the lower level. In the last chapter, which is the core of this thesis, we attempt to learn lower-level elements of the hierarchy independently of the manager level in what is known as the "Eigenoption". Based on the graph structure of the environment, Eigenoptions allow us to build agents that are aware of the geometric and dynamic properties of the environment. Their decision-making has a special property: it is invariant to symmetric transformations of the environment, allowing as a consequence to greatly reduce the complexity of the learning task.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_10855
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Reinforcement Learning with Options and State Representation Ghriss, Ayoub Sugiyama, Masashi Lazaric, Alessandro Machine Learning Robotics The current thesis aims to explore the reinforcement learning field and build on existing methods to produce improved ones to tackle the problem of learning in high-dimensional and complex environments. It addresses such goals by decomposing learning tasks in a hierarchical fashion known as Hierarchical Reinforcement Learning. We start in the first chapter by getting familiar with the Markov Decision Process framework and presenting some of its recent techniques that the following chapters use. We then proceed to build our Hierarchical Policy learning as an answer to the limitations of a single primitive policy. The hierarchy is composed of a manager agent at the top and employee agents at the lower level. In the last chapter, which is the core of this thesis, we attempt to learn lower-level elements of the hierarchy independently of the manager level in what is known as the "Eigenoption". Based on the graph structure of the environment, Eigenoptions allow us to build agents that are aware of the geometric and dynamic properties of the environment. Their decision-making has a special property: it is invariant to symmetric transformations of the environment, allowing as a consequence to greatly reduce the complexity of the learning task.
title	Reinforcement Learning with Options and State Representation
topic	Machine Learning Robotics
url	https://arxiv.org/abs/2403.10855

Similar Items