Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Altabaa, Awni, Yang, Zhuoran
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2403.00993
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909211575713792
author	Altabaa, Awni Yang, Zhuoran
author_facet	Altabaa, Awni Yang, Zhuoran
contents	In a sequential decision-making problem, the information structure is the description of how events in the system occurring at different points in time affect each other. Classical models of reinforcement learning (e.g., MDPs, POMDPs) assume a simple and highly regular information structure, while more general models like predictive state representations do not explicitly model the information structure. By contrast, real-world sequential decision-making problems typically involve a complex and time-varying interdependence of system variables, requiring a rich and flexible representation of information structure. In this paper, we formalize a novel reinforcement learning model which explicitly represents the information structure. We then use this model to carry out an information-structural analysis of the statistical hardness of general sequential decision-making problems, obtaining a characterization via a graph-theoretic quantity of the DAG representation of the information structure. We prove an upper bound on the sample complexity of learning a general sequential decision-making problem in terms of its information structure by exhibiting an algorithm achieving the upper bound. This recovers known tractability results and gives a novel perspective on reinforcement learning in general sequential decision-making problems, providing a systematic way of identifying new tractable classes of problems.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_00993
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games Altabaa, Awni Yang, Zhuoran Machine Learning Artificial Intelligence In a sequential decision-making problem, the information structure is the description of how events in the system occurring at different points in time affect each other. Classical models of reinforcement learning (e.g., MDPs, POMDPs) assume a simple and highly regular information structure, while more general models like predictive state representations do not explicitly model the information structure. By contrast, real-world sequential decision-making problems typically involve a complex and time-varying interdependence of system variables, requiring a rich and flexible representation of information structure. In this paper, we formalize a novel reinforcement learning model which explicitly represents the information structure. We then use this model to carry out an information-structural analysis of the statistical hardness of general sequential decision-making problems, obtaining a characterization via a graph-theoretic quantity of the DAG representation of the information structure. We prove an upper bound on the sample complexity of learning a general sequential decision-making problem in terms of its information structure by exhibiting an algorithm achieving the upper bound. This recovers known tractability results and gives a novel perspective on reinforcement learning in general sequential decision-making problems, providing a systematic way of identifying new tractable classes of problems.
title	On the Role of Information Structure in Reinforcement Learning for Partially-Observable Sequential Teams and Games
topic	Machine Learning Artificial Intelligence
url	https://arxiv.org/abs/2403.00993

Similar Items