Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Fujimoto, Ted, Suetterlein, Joshua, Chatterjee, Samrat, Ganguly, Auroop
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence Multiagent Systems
Online Access:	https://arxiv.org/abs/2402.03590
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909094481231872
author	Fujimoto, Ted Suetterlein, Joshua Chatterjee, Samrat Ganguly, Auroop
author_facet	Fujimoto, Ted Suetterlein, Joshua Chatterjee, Samrat Ganguly, Auroop
contents	Research in machine learning is making progress in fixing its own reproducibility crisis. Reinforcement learning (RL), in particular, faces its own set of unique challenges. Comparison of point estimates, and plots that show successful convergence to the optimal policy during training, may obfuscate overfitting or dependence on the experimental setup. Although researchers in RL have proposed reliability metrics that account for uncertainty to better understand each algorithm's strengths and weaknesses, the recommendations of past work do not assume the presence of out-of-distribution observations. We propose a set of evaluation methods that measure the robustness of RL algorithms under distribution shifts. The tools presented here argue for the need to account for performance over time while the agent is acting in its environment. In particular, we recommend time series analysis as a method of observational RL evaluation. We also show that the unique properties of RL and simulated dynamic environments allow us to make stronger assumptions to justify the measurement of causal impact in our evaluations. We then apply these tools to single-agent and multi-agent environments to show the impact of introducing distribution shifts during test time. We present this methodology as a first step toward rigorous RL evaluation in the presence of distribution shifts.
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_03590
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Assessing the Impact of Distribution Shift on Reinforcement Learning Performance Fujimoto, Ted Suetterlein, Joshua Chatterjee, Samrat Ganguly, Auroop Machine Learning Artificial Intelligence Multiagent Systems Research in machine learning is making progress in fixing its own reproducibility crisis. Reinforcement learning (RL), in particular, faces its own set of unique challenges. Comparison of point estimates, and plots that show successful convergence to the optimal policy during training, may obfuscate overfitting or dependence on the experimental setup. Although researchers in RL have proposed reliability metrics that account for uncertainty to better understand each algorithm's strengths and weaknesses, the recommendations of past work do not assume the presence of out-of-distribution observations. We propose a set of evaluation methods that measure the robustness of RL algorithms under distribution shifts. The tools presented here argue for the need to account for performance over time while the agent is acting in its environment. In particular, we recommend time series analysis as a method of observational RL evaluation. We also show that the unique properties of RL and simulated dynamic environments allow us to make stronger assumptions to justify the measurement of causal impact in our evaluations. We then apply these tools to single-agent and multi-agent environments to show the impact of introducing distribution shifts during test time. We present this methodology as a first step toward rigorous RL evaluation in the presence of distribution shifts.
title	Assessing the Impact of Distribution Shift on Reinforcement Learning Performance
topic	Machine Learning Artificial Intelligence Multiagent Systems
url	https://arxiv.org/abs/2402.03590

Similar Items