Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wu, Yue, Fan, Yewen, Liang, Paul Pu, Azaria, Amos, Li, Yuanzhi, Mitchell, Tom M.
Format:	Preprint
Published:	2023
Subjects:	Machine Learning Artificial Intelligence Computation and Language
Online Access:	https://arxiv.org/abs/2302.04449
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914878039523328
author	Wu, Yue Fan, Yewen Liang, Paul Pu Azaria, Amos Li, Yuanzhi Mitchell, Tom M.
author_facet	Wu, Yue Fan, Yewen Liang, Paul Pu Azaria, Amos Li, Yuanzhi Mitchell, Tom M.
contents	High sample complexity has long been a challenge for RL. On the other hand, humans learn to perform tasks not only from interaction or demonstrations, but also by reading unstructured text documents, e.g., instruction manuals. Instruction manuals and wiki pages are among the most abundant data that could inform agents of valuable features and policies or task-specific environmental dynamics and reward structures. Therefore, we hypothesize that the ability to utilize human-written instruction manuals to assist learning policies for specific tasks should lead to a more efficient and better-performing agent. We propose the Read and Reward framework. Read and Reward speeds up RL algorithms on Atari games by reading manuals released by the Atari game developers. Our framework consists of a QA Extraction module that extracts and summarizes relevant information from the manual and a Reasoning module that evaluates object-agent interactions based on information from the manual. An auxiliary reward is then provided to a standard A2C RL agent, when interaction is detected. Experimentally, various RL algorithms obtain significant improvement in performance and training speed when assisted by our design.
format	Preprint
id	arxiv_https___arxiv_org_abs_2302_04449
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals Wu, Yue Fan, Yewen Liang, Paul Pu Azaria, Amos Li, Yuanzhi Mitchell, Tom M. Machine Learning Artificial Intelligence Computation and Language High sample complexity has long been a challenge for RL. On the other hand, humans learn to perform tasks not only from interaction or demonstrations, but also by reading unstructured text documents, e.g., instruction manuals. Instruction manuals and wiki pages are among the most abundant data that could inform agents of valuable features and policies or task-specific environmental dynamics and reward structures. Therefore, we hypothesize that the ability to utilize human-written instruction manuals to assist learning policies for specific tasks should lead to a more efficient and better-performing agent. We propose the Read and Reward framework. Read and Reward speeds up RL algorithms on Atari games by reading manuals released by the Atari game developers. Our framework consists of a QA Extraction module that extracts and summarizes relevant information from the manual and a Reasoning module that evaluates object-agent interactions based on information from the manual. An auxiliary reward is then provided to a standard A2C RL agent, when interaction is detected. Experimentally, various RL algorithms obtain significant improvement in performance and training speed when assisted by our design.
title	Read and Reap the Rewards: Learning to Play Atari with the Help of Instruction Manuals
topic	Machine Learning Artificial Intelligence Computation and Language
url	https://arxiv.org/abs/2302.04449

Similar Items