Saved in:
Bibliographic Details
Main Authors: He, Ruiqi, Lieder, Falk
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2412.03111
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910726754402304
author He, Ruiqi
Lieder, Falk
author_facet He, Ruiqi
Lieder, Falk
contents One explanation for how people can plan efficiently despite limited cognitive resources is that we possess a set of adaptive planning strategies and know when and how to use them. But how are these strategies acquired? While previous research has studied how individuals learn to choose among existing strategies, little is known about the process of forming new planning strategies. In this work, we propose that new planning strategies are discovered through metacognitive reinforcement learning. To test this, we designed a novel experiment to investigate the discovery of new planning strategies. We then present metacognitive reinforcement learning models and demonstrate their capability for strategy discovery as well as show that they provide a better explanation of human strategy discovery than alternative learning mechanisms. However, when fitted to human data, these models exhibit a slower discovery rate than humans, leaving room for improvement.
format Preprint
id arxiv_https___arxiv_org_abs_2412_03111
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Experience-driven discovery of planning strategies
He, Ruiqi
Lieder, Falk
Artificial Intelligence
One explanation for how people can plan efficiently despite limited cognitive resources is that we possess a set of adaptive planning strategies and know when and how to use them. But how are these strategies acquired? While previous research has studied how individuals learn to choose among existing strategies, little is known about the process of forming new planning strategies. In this work, we propose that new planning strategies are discovered through metacognitive reinforcement learning. To test this, we designed a novel experiment to investigate the discovery of new planning strategies. We then present metacognitive reinforcement learning models and demonstrate their capability for strategy discovery as well as show that they provide a better explanation of human strategy discovery than alternative learning mechanisms. However, when fitted to human data, these models exhibit a slower discovery rate than humans, leaving room for improvement.
title Experience-driven discovery of planning strategies
topic Artificial Intelligence
url https://arxiv.org/abs/2412.03111