Saved in:
Bibliographic Details
Main Authors: Ying, Lance, Collins, Katherine M., Sharma, Prafull, Colas, Cedric, Zhao, Kaiya Ivy, Weller, Adrian, Tavares, Zenna, Isola, Phillip, Gershman, Samuel J., Andreas, Jacob D., Griffiths, Thomas L., Chollet, Francois, Allen, Kelsey R., Tenenbaum, Joshua B.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2507.12821
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915405004537856
author Ying, Lance
Collins, Katherine M.
Sharma, Prafull
Colas, Cedric
Zhao, Kaiya Ivy
Weller, Adrian
Tavares, Zenna
Isola, Phillip
Gershman, Samuel J.
Andreas, Jacob D.
Griffiths, Thomas L.
Chollet, Francois
Allen, Kelsey R.
Tenenbaum, Joshua B.
author_facet Ying, Lance
Collins, Katherine M.
Sharma, Prafull
Colas, Cedric
Zhao, Kaiya Ivy
Weller, Adrian
Tavares, Zenna
Isola, Phillip
Gershman, Samuel J.
Andreas, Jacob D.
Griffiths, Thomas L.
Chollet, Francois
Allen, Kelsey R.
Tenenbaum, Joshua B.
contents Human intelligence exhibits a remarkable capacity for rapid adaptation and effective problem-solving in novel and unfamiliar contexts. We argue that this profound adaptability is fundamentally linked to the efficient construction and refinement of internal representations of the environment, commonly referred to as world models, and we refer to this adaptation mechanism as world model induction. However, current understanding and evaluation of world models in artificial intelligence (AI) remains narrow, often focusing on static representations learned from training on massive corpora of data, instead of the efficiency and efficacy in learning these representations through interaction and exploration within a novel environment. In this Perspective, we provide a view of world model induction drawing on decades of research in cognitive science on how humans learn and adapt so efficiently; we then call for a new evaluation framework for assessing adaptive world models in AI. Concretely, we propose a new benchmarking paradigm based on suites of carefully designed games with genuine, deep and continually refreshing novelty in the underlying game structures -- we refer to this class of games as novel games. We detail key desiderata for constructing these games and propose appropriate metrics to explicitly challenge and evaluate the agent's ability for rapid world model induction. We hope that this new evaluation framework will inspire future evaluation efforts on world models in AI and provide a crucial step towards developing AI systems capable of human-like rapid adaptation and robust generalization -- a critical component of artificial general intelligence.
format Preprint
id arxiv_https___arxiv_org_abs_2507_12821
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Assessing Adaptive World Models in Machines with Novel Games
Ying, Lance
Collins, Katherine M.
Sharma, Prafull
Colas, Cedric
Zhao, Kaiya Ivy
Weller, Adrian
Tavares, Zenna
Isola, Phillip
Gershman, Samuel J.
Andreas, Jacob D.
Griffiths, Thomas L.
Chollet, Francois
Allen, Kelsey R.
Tenenbaum, Joshua B.
Artificial Intelligence
Machine Learning
Human intelligence exhibits a remarkable capacity for rapid adaptation and effective problem-solving in novel and unfamiliar contexts. We argue that this profound adaptability is fundamentally linked to the efficient construction and refinement of internal representations of the environment, commonly referred to as world models, and we refer to this adaptation mechanism as world model induction. However, current understanding and evaluation of world models in artificial intelligence (AI) remains narrow, often focusing on static representations learned from training on massive corpora of data, instead of the efficiency and efficacy in learning these representations through interaction and exploration within a novel environment. In this Perspective, we provide a view of world model induction drawing on decades of research in cognitive science on how humans learn and adapt so efficiently; we then call for a new evaluation framework for assessing adaptive world models in AI. Concretely, we propose a new benchmarking paradigm based on suites of carefully designed games with genuine, deep and continually refreshing novelty in the underlying game structures -- we refer to this class of games as novel games. We detail key desiderata for constructing these games and propose appropriate metrics to explicitly challenge and evaluate the agent's ability for rapid world model induction. We hope that this new evaluation framework will inspire future evaluation efforts on world models in AI and provide a crucial step towards developing AI systems capable of human-like rapid adaptation and robust generalization -- a critical component of artificial general intelligence.
title Assessing Adaptive World Models in Machines with Novel Games
topic Artificial Intelligence
Machine Learning
url https://arxiv.org/abs/2507.12821