Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Author:	Lovell, Jason
Format:	Recurso digital
Language:
Published:	Zenodo 2026
Subjects:	agent world models consequence simulation computer-use agents consequence aliasing
Online Access:	https://doi.org/10.5281/zenodo.20429921
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866902159029698560
author	Lovell, Jason
author_facet	Lovell, Jason
contents	<p>First public release of nanoAWM: a tiny, from-scratch (pure-Python, no PyTorch/NumPy) action-conditioned world model for tool-using agents, plus the MiniOS task suite it learns on.</p> <p>Honest headline: on a genuinely held-out split with object and action vocabulary disjoint from training, the learned world-model planner scores 0.524, versus 0.067 for the best non-world-model baseline and 1.000 for an oracle upper bound. The in-distribution 1.000 is a sanity check, not the result.</p> <p>Documented negative results are included by design (action-paraphrase collapse, cross-surface OOD, lexicon holdout). Code, the trained model, the dataset, the paper, and all audits are in the repository.</p>
format	Recurso digital
id	zenodo_https___doi_org_10_5281_zenodo_20429921
institution	Zenodo
language
publishDate	2026
publisher	Zenodo
record_format	zenodo
spellingShingle	nanoAWM: A Tiny Agent World Model Lab Lovell, Jason agent world models consequence simulation computer-use agents consequence aliasing <p>First public release of nanoAWM: a tiny, from-scratch (pure-Python, no PyTorch/NumPy) action-conditioned world model for tool-using agents, plus the MiniOS task suite it learns on.</p> <p>Honest headline: on a genuinely held-out split with object and action vocabulary disjoint from training, the learned world-model planner scores 0.524, versus 0.067 for the best non-world-model baseline and 1.000 for an oracle upper bound. The in-distribution 1.000 is a sanity check, not the result.</p> <p>Documented negative results are included by design (action-paraphrase collapse, cross-surface OOD, lexicon holdout). Code, the trained model, the dataset, the paper, and all audits are in the repository.</p>
title	nanoAWM: A Tiny Agent World Model Lab
topic	agent world models consequence simulation computer-use agents consequence aliasing
url	https://doi.org/10.5281/zenodo.20429921

Similar Items