MARC21: :: Library Catalog

Salvato in:

Dettagli Bibliografici
Autori principali:	Nikulin, Alexander, Zisman, Ilya, Zemtsov, Alexey, Kurenkov, Vladislav
Natura:	Preprint
Pubblicazione:	2024
Soggetti:	Machine Learning Artificial Intelligence
Accesso online:	https://arxiv.org/abs/2406.08973
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

_version_	1866917941437530112
author	Nikulin, Alexander Zisman, Ilya Zemtsov, Alexey Kurenkov, Vladislav
author_facet	Nikulin, Alexander Zisman, Ilya Zemtsov, Alexey Kurenkov, Vladislav
contents	Following the success of the in-context learning paradigm in large-scale language and computer vision models, the recently emerging field of in-context reinforcement learning is experiencing a rapid growth. However, its development has been held back by the lack of challenging benchmarks, as all the experiments have been carried out in simple environments and on small-scale datasets. We present XLand-100B, a large-scale dataset for in-context reinforcement learning based on the XLand-MiniGrid environment, as a first step to alleviate this problem. It contains complete learning histories for nearly $30,000$ different tasks, covering $100$B transitions and 2.5B episodes. It took 50,000 GPU hours to collect the dataset, which is beyond the reach of most academic labs. Along with the dataset, we provide the utilities to reproduce or expand it even further. We also benchmark common in-context RL baselines and show that they struggle to generalize to novel and diverse tasks. With this substantial effort, we aim to democratize research in the rapidly growing field of in-context reinforcement learning and provide a solid foundation for further scaling.
format	Preprint
id	arxiv_https___arxiv_org_abs_2406_08973
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning Nikulin, Alexander Zisman, Ilya Zemtsov, Alexey Kurenkov, Vladislav Machine Learning Artificial Intelligence Following the success of the in-context learning paradigm in large-scale language and computer vision models, the recently emerging field of in-context reinforcement learning is experiencing a rapid growth. However, its development has been held back by the lack of challenging benchmarks, as all the experiments have been carried out in simple environments and on small-scale datasets. We present XLand-100B, a large-scale dataset for in-context reinforcement learning based on the XLand-MiniGrid environment, as a first step to alleviate this problem. It contains complete learning histories for nearly $30,000$ different tasks, covering $100$B transitions and 2.5B episodes. It took 50,000 GPU hours to collect the dataset, which is beyond the reach of most academic labs. Along with the dataset, we provide the utilities to reproduce or expand it even further. We also benchmark common in-context RL baselines and show that they struggle to generalize to novel and diverse tasks. With this substantial effort, we aim to democratize research in the rapidly growing field of in-context reinforcement learning and provide a solid foundation for further scaling.
title	XLand-100B: A Large-Scale Multi-Task Dataset for In-Context Reinforcement Learning
topic	Machine Learning Artificial Intelligence
url	https://arxiv.org/abs/2406.08973

Documenti analoghi