Staff View :: Library Catalog

Bibliographic Details
Main Authors:	Markgraf, Hannah; Eichelbeck, Michael; Cappey, Daria; Demirtürk, Selin; Schattschneider, Yara; Althoff, Matthias
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2505.16754

_version_	1866910963964313600
author	Markgraf, Hannah; Eichelbeck, Michael; Cappey, Daria; Demirtürk, Selin; Schattschneider, Yara; Althoff, Matthias
author_facet	Markgraf, Hannah; Eichelbeck, Michael; Cappey, Daria; Demirtürk, Selin; Schattschneider, Yara; Althoff, Matthias
contents	Offline reinforcement learning (RL) has gained traction as a powerful paradigm for learning control policies from pre-collected data, eliminating the need for costly or risky online interactions. While many open-source libraries offer robust implementations of offline RL algorithms, they all rely on datasets composed of experience tuples consisting of state, action, next state, and reward. Managing, curating, and distributing such datasets requires suitable infrastructure. Although static datasets exist for established benchmark problems, no standardized or scalable solution supports developing and sharing datasets for novel or user-defined benchmarks. To address this gap, we introduce PyTupli, a Python-based tool to streamline the creation, storage, and dissemination of benchmark environments and their corresponding tuple datasets. PyTupli includes a lightweight client library with defined interfaces for uploading and retrieving benchmarks and data. It supports fine-grained filtering at both the episode and tuple level, allowing researchers to curate high-quality, task-specific datasets. A containerized server component enables production-ready deployment with authentication, access control, and automated certificate provisioning for secure use. By addressing key barriers in dataset infrastructure, PyTupli facilitates more collaborative, reproducible, and scalable offline RL research.
format	Preprint
id	arxiv_https___arxiv_org_abs_2505_16754
institution	arXiv
publishDate	2025
record_format	arxiv
title	PyTupli: A Scalable Infrastructure for Collaborative Offline Reinforcement Learning Projects
topic	Machine Learning
url	https://arxiv.org/abs/2505.16754
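
The abstract in the contents field describes datasets made of experience tuples (state, action, next state, reward) and filtering at both the episode and the tuple level. The sketch below illustrates that two-level idea in plain Python only. It does not use or reproduce PyTupli's actual API: the names ExperienceTuple, Episode, and filter_dataset, the scalar state type, and the reward-based predicates are all hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class ExperienceTuple:
    """One transition: state, action, next state, reward (hypothetical layout)."""
    state: float
    action: int
    next_state: float
    reward: float


@dataclass
class Episode:
    """An ordered collection of transitions (hypothetical container)."""
    tuples: List[ExperienceTuple]

    def total_reward(self) -> float:
        return sum(t.reward for t in self.tuples)


def filter_dataset(
    episodes: List[Episode],
    episode_pred: Callable[[Episode], bool],
    tuple_pred: Callable[[ExperienceTuple], bool],
) -> List[Episode]:
    """Keep episodes that pass episode_pred, then keep only the tuples
    inside them that pass tuple_pred. Episodes left empty are dropped."""
    kept = []
    for ep in episodes:
        if not episode_pred(ep):
            continue  # episode-level filter
        remaining = [t for t in ep.tuples if tuple_pred(t)]  # tuple-level filter
        if remaining:
            kept.append(Episode(remaining))
    return kept


# Toy data, invented for this illustration.
episodes = [
    Episode([
        ExperienceTuple(0.0, 1, 0.5, 1.0),
        ExperienceTuple(0.5, 0, 0.4, -0.5),
    ]),
    Episode([ExperienceTuple(0.0, 0, -0.1, -1.0)]),
]

curated = filter_dataset(
    episodes,
    episode_pred=lambda ep: ep.total_reward() > 0,  # keep episodes with positive return
    tuple_pred=lambda t: t.reward >= 0,             # keep non-negative-reward transitions
)
```

In this toy run the second episode is rejected at the episode level, and the negative-reward transition of the first episode is rejected at the tuple level, so one episode holding one transition remains.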
