Saved in:
| Main Authors: | Fonseca, Yesid, Ríos, Manuel S., Quijano, Nicanor, Giraldo, Luis F. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.12548 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Can LLM-Augmented autonomous agents cooperate?, An evaluation of their cooperative capabilities through Melting Pot
by: Mosquera, Manuel, et al.
Published: (2024)
by: Mosquera, Manuel, et al.
Published: (2024)
Learning Incentive Structures for Cooperative Resilience in Multi-Agent Systems under Social Dilemmas
by: Chacon-Chamorro, Manuela, et al.
Published: (2026)
by: Chacon-Chamorro, Manuela, et al.
Published: (2026)
RLVR-World: Training World Models with Reinforcement Learning
by: Wu, Jialong, et al.
Published: (2025)
by: Wu, Jialong, et al.
Published: (2025)
Cooperative Resilience in Artificial Intelligence Multiagent Systems
by: Chacon-Chamorro, Manuela, et al.
Published: (2024)
by: Chacon-Chamorro, Manuela, et al.
Published: (2024)
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
by: Wang, Zhaoyang, et al.
Published: (2026)
by: Wang, Zhaoyang, et al.
Published: (2026)
Network Sparsity Unlocks the Scaling Potential of Deep Reinforcement Learning
by: Ma, Guozheng, et al.
Published: (2025)
by: Ma, Guozheng, et al.
Published: (2025)
FutureWorld: A Live Reinforcement Learning Environment for Predictive Agents with Real-World Outcome Rewards
by: Han, Zhixin, et al.
Published: (2026)
by: Han, Zhixin, et al.
Published: (2026)
DreamGen: Unlocking Generalization in Robot Learning through Video World Models
by: Jang, Joel, et al.
Published: (2025)
by: Jang, Joel, et al.
Published: (2025)
MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning
by: Yuan, Yifu, et al.
Published: (2024)
by: Yuan, Yifu, et al.
Published: (2024)
In-Context Reinforcement Learning via Communicative World Models
by: Martinez-Lopez, Fernando, et al.
Published: (2025)
by: Martinez-Lopez, Fernando, et al.
Published: (2025)
Continual Reinforcement Learning by Planning with Online World Models
by: Liu, Zichen, et al.
Published: (2025)
by: Liu, Zichen, et al.
Published: (2025)
Hierarchical Reinforcement Learning for Optimal Agent Grouping in Cooperative Systems
by: Hu, Liyuan
Published: (2025)
by: Hu, Liyuan
Published: (2025)
Object-Centric World Models for Causality-Aware Reinforcement Learning
by: Nishimoto, Yosuke, et al.
Published: (2025)
by: Nishimoto, Yosuke, et al.
Published: (2025)
Reinforcement Learning from Delayed Observations via World Models
by: Karamzade, Armin, et al.
Published: (2024)
by: Karamzade, Armin, et al.
Published: (2024)
Enhancing Vision-Language Model Training with Reinforcement Learning in Synthetic Worlds for Real-World Success
by: Bredis, George, et al.
Published: (2025)
by: Bredis, George, et al.
Published: (2025)
Market Making Strategies with Reinforcement Learning
by: Vicente, Óscar Fernández
Published: (2025)
by: Vicente, Óscar Fernández
Published: (2025)
Reasoner for Real-World Event Detection: Scaling Reinforcement Learning via Adaptive Perplexity-Aware Sampling Strategy
by: Zhang, Xiaoyun, et al.
Published: (2025)
by: Zhang, Xiaoyun, et al.
Published: (2025)
Partial Models for Building Adaptive Model-Based Reinforcement Learning Agents
by: Alver, Safa, et al.
Published: (2024)
by: Alver, Safa, et al.
Published: (2024)
From Observations to Events: Event-Aware World Model for Reinforcement Learning
by: Peng, Zhao-Han, et al.
Published: (2026)
by: Peng, Zhao-Han, et al.
Published: (2026)
Novelty Detection in Reinforcement Learning with World Models
by: Zollicoffer, Geigh, et al.
Published: (2023)
by: Zollicoffer, Geigh, et al.
Published: (2023)
Is Optimal Transport Necessary for Inverse Reinforcement Learning?
by: Dong, Zixuan, et al.
Published: (2025)
by: Dong, Zixuan, et al.
Published: (2025)
Unlocking Reasoning Capabilities in LLMs via Reinforcement Learning Exploration
by: Deng, Wenhao, et al.
Published: (2025)
by: Deng, Wenhao, et al.
Published: (2025)
Learning Optimal Defender Strategies for CAGE-2 using a POMDP Model
by: Le, Duc Huy, et al.
Published: (2025)
by: Le, Duc Huy, et al.
Published: (2025)
POWQMIX: Weighted Value Factorization with Potentially Optimal Joint Actions Recognition for Cooperative Multi-Agent Reinforcement Learning
by: Huang, Chang, et al.
Published: (2024)
by: Huang, Chang, et al.
Published: (2024)
Reinforcement Learning for Machine Learning Engineering Agents
by: Yang, Sherry, et al.
Published: (2025)
by: Yang, Sherry, et al.
Published: (2025)
Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning
by: Chen, Jiayu, et al.
Published: (2025)
by: Chen, Jiayu, et al.
Published: (2025)
Program-Based Strategy Induction for Reinforcement Learning
by: Correa, Carlos G., et al.
Published: (2024)
by: Correa, Carlos G., et al.
Published: (2024)
$Agent^2$: An Agent-Generates-Agent Framework for Reinforcement Learning Automation
by: Wei, Yuan, et al.
Published: (2025)
by: Wei, Yuan, et al.
Published: (2025)
Beyond Token-level Supervision: Unlocking the Potential of Decoding-based Regression via Reinforcement Learning
by: Chen, Ming, et al.
Published: (2025)
by: Chen, Ming, et al.
Published: (2025)
External Model Motivated Agents: Reinforcement Learning for Enhanced Environment Sampling
by: Bhagat, Rishav, et al.
Published: (2024)
by: Bhagat, Rishav, et al.
Published: (2024)
World Modelling Improves Language Model Agents
by: Guo, Shangmin, et al.
Published: (2025)
by: Guo, Shangmin, et al.
Published: (2025)
Agent Lightning: Train ANY AI Agents with Reinforcement Learning
by: Luo, Xufang, et al.
Published: (2025)
by: Luo, Xufang, et al.
Published: (2025)
An Optimal Discriminator Weighted Imitation Perspective for Reinforcement Learning
by: Xu, Haoran, et al.
Published: (2025)
by: Xu, Haoran, et al.
Published: (2025)
Realizable Abstractions: Near-Optimal Hierarchical Reinforcement Learning
by: Cipollone, Roberto, et al.
Published: (2025)
by: Cipollone, Roberto, et al.
Published: (2025)
Kernelized Reinforcement Learning with Order Optimal Regret Bounds
by: Vakili, Sattar, et al.
Published: (2023)
by: Vakili, Sattar, et al.
Published: (2023)
Spatiotemporal Forecasting as Planning: A Model-Based Reinforcement Learning Approach with Generative World Models
by: Wu, Hao, et al.
Published: (2025)
by: Wu, Hao, et al.
Published: (2025)
Diffusion World Model: Future Modeling Beyond Step-by-Step Rollout for Offline Reinforcement Learning
by: Ding, Zihan, et al.
Published: (2024)
by: Ding, Zihan, et al.
Published: (2024)
Safe Reinforcement Learning for Real-World Engine Control
by: Bedei, Julian, et al.
Published: (2025)
by: Bedei, Julian, et al.
Published: (2025)
Puzzle it Out: Local-to-Global World Model for Offline Multi-Agent Reinforcement Learning
by: Li, Sijia, et al.
Published: (2026)
by: Li, Sijia, et al.
Published: (2026)
Value-Distributional Model-Based Reinforcement Learning
by: Luis, Carlos E., et al.
Published: (2023)
by: Luis, Carlos E., et al.
Published: (2023)
Similar Items
-
Can LLM-Augmented autonomous agents cooperate?, An evaluation of their cooperative capabilities through Melting Pot
by: Mosquera, Manuel, et al.
Published: (2024) -
Learning Incentive Structures for Cooperative Resilience in Multi-Agent Systems under Social Dilemmas
by: Chacon-Chamorro, Manuela, et al.
Published: (2026) -
RLVR-World: Training World Models with Reinforcement Learning
by: Wu, Jialong, et al.
Published: (2025) -
Cooperative Resilience in Artificial Intelligence Multiagent Systems
by: Chacon-Chamorro, Manuela, et al.
Published: (2024) -
Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning
by: Wang, Zhaoyang, et al.
Published: (2026)