Saved in:
| Main Authors: | Yan, John, Yu, Michael, Sun, Yuqi, Duffy, Alexander, Marques, Tyler, Olson, Matthew Lyle |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.05183 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks
by: Wu, Xiyang, et al.
Published: (2026)
by: Wu, Xiyang, et al.
Published: (2026)
Democratizing Diplomacy: A Harness for Evaluating Any Large Language Model on Full-Press Diplomacy
by: Duffy, Alexander, et al.
Published: (2025)
by: Duffy, Alexander, et al.
Published: (2025)
Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning
by: Li, Yu, et al.
Published: (2025)
by: Li, Yu, et al.
Published: (2025)
LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions
by: Sun, Chuanneng, et al.
Published: (2024)
by: Sun, Chuanneng, et al.
Published: (2024)
Data Interpreter: An LLM Agent For Data Science
by: Hong, Sirui, et al.
Published: (2024)
by: Hong, Sirui, et al.
Published: (2024)
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments
by: Su, Hongjin, et al.
Published: (2025)
by: Su, Hongjin, et al.
Published: (2025)
Interpretability by Design for Efficient Multi-Objective Reinforcement Learning
by: Xia, Qiyue, et al.
Published: (2025)
by: Xia, Qiyue, et al.
Published: (2025)
Interpretable Failure Analysis in Multi-Agent Reinforcement Learning Systems
by: Shefin, Risal Shahriar, et al.
Published: (2026)
by: Shefin, Risal Shahriar, et al.
Published: (2026)
CARL: Causality-guided Architecture Representation Learning for an Interpretable Performance Predictor
by: Ji, Han, et al.
Published: (2025)
by: Ji, Han, et al.
Published: (2025)
An Extremely Data-efficient and Generative LLM-based Reinforcement Learning Agent for Recommenders
by: Feng, Shuang, et al.
Published: (2024)
by: Feng, Shuang, et al.
Published: (2024)
Decision-Centric Design for LLM Systems
by: Sun, Wei
Published: (2026)
by: Sun, Wei
Published: (2026)
Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems
by: Feng, Lang, et al.
Published: (2026)
by: Feng, Lang, et al.
Published: (2026)
SHADE-Arena: Evaluating Sabotage and Monitoring in LLM Agents
by: Kutasov, Jonathan, et al.
Published: (2025)
by: Kutasov, Jonathan, et al.
Published: (2025)
PDRL: Multi-Agent based Reinforcement Learning for Predictive Monitoring
by: Shaik, Thanveer, et al.
Published: (2023)
by: Shaik, Thanveer, et al.
Published: (2023)
LLM-Guided Communication for Cooperative Multi-Agent Reinforcement Learning
by: Bae, Sangjun, et al.
Published: (2026)
by: Bae, Sangjun, et al.
Published: (2026)
Coordination Matters: Evaluation of Cooperative Multi-Agent Reinforcement Learning
by: Cardei, Maria Ana, et al.
Published: (2026)
by: Cardei, Maria Ana, et al.
Published: (2026)
Tree Search for LLM Agent Reinforcement Learning
by: Ji, Yuxiang, et al.
Published: (2025)
by: Ji, Yuxiang, et al.
Published: (2025)
Hierarchical Reinforcement Learning with Augmented Step-Level Transitions for LLM Agents
by: Zhen, Shuai, et al.
Published: (2026)
by: Zhen, Shuai, et al.
Published: (2026)
From Data-Centric to Sample-Centric: Enhancing LLM Reasoning via Progressive Optimization
by: Chen, Xinjie, et al.
Published: (2025)
by: Chen, Xinjie, et al.
Published: (2025)
DSAI: Unbiased and Interpretable Latent Feature Extraction for Data-Centric AI
by: Cho, Hyowon, et al.
Published: (2024)
by: Cho, Hyowon, et al.
Published: (2024)
UserRL: Training Interactive User-Centric Agent via Reinforcement Learning
by: Qian, Cheng, et al.
Published: (2025)
by: Qian, Cheng, et al.
Published: (2025)
Reinforcement Learning for Long-Horizon Interactive LLM Agents
by: Chen, Kevin, et al.
Published: (2025)
by: Chen, Kevin, et al.
Published: (2025)
Putting Data at the Centre of Offline Multi-Agent Reinforcement Learning
by: Formanek, Claude, et al.
Published: (2024)
by: Formanek, Claude, et al.
Published: (2024)
A Survey on Data-Centric AI: Tabular Learning from Reinforcement Learning and Generative AI Perspective
by: Ying, Wangyang, et al.
Published: (2025)
by: Ying, Wangyang, et al.
Published: (2025)
BLAST: A Stealthy Backdoor Leverage Attack against Cooperative Multi-Agent Deep Reinforcement Learning based Systems
by: Fang, Jing, et al.
Published: (2025)
by: Fang, Jing, et al.
Published: (2025)
LLM-Guided Reinforcement Learning: Addressing Training Bottlenecks through Policy Modulation
by: Tan, Heng, et al.
Published: (2025)
by: Tan, Heng, et al.
Published: (2025)
Towards Unified Attribution in Explainable AI, Data-Centric AI, and Mechanistic Interpretability
by: Zhang, Shichang, et al.
Published: (2025)
by: Zhang, Shichang, et al.
Published: (2025)
RAGEN: Understanding Self-Evolution in LLM Agents via Multi-Turn Reinforcement Learning
by: Wang, Zihan, et al.
Published: (2025)
by: Wang, Zihan, et al.
Published: (2025)
Deep Reinforcement Learning via Object-Centric Attention
by: Blüml, Jannis, et al.
Published: (2025)
by: Blüml, Jannis, et al.
Published: (2025)
MARLIN: Multi-Agent Reinforcement Learning for Incremental DAG Discovery
by: Li, Dong, et al.
Published: (2026)
by: Li, Dong, et al.
Published: (2026)
Turn-based Multi-Agent Reinforcement Learning Model Checking
by: Gross, Dennis
Published: (2025)
by: Gross, Dennis
Published: (2025)
Trust-based Consensus in Multi-Agent Reinforcement Learning Systems
by: Fung, Ho Long, et al.
Published: (2022)
by: Fung, Ho Long, et al.
Published: (2022)
PepThink-R1: LLM for Interpretable Cyclic Peptide Optimization with CoT SFT and Reinforcement Learning
by: Wang, Ruheng, et al.
Published: (2025)
by: Wang, Ruheng, et al.
Published: (2025)
Multi-Agent Reinforcement Learning with a Hierarchy of Reward Machines
by: Zheng, Xuejing, et al.
Published: (2024)
by: Zheng, Xuejing, et al.
Published: (2024)
Reinforce LLM Reasoning through Multi-Agent Reflection
by: Yuan, Yurun, et al.
Published: (2025)
by: Yuan, Yurun, et al.
Published: (2025)
SIRI: Self-Internalizing Reinforcement Learning with Intrinsic Skills for LLM Agent Training
by: He, Zhongyu, et al.
Published: (2026)
by: He, Zhongyu, et al.
Published: (2026)
Object-Centric World Models for Causality-Aware Reinforcement Learning
by: Nishimoto, Yosuke, et al.
Published: (2025)
by: Nishimoto, Yosuke, et al.
Published: (2025)
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers
by: Yan, Kai, et al.
Published: (2024)
by: Yan, Kai, et al.
Published: (2024)
Offline Reinforcement Learning: Role of State Aggregation and Trajectory Data
by: Jia, Zeyu, et al.
Published: (2024)
by: Jia, Zeyu, et al.
Published: (2024)
A Medical Data-Effective Learning Benchmark for Highly Efficient Pre-training of Foundation Models
by: Yang, Wenxuan, et al.
Published: (2024)
by: Yang, Wenxuan, et al.
Published: (2024)
Similar Items
-
Co-Evolving LLM Decision and Skill Bank Agents for Long-Horizon Tasks
by: Wu, Xiyang, et al.
Published: (2026) -
Democratizing Diplomacy: A Harness for Evaluating Any Large Language Model on Full-Press Diplomacy
by: Duffy, Alexander, et al.
Published: (2025) -
Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning
by: Li, Yu, et al.
Published: (2025) -
LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions
by: Sun, Chuanneng, et al.
Published: (2024) -
Data Interpreter: An LLM Agent For Data Science
by: Hong, Sirui, et al.
Published: (2024)