Saved in:
| Main Authors: | Zhang, Ruize, Xu, Zelai, Ma, Chengdong, Yu, Chao, Tu, Wei-Wei, Tang, Wenhao, Huang, Shiyu, Ye, Deheng, Ding, Wenbo, Yang, Yaodong, Wang, Yu |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.01072 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning
by: Zhang, Ruize, et al.
Published: (2025)
by: Zhang, Ruize, et al.
Published: (2025)
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning
by: Xu, Zelai, et al.
Published: (2026)
by: Xu, Zelai, et al.
Published: (2026)
VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play
by: Xu, Zelai, et al.
Published: (2025)
by: Xu, Zelai, et al.
Published: (2025)
Goal Discovery with Causal Capacity for Efficient Reinforcement Learning
by: Yu, Yan, et al.
Published: (2025)
by: Yu, Yan, et al.
Published: (2025)
Towards Efficient Collaboration via Graph Modeling in Reinforcement Learning
by: Fan, Wenzhe, et al.
Published: (2024)
by: Fan, Wenzhe, et al.
Published: (2024)
Language Agents with Reinforcement Learning for Strategic Play in the Werewolf Game
by: Xu, Zelai, et al.
Published: (2023)
by: Xu, Zelai, et al.
Published: (2023)
AED: Automatic Discovery of Effective and Diverse Vulnerabilities for Autonomous Driving Policy with Large Language Models
by: Qiu, Le, et al.
Published: (2025)
by: Qiu, Le, et al.
Published: (2025)
Game-Theoretic Multiagent Reinforcement Learning
by: Yang, Yaodong, et al.
Published: (2020)
by: Yang, Yaodong, et al.
Published: (2020)
Accelerating Robotic Reinforcement Learning with Agent Guidance
by: Chen, Haojun, et al.
Published: (2026)
by: Chen, Haojun, et al.
Published: (2026)
Learning Strategic Language Agents in the Werewolf Game with Iterative Latent Space Policy Optimization
by: Xu, Zelai, et al.
Published: (2025)
by: Xu, Zelai, et al.
Published: (2025)
H$^2$R: Hierarchical Hindsight Reflection for Multi-Task LLM Agents
by: Ye, Shicheng, et al.
Published: (2025)
by: Ye, Shicheng, et al.
Published: (2025)
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control
by: Xu, Botian, et al.
Published: (2023)
by: Xu, Botian, et al.
Published: (2023)
Roadmap on Incentive Compatibility for AI Alignment and Governance in Sociotechnical Systems
by: Zhang, Zhaowei, et al.
Published: (2024)
by: Zhang, Zhaowei, et al.
Published: (2024)
EARL: Efficient Agentic Reinforcement Learning Systems for Large Language Models
by: Tan, Zheyue, et al.
Published: (2025)
by: Tan, Zheyue, et al.
Published: (2025)
MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation
by: Yang, Lu, et al.
Published: (2026)
by: Yang, Lu, et al.
Published: (2026)
Finding Kissing Numbers with Game-theoretic Reinforcement Learning
by: Ma, Chengdong, et al.
Published: (2025)
by: Ma, Chengdong, et al.
Published: (2025)
JuggleRL: Mastering Ball Juggling with a Quadrotor via Deep Reinforcement Learning
by: Ji, Shilong, et al.
Published: (2025)
by: Ji, Shilong, et al.
Published: (2025)
Refining Few-Step Text-to-Multiview Diffusion via Reinforcement Learning
by: Zhang, Ziyi, et al.
Published: (2025)
by: Zhang, Ziyi, et al.
Published: (2025)
Sample-Efficient Regret-Minimizing Double Oracle in Extensive-Form Games
by: Tang, Xiaohang, et al.
Published: (2024)
by: Tang, Xiaohang, et al.
Published: (2024)
Bidirectional Soft Actor-Critic: Leveraging Forward and Reverse KL Divergence for Efficient Reinforcement Learning
by: Zhang, Yixian, et al.
Published: (2025)
by: Zhang, Yixian, et al.
Published: (2025)
Adaptive Coarse-to-Fine Subgoal Refinement for Long-Horizon Offline Goal-Conditioned Reinforcement Learning
by: Ke, Kaiqiang, et al.
Published: (2026)
by: Ke, Kaiqiang, et al.
Published: (2026)
A Comprehensive Survey on Self-Supervised Learning for Recommendation
by: Ren, Xubin, et al.
Published: (2024)
by: Ren, Xubin, et al.
Published: (2024)
Improving Sample Efficiency of Reinforcement Learning with Background Knowledge from Large Language Models
by: Zhang, Fuxiang, et al.
Published: (2024)
by: Zhang, Fuxiang, et al.
Published: (2024)
Scaling DRL for Decision Making: A Survey on Data, Network, and Training Budget Strategies
by: Ma, Yi, et al.
Published: (2025)
by: Ma, Yi, et al.
Published: (2025)
Vulnerable Agent Identification in Large-Scale Multi-Agent Reinforcement Learning
by: Li, Simin, et al.
Published: (2025)
by: Li, Simin, et al.
Published: (2025)
Reinforcement Learning with Knowledge Representation and Reasoning: A Brief Survey
by: Yu, Chao, et al.
Published: (2023)
by: Yu, Chao, et al.
Published: (2023)
GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training
by: Wei, Tong, et al.
Published: (2025)
by: Wei, Tong, et al.
Published: (2025)
Absolute Zero: Reinforced Self-play Reasoning with Zero Data
by: Zhao, Andrew, et al.
Published: (2025)
by: Zhao, Andrew, et al.
Published: (2025)
Empirical Study on Robustness and Resilience in Cooperative Multi-Agent Reinforcement Learning
by: Li, Simin, et al.
Published: (2025)
by: Li, Simin, et al.
Published: (2025)
EvoMAS: Learning Execution-Time Workflows for Multi-Agent Systems
by: Xu, Chengdong, et al.
Published: (2026)
by: Xu, Chengdong, et al.
Published: (2026)
Online Planning for Multi-UAV Pursuit-Evasion in Unknown Environments Using Deep Reinforcement Learning
by: Chen, Jiayu, et al.
Published: (2024)
by: Chen, Jiayu, et al.
Published: (2024)
Enhance the Safety in Reinforcement Learning by ADRC Lagrangian Methods
by: Zhang, Mingxu, et al.
Published: (2026)
by: Zhang, Mingxu, et al.
Published: (2026)
Social World Model-Augmented Mechanism Design Policy Learning
by: Zhang, Xiaoyuan, et al.
Published: (2025)
by: Zhang, Xiaoyuan, et al.
Published: (2025)
Unlocking Reasoning Capabilities in LLMs via Reinforcement Learning Exploration
by: Deng, Wenhao, et al.
Published: (2025)
by: Deng, Wenhao, et al.
Published: (2025)
Context-Picker: Dynamic context selection using multi-stage reinforcement learning
by: Zhu, Siyuan, et al.
Published: (2025)
by: Zhu, Siyuan, et al.
Published: (2025)
HISR: Hindsight Information Modulated Segmental Process Rewards For Multi-turn Agentic Reinforcement Learning
by: Lu, Zhicong, et al.
Published: (2026)
by: Lu, Zhicong, et al.
Published: (2026)
Reaching Consensus in Cooperative Multi-Agent Reinforcement Learning with Goal Imagination
by: Wang, Liangzhou, et al.
Published: (2024)
by: Wang, Liangzhou, et al.
Published: (2024)
Evolving Diverse Red-team Language Models in Multi-round Multi-agent Games
by: Ma, Chengdong, et al.
Published: (2023)
by: Ma, Chengdong, et al.
Published: (2023)
Multi-UAV Formation Control with Static and Dynamic Obstacle Avoidance via Reinforcement Learning
by: Xie, Yuqing, et al.
Published: (2024)
by: Xie, Yuqing, et al.
Published: (2024)
AI Agents for Web Testing: A Case Study in the Wild
by: Ye, Naimeng, et al.
Published: (2025)
by: Ye, Naimeng, et al.
Published: (2025)
Similar Items
-
Mastering Multi-Drone Volleyball through Hierarchical Co-Self-Play Reinforcement Learning
by: Zhang, Ruize, et al.
Published: (2025) -
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning
by: Xu, Zelai, et al.
Published: (2026) -
VolleyBots: A Testbed for Multi-Drone Volleyball Game Combining Motion Control and Strategic Play
by: Xu, Zelai, et al.
Published: (2025) -
Goal Discovery with Causal Capacity for Efficient Reinforcement Learning
by: Yu, Yan, et al.
Published: (2025) -
Towards Efficient Collaboration via Graph Modeling in Reinforcement Learning
by: Fan, Wenzhe, et al.
Published: (2024)