Saved in:
| Main Author: | Zhang, Chenchen |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.02801 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Agent Q-Mix: Selecting the Right Action for LLM Multi-Agent Systems through Reinforcement Learning
by: Jiang, Eric Hanchen, et al.
Published: (2026)
by: Jiang, Eric Hanchen, et al.
Published: (2026)
From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models
by: Zhang, Chenchen
Published: (2026)
by: Zhang, Chenchen
Published: (2026)
Learning Latency-Aware Orchestration for Parallel Multi-Agent Systems
by: Shi, Xi, et al.
Published: (2026)
by: Shi, Xi, et al.
Published: (2026)
Reinforcement World Model Learning for LLM-based Agents
by: Yu, Xiao, et al.
Published: (2026)
by: Yu, Xiao, et al.
Published: (2026)
Reinforce LLM Reasoning through Multi-Agent Reflection
by: Yuan, Yurun, et al.
Published: (2025)
by: Yuan, Yurun, et al.
Published: (2025)
Advancing Multi-Agent RAG Systems with Minimalist Reinforcement Learning
by: Wu, Yihong, et al.
Published: (2025)
by: Wu, Yihong, et al.
Published: (2025)
ClinicalAgents: Multi-Agent Orchestration for Clinical Decision Making with Dual-Memory
by: Ge, Zhuohan, et al.
Published: (2026)
by: Ge, Zhuohan, et al.
Published: (2026)
Response-Conditioned Parallel-to-Sequential Orchestration for Multi-Agent Systems
by: Tastan, Nurbek, et al.
Published: (2026)
by: Tastan, Nurbek, et al.
Published: (2026)
The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration
by: Xu, Haoyuan, et al.
Published: (2026)
by: Xu, Haoyuan, et al.
Published: (2026)
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
by: Xi, Zhiheng, et al.
Published: (2025)
by: Xi, Zhiheng, et al.
Published: (2025)
Real-World Doctor Agent with Proactive Consultation through Multi-Agent Reinforcement Learning
by: Feng, Yichun, et al.
Published: (2025)
by: Feng, Yichun, et al.
Published: (2025)
Bayesian Orchestration of Multi-LLM Agents for Cost-Aware Sequential Decision-Making
by: Amin, Danial
Published: (2026)
by: Amin, Danial
Published: (2026)
Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning
by: Chen, Yiqun, et al.
Published: (2025)
by: Chen, Yiqun, et al.
Published: (2025)
AgentNet: Decentralized Evolutionary Coordination for LLM-based Multi-Agent Systems
by: Yang, Yingxuan, et al.
Published: (2025)
by: Yang, Yingxuan, et al.
Published: (2025)
Unifying Language Agent Algorithms with Graph-based Orchestration Engine for Reproducible Agent Research
by: Zhang, Qianqian, et al.
Published: (2025)
by: Zhang, Qianqian, et al.
Published: (2025)
LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions
by: Sun, Chuanneng, et al.
Published: (2024)
by: Sun, Chuanneng, et al.
Published: (2024)
MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination
by: Li, Zhuo, et al.
Published: (2026)
by: Li, Zhuo, et al.
Published: (2026)
MASA: LLM-Driven Multi-Agent Systems for Autoformalization
by: Zhang, Lan, et al.
Published: (2025)
by: Zhang, Lan, et al.
Published: (2025)
Disagreement as Data: Reasoning Trace Analytics in Multi-Agent Systems
by: Tajik, Elham, et al.
Published: (2026)
by: Tajik, Elham, et al.
Published: (2026)
Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles
by: Wu, Jinyang, et al.
Published: (2026)
by: Wu, Jinyang, et al.
Published: (2026)
Multi-Agent Collaboration via Evolving Orchestration
by: Dang, Yufan, et al.
Published: (2025)
by: Dang, Yufan, et al.
Published: (2025)
Insight Agents: An LLM-Based Multi-Agent System for Data Insights
by: Bai, Jincheng, et al.
Published: (2026)
by: Bai, Jincheng, et al.
Published: (2026)
MAO-ARAG: Multi-Agent Orchestration for Adaptive Retrieval-Augmented Generation
by: Chen, Yiqun, et al.
Published: (2025)
by: Chen, Yiqun, et al.
Published: (2025)
Controlling Performance and Budget of a Centralized Multi-agent LLM System with Reinforcement Learning
by: Jin, Bowen, et al.
Published: (2025)
by: Jin, Bowen, et al.
Published: (2025)
G-Memory: Tracing Hierarchical Memory for Multi-Agent Systems
by: Zhang, Guibin, et al.
Published: (2025)
by: Zhang, Guibin, et al.
Published: (2025)
Creativity in LLM-based Multi-Agent Systems: A Survey
by: Lin, Yi-Cheng, et al.
Published: (2025)
by: Lin, Yi-Cheng, et al.
Published: (2025)
AMAS: Adaptively Determining Communication Topology for LLM-based Multi-Agent System
by: Leong, Hui Yi, et al.
Published: (2025)
by: Leong, Hui Yi, et al.
Published: (2025)
SkillMAS: Skill Co-Evolution with LLM-based Multi-Agent System
by: Pan, Shuai, et al.
Published: (2026)
by: Pan, Shuai, et al.
Published: (2026)
MARCO: Multi-Agent Real-time Chat Orchestration
by: Shrimal, Anubhav, et al.
Published: (2024)
by: Shrimal, Anubhav, et al.
Published: (2024)
MASCA: LLM based-Multi Agents System for Credit Assessment
by: Jajoo, Gautam, et al.
Published: (2025)
by: Jajoo, Gautam, et al.
Published: (2025)
WorkForceAgent-R1: Incentivizing Reasoning Capability in LLM-based Web Agents via Reinforcement Learning
by: Zhuang, Yuchen, et al.
Published: (2025)
by: Zhuang, Yuchen, et al.
Published: (2025)
Every Response Counts: Quantifying Uncertainty of LLM-based Multi-Agent Systems through Tensor Decomposition
by: Chen, Tiejin, et al.
Published: (2026)
by: Chen, Tiejin, et al.
Published: (2026)
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making
by: Yu, Yangyang, et al.
Published: (2024)
by: Yu, Yangyang, et al.
Published: (2024)
Organizing, Orchestrating, and Benchmarking Agent Skills at Ecosystem Scale
by: Li, Hao, et al.
Published: (2026)
by: Li, Hao, et al.
Published: (2026)
MIRAI: Evaluating LLM Agents for Event Forecasting
by: Ye, Chenchen, et al.
Published: (2024)
by: Ye, Chenchen, et al.
Published: (2024)
PARL: Prompt-based Agents for Reinforcement Learning
by: Resendiz, Yarik Menchaca, et al.
Published: (2025)
by: Resendiz, Yarik Menchaca, et al.
Published: (2025)
xRouter: Training Cost-Aware LLMs Orchestration System via Reinforcement Learning
by: Qian, Cheng, et al.
Published: (2025)
by: Qian, Cheng, et al.
Published: (2025)
ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering
by: Liu, Zexi, et al.
Published: (2025)
by: Liu, Zexi, et al.
Published: (2025)
PersonaTrace: Synthesizing Realistic Digital Footprints with LLM Agents
by: Wang, Minjia, et al.
Published: (2026)
by: Wang, Minjia, et al.
Published: (2026)
When Agents "Misremember" Collectively: Exploring the Mandela Effect in LLM-based Multi-Agent Systems
by: Xu, Naen, et al.
Published: (2026)
by: Xu, Naen, et al.
Published: (2026)
Similar Items
-
Agent Q-Mix: Selecting the Right Action for LLM Multi-Agent Systems through Reinforcement Learning
by: Jiang, Eric Hanchen, et al.
Published: (2026) -
From Reasoning to Agentic: Credit Assignment in Reinforcement Learning for Large Language Models
by: Zhang, Chenchen
Published: (2026) -
Learning Latency-Aware Orchestration for Parallel Multi-Agent Systems
by: Shi, Xi, et al.
Published: (2026) -
Reinforcement World Model Learning for LLM-based Agents
by: Yu, Xiao, et al.
Published: (2026) -
Reinforce LLM Reasoning through Multi-Agent Reflection
by: Yuan, Yurun, et al.
Published: (2025)