Saved in:
| Main Authors: | Tan, Weiting, Qu, Xinghua, Tu, Ming, Ge, Meng, Liu, Andy T., Koehn, Philipp, Lu, Lu |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.14480 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
FlowPortrait: Reinforcement Learning for Audio-Driven Portrait Video Generation
by: Tan, Weiting, et al.
Published: (2026)
by: Tan, Weiting, et al.
Published: (2026)
Agents that Matter: Optimizing Multi-Agent LLMs via Removal-Based Attribution
by: Lu, Mingyu, et al.
Published: (2026)
by: Lu, Mingyu, et al.
Published: (2026)
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
by: Li, Zhuofeng, et al.
Published: (2025)
by: Li, Zhuofeng, et al.
Published: (2025)
IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation
by: Han, Senyu, et al.
Published: (2024)
by: Han, Senyu, et al.
Published: (2024)
You Only Align Once: Propagating Cooperative Behaviors in Multi-Agent Systems through Seed Agents
by: Hsing, Nicole, et al.
Published: (2026)
by: Hsing, Nicole, et al.
Published: (2026)
OralAgent: Integrating Reasoning, Tools, and Knowledge for Interactive Dental Image Analysis
by: Hao, Jing, et al.
Published: (2026)
by: Hao, Jing, et al.
Published: (2026)
Communication to Completion: Modeling Collaborative Workflows with Intelligent Multi-Agent Communication
by: Lu, Yiming, et al.
Published: (2025)
by: Lu, Yiming, et al.
Published: (2025)
OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning
by: Bi, Zhenyu, et al.
Published: (2025)
by: Bi, Zhenyu, et al.
Published: (2025)
ColorAgent: Building A Robust, Personalized, and Interactive OS Agent
by: Li, Ning, et al.
Published: (2025)
by: Li, Ning, et al.
Published: (2025)
Memory-R1: Enhancing Large Language Model Agents to Manage and Utilize Memories via Reinforcement Learning
by: Yan, Sikuan, et al.
Published: (2025)
by: Yan, Sikuan, et al.
Published: (2025)
TopoDIM: One-shot Topology Generation of Diverse Interaction Modes for Multi-Agent Systems
by: Sun, Rui, et al.
Published: (2026)
by: Sun, Rui, et al.
Published: (2026)
Multi-Agent Computer Use
by: Koh, Jing Yu, et al.
Published: (2026)
by: Koh, Jing Yu, et al.
Published: (2026)
StitchCUDA: An Automated Multi-Agents End-to-End GPU Programing Framework with Rubric-based Agentic Reinforcement Learning
by: Li, Shiyang, et al.
Published: (2026)
by: Li, Shiyang, et al.
Published: (2026)
Which Agent Causes Task Failures and When? On Automated Failure Attribution of LLM Multi-Agent Systems
by: Zhang, Shaokun, et al.
Published: (2025)
by: Zhang, Shaokun, et al.
Published: (2025)
Beyond Static Tools: Test-Time Tool Evolution for Scientific Reasoning
by: Lu, Jiaxuan, et al.
Published: (2026)
by: Lu, Jiaxuan, et al.
Published: (2026)
Transformer-Based Scalable Multi-Agent Reinforcement Learning for Networked Systems with Long-Range Interactions
by: Sinha, Vidur, et al.
Published: (2025)
by: Sinha, Vidur, et al.
Published: (2025)
MAS-GPT: Training LLMs to Build LLM-based Multi-Agent Systems
by: Ye, Rui, et al.
Published: (2025)
by: Ye, Rui, et al.
Published: (2025)
Safe Multi-agent Reinforcement Learning with Natural Language Constraints
by: Wang, Ziyan, et al.
Published: (2024)
by: Wang, Ziyan, et al.
Published: (2024)
STAR-PólyaMath: Multi-Agent Reasoning under Persistent Meta-Strategic Supervision
by: Wu, Jiaao, et al.
Published: (2026)
by: Wu, Jiaao, et al.
Published: (2026)
LLM Agents Making Agent Tools
by: Wölflein, Georg, et al.
Published: (2025)
by: Wölflein, Georg, et al.
Published: (2025)
One Panel Does Not Fit All: Case-Adaptive Multi-Agent Deliberation for Clinical Prediction
by: Lu, Yuxing, et al.
Published: (2026)
by: Lu, Yuxing, et al.
Published: (2026)
When KV Cache Reuse Fails in Multi-Agent Systems: Cross-Candidate Interaction is Crucial for LLM Judges
by: Liang, Sichu, et al.
Published: (2026)
by: Liang, Sichu, et al.
Published: (2026)
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning
by: Sarkar, Bidipta, et al.
Published: (2025)
by: Sarkar, Bidipta, et al.
Published: (2025)
Strategic Persuasion with Trait-Conditioned Multi-Agent Systems for Iterative Legal Argumentation
by: Siedler, Philipp D.
Published: (2026)
by: Siedler, Philipp D.
Published: (2026)
QUACK: Questioning, Understanding, and Auditing Communicated Knowledge in Multimodal Social Deduction Agents
by: Yuan, Ye, et al.
Published: (2026)
by: Yuan, Ye, et al.
Published: (2026)
ClinEnv: An Interactive Multi-Stage Long Horizon EHR Environment for Agents
by: Lu, Yuxing, et al.
Published: (2026)
by: Lu, Yuxing, et al.
Published: (2026)
How Brittle is Agent Safety? Rethinking Agent Risk under Intent Concealment and Task Complexity
by: Ma, Zihan, et al.
Published: (2025)
by: Ma, Zihan, et al.
Published: (2025)
Trajectory Supervision for Continual Tool-Use Learning in LLMs
by: Reddy, Vishnu Vardhan, et al.
Published: (2026)
by: Reddy, Vishnu Vardhan, et al.
Published: (2026)
Efficient Agents: Building Effective Agents While Reducing Cost
by: Wang, Ningning, et al.
Published: (2025)
by: Wang, Ningning, et al.
Published: (2025)
OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning
by: Lu, Pan, et al.
Published: (2025)
by: Lu, Pan, et al.
Published: (2025)
ValueFlow: Measuring the Propagation of Value Perturbations in Multi-Agent LLM Systems
by: Liu, Jinnuo, et al.
Published: (2026)
by: Liu, Jinnuo, et al.
Published: (2026)
AgentDisCo: Towards Disentanglement and Collaboration in Open-ended Deep Research Agents
by: Jin, Jiarui, et al.
Published: (2026)
by: Jin, Jiarui, et al.
Published: (2026)
Dynamic Coalition Structure Detection in Natural Language-based Interactions
by: Kulkarni, Abhishek N., et al.
Published: (2025)
by: Kulkarni, Abhishek N., et al.
Published: (2025)
LingxiDiagBench: A Multi-Agent Framework for Benchmarking LLMs in Chinese Psychiatric Consultation and Diagnosis
by: Xu, Shihao, et al.
Published: (2026)
by: Xu, Shihao, et al.
Published: (2026)
Memory-Augmented Reinforcement Learning Agent for CAD Generation
by: Xiaolong, Yin, et al.
Published: (2026)
by: Xiaolong, Yin, et al.
Published: (2026)
Preventing Rogue Agents Improves Multi-Agent Collaboration
by: Barbi, Ohav, et al.
Published: (2025)
by: Barbi, Ohav, et al.
Published: (2025)
A Survey on LLM-based Multi-Agent System: Recent Advances and New Frontiers in Application
by: Chen, Shuaihang, et al.
Published: (2024)
by: Chen, Shuaihang, et al.
Published: (2024)
ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration
by: Su, Hongjin, et al.
Published: (2025)
by: Su, Hongjin, et al.
Published: (2025)
ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning
by: Wan, Ziyu, et al.
Published: (2025)
by: Wan, Ziyu, et al.
Published: (2025)
Bayesian Ego-graph Inference for Networked Multi-Agent Reinforcement Learning
by: Duan, Wei, et al.
Published: (2025)
by: Duan, Wei, et al.
Published: (2025)
Similar Items
-
FlowPortrait: Reinforcement Learning for Audio-Driven Portrait Video Generation
by: Tan, Weiting, et al.
Published: (2026) -
Agents that Matter: Optimizing Multi-Agent LLMs via Removal-Based Attribution
by: Lu, Mingyu, et al.
Published: (2026) -
In-the-Flow Agentic System Optimization for Effective Planning and Tool Use
by: Li, Zhuofeng, et al.
Published: (2025) -
IBSEN: Director-Actor Agent Collaboration for Controllable and Interactive Drama Script Generation
by: Han, Senyu, et al.
Published: (2024) -
You Only Align Once: Propagating Cooperative Behaviors in Multi-Agent Systems through Seed Agents
by: Hsing, Nicole, et al.
Published: (2026)