Saved in:
| Main Authors: | Su, Haoran, Sun, Yandong, Yu, Congjia |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.08237 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Spatiotemporal Decision Transformer for Traffic Coordination
by: Su, Haoran, et al.
Published: (2026)
by: Su, Haoran, et al.
Published: (2026)
Emergency Preemption Without Online Exploration: A Decision Transformer Approach
by: Su, Haoran, et al.
Published: (2026)
by: Su, Haoran, et al.
Published: (2026)
Multi-Agent Coordination Adaptation via Structure-Guided Orchestration
by: Li, Haoran, et al.
Published: (2026)
by: Li, Haoran, et al.
Published: (2026)
SA-IQA: Redefining Image Quality Assessment for Spatial Aesthetics with Multi-Dimensional Rewards
by: Gao, Yuan, et al.
Published: (2025)
by: Gao, Yuan, et al.
Published: (2025)
Alternating Target-Path Planning for Scalable Multi-Agent Coordination
by: Kumagai, Yu, et al.
Published: (2026)
by: Kumagai, Yu, et al.
Published: (2026)
RewardHackingAgents: Benchmarking Evaluation Integrity for LLM ML-Engineering Agents
by: Atinafu, Yonas, et al.
Published: (2026)
by: Atinafu, Yonas, et al.
Published: (2026)
Facilitating Emergency Vehicle Passage in Congested Urban Areas Using Multi-agent Deep Reinforcement Learning
by: Su, Haoran
Published: (2025)
by: Su, Haoran
Published: (2025)
CalBench: Evaluating Coordination-Privacy Trade-offs in Multi-Agent LLMs
by: Zou, Chelsea, et al.
Published: (2026)
by: Zou, Chelsea, et al.
Published: (2026)
CausalAgent: A Conversational Multi-Agent System for End-to-End Causal Inference
by: Zhu, Jiawei, et al.
Published: (2026)
by: Zhu, Jiawei, et al.
Published: (2026)
Introspection of Thought Helps AI Agents
by: Sun, Haoran, et al.
Published: (2025)
by: Sun, Haoran, et al.
Published: (2025)
Multi-Agent Coordination across Diverse Applications: A Survey
by: Sun, Lijun, et al.
Published: (2025)
by: Sun, Lijun, et al.
Published: (2025)
NORA: A Harness-Engineered Autonomous Research Agent for End-to-End Spatial Data Science
by: Zhou, Bing, et al.
Published: (2026)
by: Zhou, Bing, et al.
Published: (2026)
Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL
by: Li, Weizhen, et al.
Published: (2025)
by: Li, Weizhen, et al.
Published: (2025)
How LLMs Follow Instructions: Skillful Coordination, Not a Universal Mechanism
by: Rocchetti, Elisabetta, et al.
Published: (2026)
by: Rocchetti, Elisabetta, et al.
Published: (2026)
From Scores to Preferences: Redefining MOS Benchmarking for Speech Quality Reward Modeling
by: Cao, Yifei, et al.
Published: (2025)
by: Cao, Yifei, et al.
Published: (2025)
Cooperative Reward Shaping for Multi-Agent Pathfinding
by: Song, Zhenyu, et al.
Published: (2024)
by: Song, Zhenyu, et al.
Published: (2024)
Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards
by: Da, Jeff, et al.
Published: (2025)
by: Da, Jeff, et al.
Published: (2025)
Data-Efficient Multi-Agent Spatial Planning with LLMs
by: Su, Huangyuan, et al.
Published: (2025)
by: Su, Huangyuan, et al.
Published: (2025)
Omni-Thinker: Scaling Multi-Task RL in LLMs with Hybrid Reward and Task Scheduling
by: Li, Derek, et al.
Published: (2025)
by: Li, Derek, et al.
Published: (2025)
Multi-Agent Reinforcement Learning with a Hierarchy of Reward Machines
by: Zheng, Xuejing, et al.
Published: (2024)
by: Zheng, Xuejing, et al.
Published: (2024)
Resilient Multi-Agent Negotiation for Medical Supply Chains:Integrating LLMs and Blockchain for Transparent Coordination
by: ALMutairi, Mariam, et al.
Published: (2025)
by: ALMutairi, Mariam, et al.
Published: (2025)
Swarm Skills: A Portable, Self-Evolving Multi-Agent System Specification for Coordination Engineering
by: Zhang, Xinyu, et al.
Published: (2026)
by: Zhang, Xinyu, et al.
Published: (2026)
AGORA: Adapter-Grounded Observation-Action Retention for Inference-Free Prompt Compression in LLM Agents
by: Zhang, Haoran, et al.
Published: (2026)
by: Zhang, Haoran, et al.
Published: (2026)
PythonSaga: Redefining the Benchmark to Evaluate Code Generating LLMs
by: Yadav, Ankit, et al.
Published: (2024)
by: Yadav, Ankit, et al.
Published: (2024)
MetaAgent-X : Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning
by: Zhang, Yaolun, et al.
Published: (2026)
by: Zhang, Yaolun, et al.
Published: (2026)
GOV-REK: Governed Reward Engineering Kernels for Designing Robust Multi-Agent Reinforcement Learning Systems
by: Rana, Ashish, et al.
Published: (2024)
by: Rana, Ashish, et al.
Published: (2024)
Confidence as a Reward: Transforming LLMs into Reward Models
by: Du, He, et al.
Published: (2025)
by: Du, He, et al.
Published: (2025)
Learning to Communicate: Toward End-to-End Optimization of Multi-Agent Language Systems
by: Yu, Ye, et al.
Published: (2026)
by: Yu, Ye, et al.
Published: (2026)
Multi-Agent Coordinated Rename Refactoring
by: Bellur, Abhiram, et al.
Published: (2026)
by: Bellur, Abhiram, et al.
Published: (2026)
Redistributing Rewards Across Time and Agents for Multi-Agent Reinforcement Learning
by: Kapoor, Aditya, et al.
Published: (2025)
by: Kapoor, Aditya, et al.
Published: (2025)
ARMS: Automatic Reward Shaping for Sparse-Reward Multi-Agent Reinforcement Learning
by: Abboud, Elie, et al.
Published: (2026)
by: Abboud, Elie, et al.
Published: (2026)
FROGENT: An End-to-End Full-process Drug Design Multi-Agent System
by: Pan, Qihua, et al.
Published: (2025)
by: Pan, Qihua, et al.
Published: (2025)
SWE-Dev: Building Software Engineering Agents with Training and Inference Scaling
by: Wang, Haoran, et al.
Published: (2025)
by: Wang, Haoran, et al.
Published: (2025)
Coordination Graphs for Constrained Multi-Agent Reinforcement Learning
by: Amaya-Corredor, Santiago, et al.
Published: (2026)
by: Amaya-Corredor, Santiago, et al.
Published: (2026)
Reward-Robust RLHF in LLMs
by: Yan, Yuzi, et al.
Published: (2024)
by: Yan, Yuzi, et al.
Published: (2024)
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments
by: Huang, Jen-tse, et al.
Published: (2024)
by: Huang, Jen-tse, et al.
Published: (2024)
Collaborate, Deliberate, Evaluate: How LLM Alignment Affects Coordinated Multi-Agent Outcomes
by: Nath, Abhijnan, et al.
Published: (2025)
by: Nath, Abhijnan, et al.
Published: (2025)
How to Build AI Agents by Augmenting LLMs with Codified Human Expert Domain Knowledge? A Software Engineering Framework
by: uulu, Choro Ulan, et al.
Published: (2026)
by: uulu, Choro Ulan, et al.
Published: (2026)
Prompting Multi-Modal Tokens to Enhance End-to-End Autonomous Driving Imitation Learning with LLMs
by: Duan, Yiqun, et al.
Published: (2024)
by: Duan, Yiqun, et al.
Published: (2024)
SANNet: A Semantic-Aware Agentic AI Networking Framework for Multi-Agent Cross-Layer Coordination
by: Xiao, Yong, et al.
Published: (2025)
by: Xiao, Yong, et al.
Published: (2025)
Similar Items
-
Spatiotemporal Decision Transformer for Traffic Coordination
by: Su, Haoran, et al.
Published: (2026) -
Emergency Preemption Without Online Exploration: A Decision Transformer Approach
by: Su, Haoran, et al.
Published: (2026) -
Multi-Agent Coordination Adaptation via Structure-Guided Orchestration
by: Li, Haoran, et al.
Published: (2026) -
SA-IQA: Redefining Image Quality Assessment for Spatial Aesthetics with Multi-Dimensional Rewards
by: Gao, Yuan, et al.
Published: (2025) -
Alternating Target-Path Planning for Scalable Multi-Agent Coordination
by: Kumagai, Yu, et al.
Published: (2026)