:: Library Catalog

Bibliographic Details
Main Authors:	Su, Haoran, Sun, Yandong, Yu, Congjia
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2601.08237

Similar Items

Spatiotemporal Decision Transformer for Traffic Coordination
by: Su, Haoran, et al.
Published: (2026)

Emergency Preemption Without Online Exploration: A Decision Transformer Approach
by: Su, Haoran, et al.
Published: (2026)

Multi-Agent Coordination Adaptation via Structure-Guided Orchestration
by: Li, Haoran, et al.
Published: (2026)

SA-IQA: Redefining Image Quality Assessment for Spatial Aesthetics with Multi-Dimensional Rewards
by: Gao, Yuan, et al.
Published: (2025)

Alternating Target-Path Planning for Scalable Multi-Agent Coordination
by: Kumagai, Yu, et al.
Published: (2026)

RewardHackingAgents: Benchmarking Evaluation Integrity for LLM ML-Engineering Agents
by: Atinafu, Yonas, et al.
Published: (2026)

Facilitating Emergency Vehicle Passage in Congested Urban Areas Using Multi-agent Deep Reinforcement Learning
by: Su, Haoran
Published: (2025)

CalBench: Evaluating Coordination-Privacy Trade-offs in Multi-Agent LLMs
by: Zou, Chelsea, et al.
Published: (2026)

CausalAgent: A Conversational Multi-Agent System for End-to-End Causal Inference
by: Zhu, Jiawei, et al.
Published: (2026)

Introspection of Thought Helps AI Agents
by: Sun, Haoran, et al.
Published: (2025)

Multi-Agent Coordination across Diverse Applications: A Survey
by: Sun, Lijun, et al.
Published: (2025)

NORA: A Harness-Engineered Autonomous Research Agent for End-to-End Spatial Data Science
by: Zhou, Bing, et al.
Published: (2026)

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL
by: Li, Weizhen, et al.
Published: (2025)

How LLMs Follow Instructions: Skillful Coordination, Not a Universal Mechanism
by: Rocchetti, Elisabetta, et al.
Published: (2026)

From Scores to Preferences: Redefining MOS Benchmarking for Speech Quality Reward Modeling
by: Cao, Yifei, et al.
Published: (2025)

Cooperative Reward Shaping for Multi-Agent Pathfinding
by: Song, Zhenyu, et al.
Published: (2024)

Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards
by: Da, Jeff, et al.
Published: (2025)

Data-Efficient Multi-Agent Spatial Planning with LLMs
by: Su, Huangyuan, et al.
Published: (2025)

Omni-Thinker: Scaling Multi-Task RL in LLMs with Hybrid Reward and Task Scheduling
by: Li, Derek, et al.
Published: (2025)

Multi-Agent Reinforcement Learning with a Hierarchy of Reward Machines
by: Zheng, Xuejing, et al.
Published: (2024)

Resilient Multi-Agent Negotiation for Medical Supply Chains: Integrating LLMs and Blockchain for Transparent Coordination
by: ALMutairi, Mariam, et al.
Published: (2025)

Swarm Skills: A Portable, Self-Evolving Multi-Agent System Specification for Coordination Engineering
by: Zhang, Xinyu, et al.
Published: (2026)

AGORA: Adapter-Grounded Observation-Action Retention for Inference-Free Prompt Compression in LLM Agents
by: Zhang, Haoran, et al.
Published: (2026)

PythonSaga: Redefining the Benchmark to Evaluate Code Generating LLMs
by: Yadav, Ankit, et al.
Published: (2024)

MetaAgent-X: Breaking the Ceiling of Automatic Multi-Agent Systems via End-to-End Reinforcement Learning
by: Zhang, Yaolun, et al.
Published: (2026)

GOV-REK: Governed Reward Engineering Kernels for Designing Robust Multi-Agent Reinforcement Learning Systems
by: Rana, Ashish, et al.
Published: (2024)

Confidence as a Reward: Transforming LLMs into Reward Models
by: Du, He, et al.
Published: (2025)

Learning to Communicate: Toward End-to-End Optimization of Multi-Agent Language Systems
by: Yu, Ye, et al.
Published: (2026)

Multi-Agent Coordinated Rename Refactoring
by: Bellur, Abhiram, et al.
Published: (2026)

Redistributing Rewards Across Time and Agents for Multi-Agent Reinforcement Learning
by: Kapoor, Aditya, et al.
Published: (2025)

ARMS: Automatic Reward Shaping for Sparse-Reward Multi-Agent Reinforcement Learning
by: Abboud, Elie, et al.
Published: (2026)

FROGENT: An End-to-End Full-process Drug Design Multi-Agent System
by: Pan, Qihua, et al.
Published: (2025)

SWE-Dev: Building Software Engineering Agents with Training and Inference Scaling
by: Wang, Haoran, et al.
Published: (2025)

Coordination Graphs for Constrained Multi-Agent Reinforcement Learning
by: Amaya-Corredor, Santiago, et al.
Published: (2026)

Reward-Robust RLHF in LLMs
by: Yan, Yuzi, et al.
Published: (2024)

How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments
by: Huang, Jen-tse, et al.
Published: (2024)

Collaborate, Deliberate, Evaluate: How LLM Alignment Affects Coordinated Multi-Agent Outcomes
by: Nath, Abhijnan, et al.
Published: (2025)

How to Build AI Agents by Augmenting LLMs with Codified Human Expert Domain Knowledge? A Software Engineering Framework
by: uulu, Choro Ulan, et al.
Published: (2026)

Prompting Multi-Modal Tokens to Enhance End-to-End Autonomous Driving Imitation Learning with LLMs
by: Duan, Yiqun, et al.
Published: (2024)

SANNet: A Semantic-Aware Agentic AI Networking Framework for Multi-Agent Cross-Layer Coordination
by: Xiao, Yong, et al.
Published: (2025)