Saved in:
| Main Authors: | Liu, Fengyuan, Zhao, Rui, Chen, Shuo, Li, Guohao, Torr, Philip, Han, Lei, Gu, Jindong |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.16494 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Can Large Language Model Agents Simulate Human Trust Behavior?
by: Xie, Chengxing, et al.
Published: (2024)
by: Xie, Chengxing, et al.
Published: (2024)
LLM Jailbreak Detection for (Almost) Free!
by: Chen, Guorui, et al.
Published: (2025)
by: Chen, Guorui, et al.
Published: (2025)
Benchmarking Open-ended Audio Dialogue Understanding for Large Audio-Language Models
by: Gao, Kuofeng, et al.
Published: (2024)
by: Gao, Kuofeng, et al.
Published: (2024)
The Path Matters: Learning a Token-Commitment Policy for Diffusion Language Models
by: Sun, Bohang, et al.
Published: (2026)
by: Sun, Bohang, et al.
Published: (2026)
CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards
by: Xue, Xiangyuan, et al.
Published: (2025)
by: Xue, Xiangyuan, et al.
Published: (2025)
Energy-Latency Manipulation of Multi-modal Large Language Models via Verbose Samples
by: Gao, Kuofeng, et al.
Published: (2024)
by: Gao, Kuofeng, et al.
Published: (2024)
Can Multimodal Large Language Models Truly Perform Multimodal In-Context Learning?
by: Chen, Shuo, et al.
Published: (2023)
by: Chen, Shuo, et al.
Published: (2023)
A Survey on Responsible Generative AI: What to Generate and What Not
by: Gu, Jindong
Published: (2024)
by: Gu, Jindong
Published: (2024)
True Multimodal In-Context Learning Needs Attention to the Visual Context
by: Chen, Shuo, et al.
Published: (2025)
by: Chen, Shuo, et al.
Published: (2025)
Can Knowledge-Graph-based Retrieval Augmented Generation Really Retrieve What You Need?
by: Yu, Junchi, et al.
Published: (2025)
by: Yu, Junchi, et al.
Published: (2025)
Nuclear Deployed: Analyzing Catastrophic Risks in Decision-making of Autonomous LLM Agents
by: Xu, Rongwu, et al.
Published: (2025)
by: Xu, Rongwu, et al.
Published: (2025)
Synthesizing Post-Training Data for LLMs through Multi-Agent Simulation
by: Tang, Shuo, et al.
Published: (2024)
by: Tang, Shuo, et al.
Published: (2024)
AdaMARP: An Adaptive Multi-Agent Interaction Framework for General Immersive Role-Playing
by: Xu, Zhenhua, et al.
Published: (2026)
by: Xu, Zhenhua, et al.
Published: (2026)
Eigen-1: Adaptive Multi-Agent Refinement with Monitor-Based RAG for Scientific Reasoning
by: Tang, Xiangru, et al.
Published: (2025)
by: Tang, Xiangru, et al.
Published: (2025)
PlanGEN: A Multi-Agent Framework for Generating Planning and Reasoning Trajectories for Complex Problem Solving
by: Parmar, Mihir, et al.
Published: (2025)
by: Parmar, Mihir, et al.
Published: (2025)
Visual Question Decomposition on Multimodal Large Language Models
by: Zhang, Haowei, et al.
Published: (2024)
by: Zhang, Haowei, et al.
Published: (2024)
When Agents "Misremember" Collectively: Exploring the Mandela Effect in LLM-based Multi-Agent Systems
by: Xu, Naen, et al.
Published: (2026)
by: Xu, Naen, et al.
Published: (2026)
ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering
by: Liu, Zexi, et al.
Published: (2025)
by: Liu, Zexi, et al.
Published: (2025)
Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey
by: Guan, Shengyue, et al.
Published: (2025)
by: Guan, Shengyue, et al.
Published: (2025)
GroundAct: Can LLM Agents Ground Actions in Environmental States?
by: Wang, Zixuan, et al.
Published: (2025)
by: Wang, Zixuan, et al.
Published: (2025)
Voting or Consensus? Decision-Making in Multi-Agent Debate
by: Kaesberg, Lars Benedikt, et al.
Published: (2025)
by: Kaesberg, Lars Benedikt, et al.
Published: (2025)
AgentFugue: Agent Scaling for Long-Horizon Tasks through Collective Reasoning
by: Hu, Yuyang, et al.
Published: (2026)
by: Hu, Yuyang, et al.
Published: (2026)
Rationale-guided Prompting for Knowledge-based Visual Question Answering
by: Hu, Zhongjian, et al.
Published: (2024)
by: Hu, Zhongjian, et al.
Published: (2024)
FocalPO: Enhancing Preference Optimizing by Focusing on Correct Preference Rankings
by: Liu, Tong, et al.
Published: (2025)
by: Liu, Tong, et al.
Published: (2025)
Can Large Language Models Express Uncertainty Like Human?
by: Tao, Linwei, et al.
Published: (2025)
by: Tao, Linwei, et al.
Published: (2025)
Recursive Multi-Agent Systems
by: Yang, Xiyuan, et al.
Published: (2026)
by: Yang, Xiyuan, et al.
Published: (2026)
Fusion-Eval: Integrating Assistant Evaluators with LLMs
by: Shu, Lei, et al.
Published: (2023)
by: Shu, Lei, et al.
Published: (2023)
Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language Models
by: Lan, Michael, et al.
Published: (2023)
by: Lan, Michael, et al.
Published: (2023)
Inclusion-of-Thoughts: Mitigating Preference Instability via Purifying the Decision Space
by: Madani, Mohammad Reza Ghasemi, et al.
Published: (2026)
by: Madani, Mohammad Reza Ghasemi, et al.
Published: (2026)
AgentsCourt: Building Judicial Decision-Making Agents with Court Debate Simulation and Legal Knowledge Augmentation
by: He, Zhitao, et al.
Published: (2024)
by: He, Zhitao, et al.
Published: (2024)
AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research
by: Chen, Renqi, et al.
Published: (2025)
by: Chen, Renqi, et al.
Published: (2025)
Language Agents as Digital Representatives in Collective Decision-Making
by: Jarrett, Daniel, et al.
Published: (2025)
by: Jarrett, Daniel, et al.
Published: (2025)
Dynamic Evaluation of Large Language Models by Meta Probing Agents
by: Zhu, Kaijie, et al.
Published: (2024)
by: Zhu, Kaijie, et al.
Published: (2024)
Beyond Preset Identities: How Agents Form Stances and Boundaries in Generative Societies
by: Zhang, Hanzhong, et al.
Published: (2026)
by: Zhang, Hanzhong, et al.
Published: (2026)
MSCoRe: A Benchmark for Multi-Stage Collaborative Reasoning in LLM Agents
by: Lei, Yuzhen, et al.
Published: (2025)
by: Lei, Yuzhen, et al.
Published: (2025)
MMG2Skill: Can Agents Distill In-the-Wild Guides into Self-Evolving Skills?
by: Che, Xinyu, et al.
Published: (2026)
by: Che, Xinyu, et al.
Published: (2026)
SciMaster: Towards General-Purpose Scientific AI Agents, Part I. X-Master as Foundation: Can We Lead on Humanity's Last Exam?
by: Chai, Jingyi, et al.
Published: (2025)
by: Chai, Jingyi, et al.
Published: (2025)
A Proactive Multi-Agent Dialogue Framework for Assessing Social Language Disorder Traits in Autism
by: Hu, Chuanbo, et al.
Published: (2026)
by: Hu, Chuanbo, et al.
Published: (2026)
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
by: Xi, Zhiheng, et al.
Published: (2025)
by: Xi, Zhiheng, et al.
Published: (2025)
LiveMCPBench: Can Agents Navigate an Ocean of MCP Tools?
by: Mo, Guozhao, et al.
Published: (2025)
by: Mo, Guozhao, et al.
Published: (2025)
Similar Items
-
Can Large Language Model Agents Simulate Human Trust Behavior?
by: Xie, Chengxing, et al.
Published: (2024) -
LLM Jailbreak Detection for (Almost) Free!
by: Chen, Guorui, et al.
Published: (2025) -
Benchmarking Open-ended Audio Dialogue Understanding for Large Audio-Language Models
by: Gao, Kuofeng, et al.
Published: (2024) -
The Path Matters: Learning a Token-Commitment Policy for Diffusion Language Models
by: Sun, Bohang, et al.
Published: (2026) -
CoMAS: Co-Evolving Multi-Agent Systems via Interaction Rewards
by: Xue, Xiangyuan, et al.
Published: (2025)