:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Xiao, Wang, Jia, Wang, Yijie, Dang, Pengtao, Cao, Sha, Zhang, Chi
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2509.20502
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

DreamPRM-Code: Function-as-Step Process Reward Model with Label Correction for LLM Coding
by: Zhang, Ruiyi, et al.
Published: (2025)

L-MARS: Legal Multi-Agent Workflow with Orchestrated Reasoning and Agentic Search
by: Wang, Ziqi, et al.
Published: (2025)

MARS: Multi-Agent Adaptive Reasoning with Socratic Guidance for Automated Prompt Optimization
by: Zhang, Jian, et al.
Published: (2025)

MARS$^2$: Scaling Multi-Agent Tree Search via Reinforcement Learning for Code Generation
by: Li, Pengfei, et al.
Published: (2026)

Learning to reason about rare diseases through retrieval-augmented agents
by: Kim, Ha Young, et al.
Published: (2025)

CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks
by: Yu, Ping, et al.
Published: (2025)

Understanding the planning of LLM agents: A survey
by: Huang, Xu, et al.
Published: (2024)

CARE: Cognitive-reasoning Augmented Reinforcement for Emotional Support Conversation
by: Zhu, Jie, et al.
Published: (2025)

Large language models show fragile cognitive reasoning about human emotions
by: Bhattacharyya, Sree, et al.
Published: (2025)

Diverse And Private Synthetic Datasets Generation for RAG evaluation: A multi-agent framework
by: Driouich, Ilias, et al.
Published: (2025)

Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying
by: Castagna, Federico, et al.
Published: (2024)

Is continuous CoT better suited for multi-lingual reasoning?
by: Bashir, Ali Hamza, et al.
Published: (2026)

HingeMem: Boundary Guided Long-Term Memory with Query Adaptive Retrieval for Scalable Dialogues
by: Zhong, Yijie, et al.
Published: (2026)

Auto-ABSA: Cross-Domain Aspect Detection and Sentiment Analysis Using Auxiliary Sentences
by: Wang, Teng, et al.
Published: (2022)

Apollo: A Lightweight Multilingual Medical LLM towards Democratizing Medical AI to 6B People
by: Wang, Xidong, et al.
Published: (2024)

MARS-Bench: A Multi-turn Athletic Real-world Scenario Benchmark for Dialogue Evaluation
by: Yang, Chenghao, et al.
Published: (2025)

ClinicalGPT-R1: Pushing reasoning capability of generalist disease diagnosis with large language model
by: Lan, Wuyang, et al.
Published: (2025)

CoreEval: Automatically Building Contamination-Resilient Datasets with Real-World Knowledge toward Reliable LLM Evaluation
by: Zhao, Jingqian, et al.
Published: (2025)

DiagGPT: An LLM-based and Multi-agent Dialogue System with Automatic Topic Management for Flexible Task-Oriented Dialogue
by: Cao, Lang
Published: (2023)

Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry
by: Peng, Run, et al.
Published: (2025)

MIND: Towards Immersive Psychological Healing with Multi-agent Inner Dialogue
by: Chen, Yujia, et al.
Published: (2025)

Slm-mux: Orchestrating small language models for reasoning
by: Wang, Chenyu, et al.
Published: (2025)

StateFlow: Enhancing LLM Task-Solving through State-Driven Workflows
by: Wu, Yiran, et al.
Published: (2024)

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning
by: Zhang, Beichen, et al.
Published: (2025)

SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning
by: Ghafarollahi, Alireza, et al.
Published: (2024)

Supervising the search process produces reliable and generalizable information-seeking agents
by: Xiong, Guangzhi, et al.
Published: (2025)

AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation
by: Li, Fengyu, et al.
Published: (2025)

MURMUR: Using cross-user chatter to break collaborative language agents in groups
by: Patlan, Atharv Singh, et al.
Published: (2025)

SEM: Reinforcement Learning for Search-Efficient Large Language Models
by: Sha, Zeyang, et al.
Published: (2025)

Auditing medical multi-agent AI reveals risks of false consensus
by: Zhu, Yinghao, et al.
Published: (2025)

Universal Legal Article Prediction via Tight Collaboration between Supervised Classification Model and LLM
by: Chi, Xiao, et al.
Published: (2025)

MARS: Co-evolving Dual-System Deep Research via Multi-Agent Reinforcement Learning
by: Chen, Guoxin, et al.
Published: (2025)

ProtAgents: Protein discovery via large language model multi-agent collaborations combining physics and machine learning
by: Ghafarollahi, A., et al.
Published: (2024)

Language Ranker: A Lightweight Ranking framework for LLM Decoding
by: Zhang, Chenheng, et al.
Published: (2025)

YAYI-UIE: A Chat-Enhanced Instruction Tuning Framework for Universal Information Extraction
by: Xiao, Xinglin, et al.
Published: (2023)

Automated stereotactic radiosurgery planning using a human-in-the-loop reasoning large language model agent
by: Nusrat, Humza, et al.
Published: (2025)

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents
by: Zhu, Kunlun, et al.
Published: (2025)

Alleviating Choice Supportive Bias in LLM with Reasoning Dependency Generation
by: Zhuang, Nan, et al.
Published: (2025)

Plancraft: an evaluation dataset for planning with LLM agents
by: Dagan, Gautier, et al.
Published: (2024)

ChatSOP: An SOP-Guided MCTS Planning Framework for Controllable LLM Dialogue Agents
by: Li, Zhigen, et al.
Published: (2024)