Saved in:
| Main Authors: | Wang, Xiao, Wang, Jia, Wang, Yijie, Dang, Pengtao, Cao, Sha, Zhang, Chi |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.20502 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DreamPRM-Code: Function-as-Step Process Reward Model with Label Correction for LLM Coding
by: Zhang, Ruiyi, et al.
Published: (2025)
by: Zhang, Ruiyi, et al.
Published: (2025)
L-MARS: Legal Multi-Agent Workflow with Orchestrated Reasoning and Agentic Search
by: Wang, Ziqi, et al.
Published: (2025)
by: Wang, Ziqi, et al.
Published: (2025)
MARS: Multi-Agent Adaptive Reasoning with Socratic Guidance for Automated Prompt Optimization
by: Zhang, Jian, et al.
Published: (2025)
by: Zhang, Jian, et al.
Published: (2025)
MARS$^2$: Scaling Multi-Agent Tree Search via Reinforcement Learning for Code Generation
by: Li, Pengfei, et al.
Published: (2026)
by: Li, Pengfei, et al.
Published: (2026)
Learning to reason about rare diseases through retrieval-augmented agents
by: Kim, Ha Young, et al.
Published: (2025)
by: Kim, Ha Young, et al.
Published: (2025)
CoT-Self-Instruct: Building high-quality synthetic prompts for reasoning and non-reasoning tasks
by: Yu, Ping, et al.
Published: (2025)
by: Yu, Ping, et al.
Published: (2025)
Understanding the planning of LLM agents: A survey
by: Huang, Xu, et al.
Published: (2024)
by: Huang, Xu, et al.
Published: (2024)
CARE: Cognitive-reasoning Augmented Reinforcement for Emotional Support Conversation
by: Zhu, Jie, et al.
Published: (2025)
by: Zhu, Jie, et al.
Published: (2025)
Large language models show fragile cognitive reasoning about human emotions
by: Bhattacharyya, Sree, et al.
Published: (2025)
by: Bhattacharyya, Sree, et al.
Published: (2025)
Diverse And Private Synthetic Datasets Generation for RAG evaluation: A multi-agent framework
by: Driouich, Ilias, et al.
Published: (2025)
by: Driouich, Ilias, et al.
Published: (2025)
Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying
by: Castagna, Federico, et al.
Published: (2024)
by: Castagna, Federico, et al.
Published: (2024)
Is continuous CoT better suited for multi-lingual reasoning?
by: Bashir, Ali Hamza, et al.
Published: (2026)
by: Bashir, Ali Hamza, et al.
Published: (2026)
HingeMem: Boundary Guided Long-Term Memory with Query Adaptive Retrieval for Scalable Dialogues
by: Zhong, Yijie, et al.
Published: (2026)
by: Zhong, Yijie, et al.
Published: (2026)
Auto-ABSA: Cross-Domain Aspect Detection and Sentiment Analysis Using Auxiliary Sentences
by: Wang, Teng, et al.
Published: (2022)
by: Wang, Teng, et al.
Published: (2022)
Apollo: A Lightweight Multilingual Medical LLM towards Democratizing Medical AI to 6B People
by: Wang, Xidong, et al.
Published: (2024)
by: Wang, Xidong, et al.
Published: (2024)
MARS-Bench: A Multi-turn Athletic Real-world Scenario Benchmark for Dialogue Evaluation
by: Yang, Chenghao, et al.
Published: (2025)
by: Yang, Chenghao, et al.
Published: (2025)
ClinicalGPT-R1: Pushing reasoning capability of generalist disease diagnosis with large language model
by: Lan, Wuyang, et al.
Published: (2025)
by: Lan, Wuyang, et al.
Published: (2025)
CoreEval: Automatically Building Contamination-Resilient Datasets with Real-World Knowledge toward Reliable LLM Evaluation
by: Zhao, Jingqian, et al.
Published: (2025)
by: Zhao, Jingqian, et al.
Published: (2025)
DiagGPT: An LLM-based and Multi-agent Dialogue System with Automatic Topic Management for Flexible Task-Oriented Dialogue
by: Cao, Lang
Published: (2023)
by: Cao, Lang
Published: (2023)
Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry
by: Peng, Run, et al.
Published: (2025)
by: Peng, Run, et al.
Published: (2025)
MIND: Towards Immersive Psychological Healing with Multi-agent Inner Dialogue
by: Chen, Yujia, et al.
Published: (2025)
by: Chen, Yujia, et al.
Published: (2025)
Slm-mux: Orchestrating small language models for reasoning
by: Wang, Chenyu, et al.
Published: (2025)
by: Wang, Chenyu, et al.
Published: (2025)
StateFlow: Enhancing LLM Task-Solving through State-Driven Workflows
by: Wu, Yiran, et al.
Published: (2024)
by: Wu, Yiran, et al.
Published: (2024)
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning
by: Zhang, Beichen, et al.
Published: (2025)
by: Zhang, Beichen, et al.
Published: (2025)
SciAgents: Automating scientific discovery through multi-agent intelligent graph reasoning
by: Ghafarollahi, Alireza, et al.
Published: (2024)
by: Ghafarollahi, Alireza, et al.
Published: (2024)
Supervising the search process produces reliable and generalizable information-seeking agents
by: Xiong, Guangzhi, et al.
Published: (2025)
by: Xiong, Guangzhi, et al.
Published: (2025)
AIstorian lets AI be a historian: A KG-powered multi-agent system for accurate biography generation
by: Li, Fengyu, et al.
Published: (2025)
by: Li, Fengyu, et al.
Published: (2025)
MURMUR: Using cross-user chatter to break collaborative language agents in groups
by: Patlan, Atharv Singh, et al.
Published: (2025)
by: Patlan, Atharv Singh, et al.
Published: (2025)
SEM: Reinforcement Learning for Search-Efficient Large Language Models
by: Sha, Zeyang, et al.
Published: (2025)
by: Sha, Zeyang, et al.
Published: (2025)
Auditing medical multi-agent AI reveals risks of false consensus
by: Zhu, Yinghao, et al.
Published: (2025)
by: Zhu, Yinghao, et al.
Published: (2025)
Universal Legal Article Prediction via Tight Collaboration between Supervised Classification Model and LLM
by: Chi, Xiao, et al.
Published: (2025)
by: Chi, Xiao, et al.
Published: (2025)
MARS: Co-evolving Dual-System Deep Research via Multi-Agent Reinforcement Learning
by: Chen, Guoxin, et al.
Published: (2025)
by: Chen, Guoxin, et al.
Published: (2025)
ProtAgents: Protein discovery via large language model multi-agent collaborations combining physics and machine learning
by: Ghafarollahi, A., et al.
Published: (2024)
by: Ghafarollahi, A., et al.
Published: (2024)
Language Ranker: A Lightweight Ranking framework for LLM Decoding
by: Zhang, Chenheng, et al.
Published: (2025)
by: Zhang, Chenheng, et al.
Published: (2025)
YAYI-UIE: A Chat-Enhanced Instruction Tuning Framework for Universal Information Extraction
by: Xiao, Xinglin, et al.
Published: (2023)
by: Xiao, Xinglin, et al.
Published: (2023)
Automated stereotactic radiosurgery planning using a human-in-the-loop reasoning large language model agent
by: Nusrat, Humza, et al.
Published: (2025)
by: Nusrat, Humza, et al.
Published: (2025)
MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents
by: Zhu, Kunlun, et al.
Published: (2025)
by: Zhu, Kunlun, et al.
Published: (2025)
Alleviating Choice Supportive Bias in LLM with Reasoning Dependency Generation
by: Zhuang, Nan, et al.
Published: (2025)
by: Zhuang, Nan, et al.
Published: (2025)
Plancraft: an evaluation dataset for planning with LLM agents
by: Dagan, Gautier, et al.
Published: (2024)
by: Dagan, Gautier, et al.
Published: (2024)
ChatSOP: An SOP-Guided MCTS Planning Framework for Controllable LLM Dialogue Agents
by: Li, Zhigen, et al.
Published: (2024)
by: Li, Zhigen, et al.
Published: (2024)
Similar Items
-
DreamPRM-Code: Function-as-Step Process Reward Model with Label Correction for LLM Coding
by: Zhang, Ruiyi, et al.
Published: (2025) -
L-MARS: Legal Multi-Agent Workflow with Orchestrated Reasoning and Agentic Search
by: Wang, Ziqi, et al.
Published: (2025) -
MARS: Multi-Agent Adaptive Reasoning with Socratic Guidance for Automated Prompt Optimization
by: Zhang, Jian, et al.
Published: (2025) -
MARS$^2$: Scaling Multi-Agent Tree Search via Reinforcement Learning for Code Generation
by: Li, Pengfei, et al.
Published: (2026) -
Learning to reason about rare diseases through retrieval-augmented agents
by: Kim, Ha Young, et al.
Published: (2025)