| Main Authors: | Hou, Ruihui, Huai, Ziyue, Zhang, Chennuo, Liu, Ziyan, Zhao, Siran, Yu, Yao, Zhai, Jie, Ruan, Tong |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2606.01094 |
Similar Items
MSDiagnosis: A Benchmark for Evaluating Large Language Models in Multi-Step Clinical Diagnosis
by: Hou, Ruihui, et al.
Published: (2024)
Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning
by: Liu, Jiaqi, et al.
Published: (2025)
CharTool: Tool-Integrated Visual Reasoning for Chart Understanding
by: Zhang, Situo, et al.
Published: (2026)
TPTU: Large Language Model-based AI Agents for Task Planning and Tool Usage
by: Ruan, Jingqing, et al.
Published: (2023)
ConfSpec: Efficient Step-Level Speculative Reasoning via Confidence-Gated Verification
by: Liu, Siran, et al.
Published: (2026)
ToolRLA: Multiplicative Reward Decomposition for Tool-Integrated Agents
by: Liu, Pengbo
Published: (2026)
Dissecting Tool-Integrated Reasoning: An Empirical Study and Analysis
by: Zhao, Yufeng, et al.
Published: (2025)
EGL-SCA: Structural Credit Assignment for Co-Evolving Instructions and Tools in Graph Reasoning Agents
by: Yuan, Zike, et al.
Published: (2026)
TableMind: An Autonomous Programmatic Agent for Tool-Augmented Table Reasoning
by: Jiang, Chuang, et al.
Published: (2025)
Guided by Trajectories: Repairing and Rewarding Tool-Use Trajectories for Tool-Integrated Reasoning
by: Gong, Siyu, et al.
Published: (2026)
ET-Agent: Incentivizing Effective Tool-Integrated Reasoning Agent via Behavior Calibration
by: Chen, Yifei, et al.
Published: (2026)
SIFThinker: Spatially-Aware Image Focus for Visual Reasoning
by: Chen, Zhangquan, et al.
Published: (2025)
Quantifying and Understanding Uncertainty in Large Reasoning Models
by: Li, Yangyi, et al.
Published: (2026)
MapAgent: A Hierarchical Agent for Geospatial Reasoning with Dynamic Map Tool Integration
by: Hasan, Md Hasebul, et al.
Published: (2025)
Agent-Environment Alignment via Automated Interface Generation
by: Liu, Kaiming, et al.
Published: (2025)
Understanding Tool-Integrated Reasoning
by: Lin, Heng, et al.
Published: (2025)
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
by: Gou, Zhibin, et al.
Published: (2023)
SciToolAgent: A Knowledge Graph-Driven Scientific Agent for Multi-Tool Integration
by: Ding, Keyan, et al.
Published: (2025)
Empowering Multi-Turn Tool-Integrated Agentic Reasoning with Group Turn Policy Optimization
by: Ding, Yifeng, et al.
Published: (2025)
EigentSearch-Q+: Enhancing Deep Research Agents with Structured Reasoning Tools
by: Zhang, Boer, et al.
Published: (2026)
Building Open-Ended Embodied Agent via Language-Policy Bidirectional Adaptation
by: Zhai, Shaopeng, et al.
Published: (2023)
Integrating Medical Imaging and Clinical Reports Using Multimodal Deep Learning for Advanced Disease Analysis
by: Yao, Ziyan, et al.
Published: (2024)
AMS-IO-Bench and AMS-IO-Agent: Benchmarking and Structured Reasoning for Analog and Mixed-Signal Integrated Circuit Input/Output Design
by: Zhang, Zhishuai, et al.
Published: (2025)
Closing Reasoning Gaps in Clinical Agents with Differential Reasoning Learning
by: Liu, Jinsong, et al.
Published: (2026)
MedMMV: A Controllable Multimodal Multi-Agent Framework for Reliable and Verifiable Clinical Reasoning
by: Liu, Hongjun, et al.
Published: (2025)
X-Light: Cross-City Traffic Signal Control Using Transformer on Transformer as Meta Multi-Agent Reinforcement Learner
by: Jiang, Haoyuan, et al.
Published: (2024)
GeoAgentBench: A Dynamic Execution Benchmark for Tool-Augmented Agents in Spatial Analysis
by: Yu, Bo, et al.
Published: (2026)
Visual Reasoning over Time Series via Multi-Agent System
by: Ruan, Weilin, et al.
Published: (2026)
AgentCDM: Enhancing Multi-Agent Collaborative Decision-Making via ACH-Inspired Structured Reasoning
by: Zhao, Xuyang, et al.
Published: (2025)
Reasoning RAG via System 1 or System 2: A Survey on Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges
by: Liang, Jintao, et al.
Published: (2025)
E3-TIR: Enhanced Experience Exploitation for Tool-Integrated Reasoning
by: Guo, Weiyang, et al.
Published: (2026)
LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing
by: Girella, Federico, et al.
Published: (2025)
MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning
by: Chen, Jiawei, et al.
Published: (2025)
A General Adaptive Dual-level Weighting Mechanism for Remote Sensing Pansharpening
by: Huang, Jie, et al.
Published: (2025)
MROSS: Multi-Round Region-based Optimization for Scene Sketching
by: Liang, Yiqi, et al.
Published: (2024)
Selective Forgetting for Large Reasoning Models
by: Le, Tuan, et al.
Published: (2026)
M-STAR: Multi-Scale Spatiotemporal Autoregression for Human Mobility Modeling
by: Luo, Yuxiao, et al.
Published: (2025)
MCPAgentBench: A Real-world Task Benchmark for Evaluating LLM Agent MCP Tool Use
by: Liu, Wenrui, et al.
Published: (2025)
AgentFugue: Agent Scaling for Long-Horizon Tasks through Collective Reasoning
by: Hu, Yuyang, et al.
Published: (2026)
ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents
by: Li, Dawei, et al.
Published: (2026)