Saved in:
| Main Authors: | Hamad, Hassan, Xu, Yingru, Zhao, Liang, Yan, Wenbo, Gyanchandani, Narendra |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.17052 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Reflect before Act: Proactive Error Correction in Language Models
by: Zeng, Qiuhai, et al.
Published: (2025)
by: Zeng, Qiuhai, et al.
Published: (2025)
ToolScan: A Benchmark for Characterizing Errors in Tool-Use LLMs
by: Kokane, Shirley, et al.
Published: (2024)
by: Kokane, Shirley, et al.
Published: (2024)
Tools Fail: Detecting Silent Errors in Faulty Tools
by: Sun, Jimin, et al.
Published: (2024)
by: Sun, Jimin, et al.
Published: (2024)
ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents
by: Hu, Xuhao, et al.
Published: (2026)
by: Hu, Xuhao, et al.
Published: (2026)
Utility-Guided Agent Orchestration for Efficient LLM Tool Use
by: Liu, Boyan, et al.
Published: (2026)
by: Liu, Boyan, et al.
Published: (2026)
GOOSE Algorithm: A Powerful Optimization Tool for Real-World Engineering Challenges and Beyond
by: Hamad, Rebwar Khalid, et al.
Published: (2023)
by: Hamad, Rebwar Khalid, et al.
Published: (2023)
Robust Tool Use via Fission-GRPO: Learning to Recover from Execution Errors
by: Zhang, Zhiwei, et al.
Published: (2026)
by: Zhang, Zhiwei, et al.
Published: (2026)
AskToAct: Enhancing LLMs Tool Use via Self-Correcting Clarification
by: Zhang, Xuan, et al.
Published: (2025)
by: Zhang, Xuan, et al.
Published: (2025)
UniToolCall: Unifying Tool-Use Representation, Data, and Evaluation for LLM Agents
by: Liang, Yijuan, et al.
Published: (2026)
by: Liang, Yijuan, et al.
Published: (2026)
ToolWeave: Structured Synthesis of Complex Multi-Turn Tool-Calling Dialogues
by: Khandelwal, Dinesh, et al.
Published: (2026)
by: Khandelwal, Dinesh, et al.
Published: (2026)
TEA-Bench: A Systematic Benchmarking of Tool-enhanced Emotional Support Dialogue Agent
by: Sui, Xingyu, et al.
Published: (2026)
by: Sui, Xingyu, et al.
Published: (2026)
Budget-Aware Tool-Use Enables Effective Agent Scaling
by: Liu, Tengxiao, et al.
Published: (2025)
by: Liu, Tengxiao, et al.
Published: (2025)
ToolMind Technical Report: A Large-Scale, Reasoning-Enhanced Tool-Use Dataset
by: Yang, Chen, et al.
Published: (2025)
by: Yang, Chen, et al.
Published: (2025)
Learning to Rewrite Tool Descriptions for Reliable LLM-Agent Tool Use
by: Guo, Ruocheng, et al.
Published: (2026)
by: Guo, Ruocheng, et al.
Published: (2026)
Learning How to Use Tools, Not Just When: Pattern-Aware Tool-Integrated Reasoning
by: Xu, Ningning, et al.
Published: (2025)
by: Xu, Ningning, et al.
Published: (2025)
Logit Dynamics in Softmax Policy Gradient Methods
by: Li, Yingru
Published: (2025)
by: Li, Yingru
Published: (2025)
Advancing Parkinson's Disease Progression Prediction: Comparing Long Short-Term Memory Networks and Kolmogorov-Arnold Networks
by: Roy, Abhinav, et al.
Published: (2024)
by: Roy, Abhinav, et al.
Published: (2024)
ToolRM: Towards Agentic Tool-Use Reward Modeling
by: Li, Renhao, et al.
Published: (2025)
by: Li, Renhao, et al.
Published: (2025)
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
by: Feng, Jiazhan, et al.
Published: (2025)
by: Feng, Jiazhan, et al.
Published: (2025)
ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use
by: Deng, Mengjie, et al.
Published: (2025)
by: Deng, Mengjie, et al.
Published: (2025)
Guided by Trajectories: Repairing and Rewarding Tool-Use Trajectories for Tool-Integrated Reasoning
by: Gong, Siyu, et al.
Published: (2026)
by: Gong, Siyu, et al.
Published: (2026)
Are Tools All We Need? Unveiling the Tool-Use Tax in LLM Agents
by: Zhang, Kaituo, et al.
Published: (2026)
by: Zhang, Kaituo, et al.
Published: (2026)
Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection
by: Zhang, Fanrui, et al.
Published: (2025)
by: Zhang, Fanrui, et al.
Published: (2025)
ToolWeaver: Weaving Collaborative Semantics for Scalable Tool Use in Large Language Models
by: Fang, Bowen, et al.
Published: (2026)
by: Fang, Bowen, et al.
Published: (2026)
FamilyTool: A Multi-hop Personalized Tool Use Benchmark
by: Wang, Yuxin, et al.
Published: (2025)
by: Wang, Yuxin, et al.
Published: (2025)
DICE-BENCH: Evaluating the Tool-Use Capabilities of Large Language Models in Multi-Round, Multi-Party Dialogues
by: Jang, Kyochul, et al.
Published: (2025)
by: Jang, Kyochul, et al.
Published: (2025)
Tool-Augmented Policy Optimization: Synergizing Reasoning and Adaptive Tool Use with Reinforcement Learning
by: Wu, Wenxun, et al.
Published: (2025)
by: Wu, Wenxun, et al.
Published: (2025)
Model-Adaptive Tool Necessity Reveals the Knowing-Doing Gap in LLM Tool Use
by: Cheng, Yize, et al.
Published: (2026)
by: Cheng, Yize, et al.
Published: (2026)
FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use
by: Lu, Jiaxuan, et al.
Published: (2026)
by: Lu, Jiaxuan, et al.
Published: (2026)
Don't Fine-Tune, Decode: Syntax Error-Free Tool Use via Constrained Decoding
by: Zhang, Kexun, et al.
Published: (2023)
by: Zhang, Kexun, et al.
Published: (2023)
Controllable and Verifiable Tool-Use Data Synthesis for Agentic Reinforcement Learning
by: Xu, Siyuan, et al.
Published: (2026)
by: Xu, Siyuan, et al.
Published: (2026)
In-Context Reinforcement Learning for Tool Use in Large Language Models
by: Ye, Yaoqi, et al.
Published: (2026)
by: Ye, Yaoqi, et al.
Published: (2026)
AutoTool: Automatic Scaling of Tool-Use Capabilities in RL via Decoupled Entropy Constraints
by: Zeng, Yirong, et al.
Published: (2026)
by: Zeng, Yirong, et al.
Published: (2026)
ECG-Agent: On-Device Tool-Calling Agent for ECG Multi-Turn Dialogue
by: Chung, Hyunseung, et al.
Published: (2026)
by: Chung, Hyunseung, et al.
Published: (2026)
EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection
by: Yang, Shuo, et al.
Published: (2026)
by: Yang, Shuo, et al.
Published: (2026)
iTool: Reinforced Fine-Tuning with Dynamic Deficiency Calibration for Advanced Tool Use
by: Zeng, Yirong, et al.
Published: (2025)
by: Zeng, Yirong, et al.
Published: (2025)
MMedAgent: Learning to Use Medical Tools with Multi-modal Agent
by: Li, Binxu, et al.
Published: (2024)
by: Li, Binxu, et al.
Published: (2024)
Tool-as-Interface: Learning Robot Policies from Observing Human Tool Use
by: Chen, Haonan, et al.
Published: (2025)
by: Chen, Haonan, et al.
Published: (2025)
Reducing Cognitive Overhead in Tool Use via Multi-Small-Agent Reinforcement Learning
by: Wang, Dayu, et al.
Published: (2025)
by: Wang, Dayu, et al.
Published: (2025)
Creative Robot Tool Use by Counterfactual Reasoning
by: Akbulut, M. Tuluhan, et al.
Published: (2026)
by: Akbulut, M. Tuluhan, et al.
Published: (2026)
Similar Items
-
Reflect before Act: Proactive Error Correction in Language Models
by: Zeng, Qiuhai, et al.
Published: (2025) -
ToolScan: A Benchmark for Characterizing Errors in Tool-Use LLMs
by: Kokane, Shirley, et al.
Published: (2024) -
Tools Fail: Detecting Silent Errors in Faulty Tools
by: Sun, Jimin, et al.
Published: (2024) -
ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents
by: Hu, Xuhao, et al.
Published: (2026) -
Utility-Guided Agent Orchestration for Efficient LLM Tool Use
by: Liu, Boyan, et al.
Published: (2026)