Saved in:
| Main Authors: | Pan, Yu, Li, Xiaocheng, Wang, Hanzhao |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.20415 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Repairing Tool Calls Using Post-tool Execution Reflection and RAG
by: Tsay, Jason, et al.
Published: (2025)
by: Tsay, Jason, et al.
Published: (2025)
The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration
by: Xu, Haoyuan, et al.
Published: (2026)
by: Xu, Haoyuan, et al.
Published: (2026)
ScaleCall -- Agentic Tool Calling at Scale for Fintech: Challenges, Methods, and Deployment Insights
by: Osuagwu, Richard, et al.
Published: (2025)
by: Osuagwu, Richard, et al.
Published: (2025)
ASA: Training-Free Representation Engineering for Tool-Calling Agents
by: Wang, Youjin, et al.
Published: (2026)
by: Wang, Youjin, et al.
Published: (2026)
Verification-Guided Context Optimization for Tool Calling via Hierarchical LLMs-as-Editors
by: Li, Henger, et al.
Published: (2025)
by: Li, Henger, et al.
Published: (2025)
Gecko: A Simulation Environment with Stateful Feedback for Refining Agent Tool Calls
by: Zhang, Zeyu, et al.
Published: (2026)
by: Zhang, Zeyu, et al.
Published: (2026)
MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
by: Huang, Yue, et al.
Published: (2023)
by: Huang, Yue, et al.
Published: (2023)
ReF Decompile: Relabeling and Function Call Enhanced Decompile
by: Feng, Yunlong, et al.
Published: (2025)
by: Feng, Yunlong, et al.
Published: (2025)
GrepRAG: An Empirical Study and Optimization of Grep-Like Retrieval for Code Completion
by: Wang, Baoyi, et al.
Published: (2026)
by: Wang, Baoyi, et al.
Published: (2026)
Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents
by: Li, Zeping, et al.
Published: (2026)
by: Li, Zeping, et al.
Published: (2026)
A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback Learning
by: Li, Xinzhe
Published: (2024)
by: Li, Xinzhe
Published: (2024)
ADC: Enhancing Function Calling Via Adversarial Datasets and Code Line-Level Feedback
by: Zhang, Wei, et al.
Published: (2024)
by: Zhang, Wei, et al.
Published: (2024)
ToolRegistry: A Protocol-Agnostic Tool Management Library for Function-Calling LLMs
by: Ding, Peng, et al.
Published: (2025)
by: Ding, Peng, et al.
Published: (2025)
Teaching Code LLMs to Use Autocompletion Tools in Repository-Level Code Generation
by: Wang, Chong, et al.
Published: (2024)
by: Wang, Chong, et al.
Published: (2024)
CallNavi, A Challenge and Empirical Study on LLM Function Calling and Routing
by: Song, Yewei, et al.
Published: (2025)
by: Song, Yewei, et al.
Published: (2025)
Clawdrain: Exploiting Tool-Calling Chains for Stealthy Token Exhaustion in OpenClaw Agents
by: Dong, Ben, et al.
Published: (2026)
by: Dong, Ben, et al.
Published: (2026)
Digging Into the Internal: Causality-Based Analysis of LLM Function Calling
by: Ji, Zhenlan, et al.
Published: (2025)
by: Ji, Zhenlan, et al.
Published: (2025)
CRITICTOOL: Evaluating Self-Critique Capabilities of Large Language Models in Tool-Calling Error Scenarios
by: Huang, Shiting, et al.
Published: (2025)
by: Huang, Shiting, et al.
Published: (2025)
Towards Verifiably Safe Tool Use for LLM Agents
by: Doshi, Aarya, et al.
Published: (2026)
by: Doshi, Aarya, et al.
Published: (2026)
Firefly: Illuminating Large-Scale Verified Tool-Call Data Generation from Real APIs
by: Lu, Yuxuan, et al.
Published: (2026)
by: Lu, Yuxuan, et al.
Published: (2026)
On Generalization in Agentic Tool Calling: CoreThink Agentic Reasoner and MAVEN Dataset
by: Bhat, Vishvesh, et al.
Published: (2025)
by: Bhat, Vishvesh, et al.
Published: (2025)
Semantic-Enhanced Indirect Call Analysis with Large Language Models
by: Cheng, Baijun, et al.
Published: (2024)
by: Cheng, Baijun, et al.
Published: (2024)
ROOT: Requirements Organization and Optimization Tool
by: Dearstyne, Katherine R., et al.
Published: (2024)
by: Dearstyne, Katherine R., et al.
Published: (2024)
ToolScan: A Benchmark for Characterizing Errors in Tool-Use LLMs
by: Kokane, Shirley, et al.
Published: (2024)
by: Kokane, Shirley, et al.
Published: (2024)
Live API-Bench: 2500+ Live APIs for Testing Multi-Step Tool Calling
by: Elder, Benjamin, et al.
Published: (2025)
by: Elder, Benjamin, et al.
Published: (2025)
Prompting in Practice: Investigating Software Practitioners' Use of Generative AI Tools
by: Otten, Daniel, et al.
Published: (2025)
by: Otten, Daniel, et al.
Published: (2025)
GeoJSON Agents:A Multi-Agent LLM Architecture for Geospatial Analysis-Function Calling vs Code Generation
by: Luo, Qianqian, et al.
Published: (2025)
by: Luo, Qianqian, et al.
Published: (2025)
AI Tool Use and Adoption in Software Development by Individuals and Organizations: A Grounded Theory Study
by: Li, Ze Shi, et al.
Published: (2024)
by: Li, Ze Shi, et al.
Published: (2024)
"I Don't Use AI for Everything": Exploring Utility, Attitude, and Responsibility of AI-empowered Tools in Software Development
by: Pan, Shidong, et al.
Published: (2024)
by: Pan, Shidong, et al.
Published: (2024)
LLM-Assisted Tool for Joint Generation of Formulas and Functions in Rule-Based Verification of Map Transformations
by: He, Ruidi, et al.
Published: (2025)
by: He, Ruidi, et al.
Published: (2025)
SkillCraft: Can LLM Agents Learn to Use Tools Skillfully?
by: Chen, Shiqi, et al.
Published: (2026)
by: Chen, Shiqi, et al.
Published: (2026)
RAG-Enhanced Commit Message Generation
by: Zhang, Linghao, et al.
Published: (2024)
by: Zhang, Linghao, et al.
Published: (2024)
EFACT: an External Function Auto-Completion Tool to Strengthen Static Binary Lifting
by: Zhang, Yilei, et al.
Published: (2024)
by: Zhang, Yilei, et al.
Published: (2024)
Who Tests the Testers? Systematic Enumeration and Coverage Audit of LLM Agent Tool Call Safety
by: Chen, Xuan, et al.
Published: (2026)
by: Chen, Xuan, et al.
Published: (2026)
Tool Calling is Linearly Readable and Steerable in Language Models
by: Wu, Zekun, et al.
Published: (2026)
by: Wu, Zekun, et al.
Published: (2026)
Investigating Tool-Memory Conflicts in Tool-Augmented LLMs
by: Cheng, Jiali, et al.
Published: (2026)
by: Cheng, Jiali, et al.
Published: (2026)
RITA: A Tool for Automated Requirements Classification and Specification from Online User Feedback
by: Mallya, Manjeshwar Aniruddh, et al.
Published: (2026)
by: Mallya, Manjeshwar Aniruddh, et al.
Published: (2026)
ToolScope: Enhancing LLM Agent Tool Use through Tool Merging and Context-Aware Filtering
by: Liu, Marianne Menglin, et al.
Published: (2025)
by: Liu, Marianne Menglin, et al.
Published: (2025)
Can You Mimic Me? Exploring the Use of Android Record & Replay Tools in Debugging
by: Song, Zihe, et al.
Published: (2025)
by: Song, Zihe, et al.
Published: (2025)
Automated Tool Support for Category-Partition Testing: Design Decisions, UI and Examples of Use
by: Labiche, Yvan
Published: (2026)
by: Labiche, Yvan
Published: (2026)
Similar Items
-
Repairing Tool Calls Using Post-tool Execution Reflection and RAG
by: Tsay, Jason, et al.
Published: (2025) -
The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration
by: Xu, Haoyuan, et al.
Published: (2026) -
ScaleCall -- Agentic Tool Calling at Scale for Fintech: Challenges, Methods, and Deployment Insights
by: Osuagwu, Richard, et al.
Published: (2025) -
ASA: Training-Free Representation Engineering for Tool-Calling Agents
by: Wang, Youjin, et al.
Published: (2026) -
Verification-Guided Context Optimization for Tool Calling via Hierarchical LLMs-as-Editors
by: Li, Henger, et al.
Published: (2025)