:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Pan, Yu, Li, Xiaocheng, Wang, Hanzhao
Format:	Preprint
Published:	2025
Subjects:	Software Engineering
Online Access:	https://arxiv.org/abs/2509.20415
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Repairing Tool Calls Using Post-tool Execution Reflection and RAG
by: Tsay, Jason, et al.
Published: (2025)

The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration
by: Xu, Haoyuan, et al.
Published: (2026)

ScaleCall -- Agentic Tool Calling at Scale for Fintech: Challenges, Methods, and Deployment Insights
by: Osuagwu, Richard, et al.
Published: (2025)

ASA: Training-Free Representation Engineering for Tool-Calling Agents
by: Wang, Youjin, et al.
Published: (2026)

Verification-Guided Context Optimization for Tool Calling via Hierarchical LLMs-as-Editors
by: Li, Henger, et al.
Published: (2025)

Gecko: A Simulation Environment with Stateful Feedback for Refining Agent Tool Calls
by: Zhang, Zeyu, et al.
Published: (2026)

MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
by: Huang, Yue, et al.
Published: (2023)

ReF Decompile: Relabeling and Function Call Enhanced Decompile
by: Feng, Yunlong, et al.
Published: (2025)

GrepRAG: An Empirical Study and Optimization of Grep-Like Retrieval for Code Completion
by: Wang, Baoyi, et al.
Published: (2026)

Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents
by: Li, Zeping, et al.
Published: (2026)

A Review of Prominent Paradigms for LLM-Based Agents: Tool Use (Including RAG), Planning, and Feedback Learning
by: Li, Xinzhe
Published: (2024)

ADC: Enhancing Function Calling Via Adversarial Datasets and Code Line-Level Feedback
by: Zhang, Wei, et al.
Published: (2024)

ToolRegistry: A Protocol-Agnostic Tool Management Library for Function-Calling LLMs
by: Ding, Peng, et al.
Published: (2025)

Teaching Code LLMs to Use Autocompletion Tools in Repository-Level Code Generation
by: Wang, Chong, et al.
Published: (2024)

CallNavi, A Challenge and Empirical Study on LLM Function Calling and Routing
by: Song, Yewei, et al.
Published: (2025)

Clawdrain: Exploiting Tool-Calling Chains for Stealthy Token Exhaustion in OpenClaw Agents
by: Dong, Ben, et al.
Published: (2026)

Digging Into the Internal: Causality-Based Analysis of LLM Function Calling
by: Ji, Zhenlan, et al.
Published: (2025)

CRITICTOOL: Evaluating Self-Critique Capabilities of Large Language Models in Tool-Calling Error Scenarios
by: Huang, Shiting, et al.
Published: (2025)

Towards Verifiably Safe Tool Use for LLM Agents
by: Doshi, Aarya, et al.
Published: (2026)

Firefly: Illuminating Large-Scale Verified Tool-Call Data Generation from Real APIs
by: Lu, Yuxuan, et al.
Published: (2026)

On Generalization in Agentic Tool Calling: CoreThink Agentic Reasoner and MAVEN Dataset
by: Bhat, Vishvesh, et al.
Published: (2025)

Semantic-Enhanced Indirect Call Analysis with Large Language Models
by: Cheng, Baijun, et al.
Published: (2024)

ROOT: Requirements Organization and Optimization Tool
by: Dearstyne, Katherine R., et al.
Published: (2024)

ToolScan: A Benchmark for Characterizing Errors in Tool-Use LLMs
by: Kokane, Shirley, et al.
Published: (2024)

Live API-Bench: 2500+ Live APIs for Testing Multi-Step Tool Calling
by: Elder, Benjamin, et al.
Published: (2025)

Prompting in Practice: Investigating Software Practitioners' Use of Generative AI Tools
by: Otten, Daniel, et al.
Published: (2025)

GeoJSON Agents:A Multi-Agent LLM Architecture for Geospatial Analysis-Function Calling vs Code Generation
by: Luo, Qianqian, et al.
Published: (2025)

AI Tool Use and Adoption in Software Development by Individuals and Organizations: A Grounded Theory Study
by: Li, Ze Shi, et al.
Published: (2024)

"I Don't Use AI for Everything": Exploring Utility, Attitude, and Responsibility of AI-empowered Tools in Software Development
by: Pan, Shidong, et al.
Published: (2024)

LLM-Assisted Tool for Joint Generation of Formulas and Functions in Rule-Based Verification of Map Transformations
by: He, Ruidi, et al.
Published: (2025)

SkillCraft: Can LLM Agents Learn to Use Tools Skillfully?
by: Chen, Shiqi, et al.
Published: (2026)

RAG-Enhanced Commit Message Generation
by: Zhang, Linghao, et al.
Published: (2024)

EFACT: an External Function Auto-Completion Tool to Strengthen Static Binary Lifting
by: Zhang, Yilei, et al.
Published: (2024)

Who Tests the Testers? Systematic Enumeration and Coverage Audit of LLM Agent Tool Call Safety
by: Chen, Xuan, et al.
Published: (2026)

Tool Calling is Linearly Readable and Steerable in Language Models
by: Wu, Zekun, et al.
Published: (2026)

Investigating Tool-Memory Conflicts in Tool-Augmented LLMs
by: Cheng, Jiali, et al.
Published: (2026)

RITA: A Tool for Automated Requirements Classification and Specification from Online User Feedback
by: Mallya, Manjeshwar Aniruddh, et al.
Published: (2026)

ToolScope: Enhancing LLM Agent Tool Use through Tool Merging and Context-Aware Filtering
by: Liu, Marianne Menglin, et al.
Published: (2025)

Can You Mimic Me? Exploring the Use of Android Record & Replay Tools in Debugging
by: Song, Zihe, et al.
Published: (2025)

Automated Tool Support for Category-Partition Testing: Design Decisions, UI and Examples of Use
by: Labiche, Yvan
Published: (2026)