:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Osuagwu, Richard, Cook, Thomas, Masoud, Maraim, Ghosal, Koustav, Mattivi, Riccardo
Format:	Preprint
Published:	2025
Subjects:	Software Engineering
Online Access:	https://arxiv.org/abs/2511.00074
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Retrieval Augmented Generation (RAG) for Fintech: Agentic Design and Evaluation
by: Cook, Thomas, et al.
Published: (2025)

On Generalization in Agentic Tool Calling: CoreThink Agentic Reasoner and MAVEN Dataset
by: Bhat, Vishvesh, et al.
Published: (2025)

Firefly: Illuminating Large-Scale Verified Tool-Call Data Generation from Real APIs
by: Lu, Yuxuan, et al.
Published: (2026)

Online-Optimized RAG for Tool Use and Function Calling
by: Pan, Yu, et al.
Published: (2025)

CallNavi, A Challenge and Empirical Study on LLM Function Calling and Routing
by: Song, Yewei, et al.
Published: (2025)

A Large-Scale Study of Call Graph-based Impact Prediction using Mutation Testing
by: Musco, Vincenzo, et al.
Published: (2018)

ASA: Training-Free Representation Engineering for Tool-Calling Agents
by: Wang, Youjin, et al.
Published: (2026)

The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration
by: Xu, Haoyuan, et al.
Published: (2026)

Optimizing Agentic Language Model Inference via Speculative Tool Calls
by: Nichols, Daniel, et al.
Published: (2025)

Repairing Tool Calls Using Post-tool Execution Reflection and RAG
by: Tsay, Jason, et al.
Published: (2025)

Verification-Guided Context Optimization for Tool Calling via Hierarchical LLMs-as-Editors
by: Li, Henger, et al.
Published: (2025)

Gecko: A Simulation Environment with Stateful Feedback for Refining Agent Tool Calls
by: Zhang, Zeyu, et al.
Published: (2026)

Call Graph Soundness in Android Static Analysis
by: Samhi, Jordan, et al.
Published: (2024)

Live API-Bench: 2500+ Live APIs for Testing Multi-Step Tool Calling
by: Elder, Benjamin, et al.
Published: (2025)

Clawdrain: Exploiting Tool-Calling Chains for Stealthy Token Exhaustion in OpenClaw Agents
by: Dong, Ben, et al.
Published: (2026)

DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use
by: Chen, Aili, et al.
Published: (2026)

OriginPruner: Leveraging Method Origins for Guided Call Graph Pruning
by: Mir, Amir M., et al.
Published: (2024)

Tool Calling is Linearly Readable and Steerable in Language Models
by: Wu, Zekun, et al.
Published: (2026)

Help Without Being Asked: A Deployed Proactive Agent System for On-Call Support with Continuous Self-Improvement
by: Liu, Fengrui, et al.
Published: (2026)

Detecting Call Graph Unsoundness without Ground Truth
by: Zhong, Fangtian, et al.
Published: (2026)

CRITICTOOL: Evaluating Self-Critique Capabilities of Large Language Models in Tool-Calling Error Scenarios
by: Huang, Shiting, et al.
Published: (2025)

Who Tests the Testers? Systematic Enumeration and Coverage Audit of LLM Agent Tool Call Safety
by: Chen, Xuan, et al.
Published: (2026)

ReF Decompile: Relabeling and Function Call Enhanced Decompile
by: Feng, Yunlong, et al.
Published: (2025)

Scalable and Precise Application-Centered Call Graph Construction for Python
by: Huang, Kaifeng, et al.
Published: (2023)

Static JavaScript Call Graphs: A Comparative Study
by: Antal, Gábor, et al.
Published: (2024)

Semantic-Enhanced Indirect Call Analysis with Large Language Models
by: Cheng, Baijun, et al.
Published: (2024)

Simulating Complex Multi-Turn Tool Calling Interactions in Stateless Execution Environments
by: Crouse, Maxwell, et al.
Published: (2026)

Benchmarks as Microscopes: A Call for Model Metrology
by: Saxon, Michael, et al.
Published: (2024)

Seneca: Taint-Based Call Graph Construction for Java Object Deserialization
by: Santos, Joanna C. S., et al.
Published: (2023)

ToolRegistry: A Protocol-Agnostic Tool Management Library for Function-Calling LLMs
by: Ding, Peng, et al.
Published: (2025)

Improving Examples in Web API Specifications using Iterated-Calls In-Context Learning
by: Jain, Kush, et al.
Published: (2025)

Morphis: SLO-Aware Resource Scheduling for Microservices with Time-Varying Call Graphs
by: Tang, Yu, et al.
Published: (2026)

Agentic AI in Industry: Adoption Level and Deployment Barriers
by: Apostolou, Spyridon Alvanakis, et al.
Published: (2026)

Call Me Maybe: Enhancing JavaScript Call Graph Construction using Graph Neural Networks
by: Bhuiyan, Masudul Hasan Masud, et al.
Published: (2025)

SynAE: A Framework for Measuring the Quality of Synthetic Data for Tool-Calling Agent Evaluations
by: Wang, Shuaiqi, et al.
Published: (2026)

Mind the GAP: Text Safety Does Not Transfer to Tool-Call Safety in LLM Agents
by: Cartagena, Arnold, et al.
Published: (2026)

ADC: Enhancing Function Calling Via Adversarial Datasets and Code Line-Level Feedback
by: Zhang, Wei, et al.
Published: (2024)

Who's actually being Studied? A Call for Population Analysis in Software Engineering Research
by: Molléri, Jefferson Seide
Published: (2024)

Digging Into the Internal: Causality-Based Analysis of LLM Function Calling
by: Ji, Zhenlan, et al.
Published: (2025)

Enhanced Bug Prediction in JavaScript Programs with Hybrid Call-Graph Based Invocation Metrics
by: Antal, Gábor, et al.
Published: (2024)