Saved in:
| Main Authors: | Osuagwu, Richard, Cook, Thomas, Masoud, Maraim, Ghosal, Koustav, Mattivi, Riccardo |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.00074 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Retrieval Augmented Generation (RAG) for Fintech: Agentic Design and Evaluation
by: Cook, Thomas, et al.
Published: (2025)
by: Cook, Thomas, et al.
Published: (2025)
On Generalization in Agentic Tool Calling: CoreThink Agentic Reasoner and MAVEN Dataset
by: Bhat, Vishvesh, et al.
Published: (2025)
by: Bhat, Vishvesh, et al.
Published: (2025)
Firefly: Illuminating Large-Scale Verified Tool-Call Data Generation from Real APIs
by: Lu, Yuxuan, et al.
Published: (2026)
by: Lu, Yuxuan, et al.
Published: (2026)
Online-Optimized RAG for Tool Use and Function Calling
by: Pan, Yu, et al.
Published: (2025)
by: Pan, Yu, et al.
Published: (2025)
CallNavi, A Challenge and Empirical Study on LLM Function Calling and Routing
by: Song, Yewei, et al.
Published: (2025)
by: Song, Yewei, et al.
Published: (2025)
A Large-Scale Study of Call Graph-based Impact Prediction using Mutation Testing
by: Musco, Vincenzo, et al.
Published: (2018)
by: Musco, Vincenzo, et al.
Published: (2018)
ASA: Training-Free Representation Engineering for Tool-Calling Agents
by: Wang, Youjin, et al.
Published: (2026)
by: Wang, Youjin, et al.
Published: (2026)
The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration
by: Xu, Haoyuan, et al.
Published: (2026)
by: Xu, Haoyuan, et al.
Published: (2026)
Optimizing Agentic Language Model Inference via Speculative Tool Calls
by: Nichols, Daniel, et al.
Published: (2025)
by: Nichols, Daniel, et al.
Published: (2025)
Repairing Tool Calls Using Post-tool Execution Reflection and RAG
by: Tsay, Jason, et al.
Published: (2025)
by: Tsay, Jason, et al.
Published: (2025)
Verification-Guided Context Optimization for Tool Calling via Hierarchical LLMs-as-Editors
by: Li, Henger, et al.
Published: (2025)
by: Li, Henger, et al.
Published: (2025)
Gecko: A Simulation Environment with Stateful Feedback for Refining Agent Tool Calls
by: Zhang, Zeyu, et al.
Published: (2026)
by: Zhang, Zeyu, et al.
Published: (2026)
Call Graph Soundness in Android Static Analysis
by: Samhi, Jordan, et al.
Published: (2024)
by: Samhi, Jordan, et al.
Published: (2024)
Live API-Bench: 2500+ Live APIs for Testing Multi-Step Tool Calling
by: Elder, Benjamin, et al.
Published: (2025)
by: Elder, Benjamin, et al.
Published: (2025)
Clawdrain: Exploiting Tool-Calling Chains for Stealthy Token Exhaustion in OpenClaw Agents
by: Dong, Ben, et al.
Published: (2026)
by: Dong, Ben, et al.
Published: (2026)
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use
by: Chen, Aili, et al.
Published: (2026)
by: Chen, Aili, et al.
Published: (2026)
OriginPruner: Leveraging Method Origins for Guided Call Graph Pruning
by: Mir, Amir M., et al.
Published: (2024)
by: Mir, Amir M., et al.
Published: (2024)
Tool Calling is Linearly Readable and Steerable in Language Models
by: Wu, Zekun, et al.
Published: (2026)
by: Wu, Zekun, et al.
Published: (2026)
Help Without Being Asked: A Deployed Proactive Agent System for On-Call Support with Continuous Self-Improvement
by: Liu, Fengrui, et al.
Published: (2026)
by: Liu, Fengrui, et al.
Published: (2026)
Detecting Call Graph Unsoundness without Ground Truth
by: Zhong, Fangtian, et al.
Published: (2026)
by: Zhong, Fangtian, et al.
Published: (2026)
CRITICTOOL: Evaluating Self-Critique Capabilities of Large Language Models in Tool-Calling Error Scenarios
by: Huang, Shiting, et al.
Published: (2025)
by: Huang, Shiting, et al.
Published: (2025)
Who Tests the Testers? Systematic Enumeration and Coverage Audit of LLM Agent Tool Call Safety
by: Chen, Xuan, et al.
Published: (2026)
by: Chen, Xuan, et al.
Published: (2026)
ReF Decompile: Relabeling and Function Call Enhanced Decompile
by: Feng, Yunlong, et al.
Published: (2025)
by: Feng, Yunlong, et al.
Published: (2025)
Scalable and Precise Application-Centered Call Graph Construction for Python
by: Huang, Kaifeng, et al.
Published: (2023)
by: Huang, Kaifeng, et al.
Published: (2023)
Static JavaScript Call Graphs: A Comparative Study
by: Antal, Gábor, et al.
Published: (2024)
by: Antal, Gábor, et al.
Published: (2024)
Semantic-Enhanced Indirect Call Analysis with Large Language Models
by: Cheng, Baijun, et al.
Published: (2024)
by: Cheng, Baijun, et al.
Published: (2024)
Simulating Complex Multi-Turn Tool Calling Interactions in Stateless Execution Environments
by: Crouse, Maxwell, et al.
Published: (2026)
by: Crouse, Maxwell, et al.
Published: (2026)
Benchmarks as Microscopes: A Call for Model Metrology
by: Saxon, Michael, et al.
Published: (2024)
by: Saxon, Michael, et al.
Published: (2024)
Seneca: Taint-Based Call Graph Construction for Java Object Deserialization
by: Santos, Joanna C. S., et al.
Published: (2023)
by: Santos, Joanna C. S., et al.
Published: (2023)
ToolRegistry: A Protocol-Agnostic Tool Management Library for Function-Calling LLMs
by: Ding, Peng, et al.
Published: (2025)
by: Ding, Peng, et al.
Published: (2025)
Improving Examples in Web API Specifications using Iterated-Calls In-Context Learning
by: Jain, Kush, et al.
Published: (2025)
by: Jain, Kush, et al.
Published: (2025)
Morphis: SLO-Aware Resource Scheduling for Microservices with Time-Varying Call Graphs
by: Tang, Yu, et al.
Published: (2026)
by: Tang, Yu, et al.
Published: (2026)
Agentic AI in Industry: Adoption Level and Deployment Barriers
by: Apostolou, Spyridon Alvanakis, et al.
Published: (2026)
by: Apostolou, Spyridon Alvanakis, et al.
Published: (2026)
Call Me Maybe: Enhancing JavaScript Call Graph Construction using Graph Neural Networks
by: Bhuiyan, Masudul Hasan Masud, et al.
Published: (2025)
by: Bhuiyan, Masudul Hasan Masud, et al.
Published: (2025)
SynAE: A Framework for Measuring the Quality of Synthetic Data for Tool-Calling Agent Evaluations
by: Wang, Shuaiqi, et al.
Published: (2026)
by: Wang, Shuaiqi, et al.
Published: (2026)
Mind the GAP: Text Safety Does Not Transfer to Tool-Call Safety in LLM Agents
by: Cartagena, Arnold, et al.
Published: (2026)
by: Cartagena, Arnold, et al.
Published: (2026)
ADC: Enhancing Function Calling Via Adversarial Datasets and Code Line-Level Feedback
by: Zhang, Wei, et al.
Published: (2024)
by: Zhang, Wei, et al.
Published: (2024)
Who's actually being Studied? A Call for Population Analysis in Software Engineering Research
by: Molléri, Jefferson Seide
Published: (2024)
by: Molléri, Jefferson Seide
Published: (2024)
Digging Into the Internal: Causality-Based Analysis of LLM Function Calling
by: Ji, Zhenlan, et al.
Published: (2025)
by: Ji, Zhenlan, et al.
Published: (2025)
Enhanced Bug Prediction in JavaScript Programs with Hybrid Call-Graph Based Invocation Metrics
by: Antal, Gábor, et al.
Published: (2024)
by: Antal, Gábor, et al.
Published: (2024)
Similar Items
-
Retrieval Augmented Generation (RAG) for Fintech: Agentic Design and Evaluation
by: Cook, Thomas, et al.
Published: (2025) -
On Generalization in Agentic Tool Calling: CoreThink Agentic Reasoner and MAVEN Dataset
by: Bhat, Vishvesh, et al.
Published: (2025) -
Firefly: Illuminating Large-Scale Verified Tool-Call Data Generation from Real APIs
by: Lu, Yuxuan, et al.
Published: (2026) -
Online-Optimized RAG for Tool Use and Function Calling
by: Pan, Yu, et al.
Published: (2025) -
CallNavi, A Challenge and Empirical Study on LLM Function Calling and Routing
by: Song, Yewei, et al.
Published: (2025)