| Main Authors: | Delassi, Khaled Bachir, Zeggane, Lakhdar, Cherroun, Hadda, Haouhat, Abdelhamid, Bouzouad, Kaoutar |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2508.03488 |
Similar Items
Arabic Multimodal Machine Learning: Datasets, Applications, Approaches, and Challenges
by: Haouhat, Abdelhamid, et al.
Published: (2025)
Semantic Tool Discovery for Large Language Models: A Vector-Based Approach to MCP Tool Selection
by: Mudunuri, Sarat, et al.
Published: (2026)
The Potential of LLMs in Automating Software Testing: From Generation to Reporting
by: Sherifi, Betim, et al.
Published: (2024)
Tool-integrated Reinforcement Learning for Repo Deep Search
by: Ma, Zexiong, et al.
Published: (2025)
A Tool for Generating Exceptional Behavior Tests With Large Language Models
by: Zhong, Linghan, et al.
Published: (2025)
ToolFuzz -- Automated Agent Tool Testing
by: Milev, Ivan, et al.
Published: (2025)
JTPRO: A Joint Tool-Prompt Reflective Optimization Framework for Language Agents
by: Ghoshal, Sandip, et al.
Published: (2026)
Investigating Tool-Memory Conflicts in Tool-Augmented LLMs
by: Cheng, Jiali, et al.
Published: (2026)
Teaching LLMs to Learn Tool Trialing and Execution through Environment Interaction
by: Gao, Xingjie, et al.
Published: (2026)
Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents
by: Li, Zeping, et al.
Published: (2026)
ParaTool: Shifting Tool Representations from Context to Parameters
by: Yu, Zekai, et al.
Published: (2026)
LLMs Integration in Software Engineering Team Projects: Roles, Impact, and a Pedagogical Design Space for AI Tools in Computing Education
by: Kharrufa, Ahmed, et al.
Published: (2024)
MIMIC-Py: An Extensible Tool for Personality-Driven Automated Game Testing with Large Language Models
by: Chen, Yifei, et al.
Published: (2026)
ToolScan: A Benchmark for Characterizing Errors in Tool-Use LLMs
by: Kokane, Shirley, et al.
Published: (2024)
ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents
by: Li, Dawei, et al.
Published: (2026)
Comparison of Static Application Security Testing Tools and Large Language Models for Repo-level Vulnerability Detection
by: Zhou, Xin, et al.
Published: (2024)
Open-Source AI-based SE Tools: Opportunities and Challenges of Collaborative Software Learning
by: Lin, Zhihao, et al.
Published: (2024)
EvolveTool-Bench: Evaluating the Quality of LLM-Generated Tool Libraries as Software Artifacts
by: Kaliyev, Alibek T., et al.
Published: (2026)
ToolMisuseBench: An Offline Deterministic Benchmark for Tool Misuse and Recovery in Agentic Systems
by: Sigdel, Akshey, et al.
Published: (2026)
The A-R Behavioral Space: Execution-Level Profiling of Tool-Using Language Model Agents in Organizational Deployment
by: Yu, Shasha, et al.
Published: (2026)
Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository
by: Deshpande, Ajinkya, et al.
Published: (2024)
The Tool-Overuse Illusion: Why Does LLM Prefer External Tools over Internal Knowledge?
by: Zeng, Yirong, et al.
Published: (2026)
Breaking the Illusion of Identity in LLM Tooling
by: Miller, Marek
Published: (2026)
Schema First Tool APIs for LLM Agents: A Controlled Study of Tool Misuse, Recovery, and Budgeted Performance
by: Sigdel, Akshey, et al.
Published: (2026)
User Centric Evaluation of Code Generation Tools
by: Miah, Tanha, et al.
Published: (2024)
Towards a Digital Twin Modeling Method for Container Terminal Port
by: Hakimi, Faouzi, et al.
Published: (2025)
PyGen: A Collaborative Human-AI Approach to Python Package Creation
by: Barua, Saikat, et al.
Published: (2024)
ReXCL: A Tool for Requirement Document Extraction and Classification
by: Bhattacharya, Paheli, et al.
Published: (2025)
AISysRev -- LLM-based Tool for Title-abstract Screening
by: Huotala, Aleksi, et al.
Published: (2025)
MCP-Zero: Active Tool Discovery for Autonomous LLM Agents
by: Fei, Xiang, et al.
Published: (2025)
ASA: Training-Free Representation Engineering for Tool-Calling Agents
by: Wang, Youjin, et al.
Published: (2026)
Squeez: Task-Conditioned Tool-Output Pruning for Coding Agents
by: Kovács, Ádám
Published: (2026)
Applying an Agentic Coding Tool for Improving Published Algorithm Implementations
by: Suwannik, Worasait
Published: (2026)
Revisiting Software Engineering Education in the Era of Large Language Models: A Curriculum Adaptation and Academic Integrity Framework
by: Degerli, Mustafa
Published: (2026)
Automated Creation and Enrichment Framework for Improved Invocation of Enterprise APIs as Tools
by: Agarwal, Prerna, et al.
Published: (2025)
Repairing Tool Calls Using Post-tool Execution Reflection and RAG
by: Tsay, Jason, et al.
Published: (2025)
Solver-Aided Verification of Policy Compliance in Tool-Augmented LLM Agents
by: Winston, Cailin, et al.
Published: (2026)
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use
by: Chen, Aili, et al.
Published: (2026)
Trajectory Supervision for Continual Tool-Use Learning in LLMs
by: Reddy, Vishnu Vardhan, et al.
Published: (2026)
Leveraging LLMs to support co-evolution between definitions and instances of textual DSLs: A Systematic Evaluation
by: Zhang, Weixing, et al.
Published: (2026)