:: Library Catalog

Bibliographic Details
Main Authors:	Delassi, Khaled Bachir; Zeggane, Lakhdar; Cherroun, Hadda; Haouhat, Abdelhamid; Bouzouad, Kaoutar
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence; Software Engineering
Online Access:	https://arxiv.org/abs/2508.03488

Similar Items

Arabic Multimodal Machine Learning: Datasets, Applications, Approaches, and Challenges
by: Haouhat, Abdelhamid, et al.
Published: (2025)

Semantic Tool Discovery for Large Language Models: A Vector-Based Approach to MCP Tool Selection
by: Mudunuri, Sarat, et al.
Published: (2026)

The Potential of LLMs in Automating Software Testing: From Generation to Reporting
by: Sherifi, Betim, et al.
Published: (2024)

Tool-integrated Reinforcement Learning for Repo Deep Search
by: Ma, Zexiong, et al.
Published: (2025)

A Tool for Generating Exceptional Behavior Tests With Large Language Models
by: Zhong, Linghan, et al.
Published: (2025)

ToolFuzz -- Automated Agent Tool Testing
by: Milev, Ivan, et al.
Published: (2025)

JTPRO: A Joint Tool-Prompt Reflective Optimization Framework for Language Agents
by: Ghoshal, Sandip, et al.
Published: (2026)

Investigating Tool-Memory Conflicts in Tool-Augmented LLMs
by: Cheng, Jiali, et al.
Published: (2026)

Teaching LLMs to Learn Tool Trialing and Execution through Environment Interaction
by: Gao, Xingjie, et al.
Published: (2026)

Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents
by: Li, Zeping, et al.
Published: (2026)

ParaTool: Shifting Tool Representations from Context to Parameters
by: Yu, Zekai, et al.
Published: (2026)

LLMs Integration in Software Engineering Team Projects: Roles, Impact, and a Pedagogical Design Space for AI Tools in Computing Education
by: Kharrufa, Ahmed, et al.
Published: (2024)

MIMIC-Py: An Extensible Tool for Personality-Driven Automated Game Testing with Large Language Models
by: Chen, Yifei, et al.
Published: (2026)

ToolScan: A Benchmark for Characterizing Errors in Tool-Use LLMs
by: Kokane, Shirley, et al.
Published: (2024)

ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents
by: Li, Dawei, et al.
Published: (2026)

Comparison of Static Application Security Testing Tools and Large Language Models for Repo-level Vulnerability Detection
by: Zhou, Xin, et al.
Published: (2024)

Open-Source AI-based SE Tools: Opportunities and Challenges of Collaborative Software Learning
by: Lin, Zhihao, et al.
Published: (2024)

EvolveTool-Bench: Evaluating the Quality of LLM-Generated Tool Libraries as Software Artifacts
by: Kaliyev, Alibek T., et al.
Published: (2026)

ToolMisuseBench: An Offline Deterministic Benchmark for Tool Misuse and Recovery in Agentic Systems
by: Sigdel, Akshey, et al.
Published: (2026)

The A-R Behavioral Space: Execution-Level Profiling of Tool-Using Language Model Agents in Organizational Deployment
by: Yu, Shasha, et al.
Published: (2026)

Class-Level Code Generation from Natural Language Using Iterative, Tool-Enhanced Reasoning over Repository
by: Deshpande, Ajinkya, et al.
Published: (2024)

The Tool-Overuse Illusion: Why Does LLM Prefer External Tools over Internal Knowledge?
by: Zeng, Yirong, et al.
Published: (2026)

Breaking the Illusion of Identity in LLM Tooling
by: Miller, Marek
Published: (2026)

Schema First Tool APIs for LLM Agents: A Controlled Study of Tool Misuse, Recovery, and Budgeted Performance
by: Sigdel, Akshey, et al.
Published: (2026)

User Centric Evaluation of Code Generation Tools
by: Miah, Tanha, et al.
Published: (2024)

Towards a Digital Twin Modeling Method for Container Terminal Port
by: Hakimi, Faouzi, et al.
Published: (2025)

PyGen: A Collaborative Human-AI Approach to Python Package Creation
by: Barua, Saikat, et al.
Published: (2024)

ReXCL: A Tool for Requirement Document Extraction and Classification
by: Bhattacharya, Paheli, et al.
Published: (2025)

AISysRev -- LLM-based Tool for Title-abstract Screening
by: Huotala, Aleksi, et al.
Published: (2025)

MCP-Zero: Active Tool Discovery for Autonomous LLM Agents
by: Fei, Xiang, et al.
Published: (2025)

ASA: Training-Free Representation Engineering for Tool-Calling Agents
by: Wang, Youjin, et al.
Published: (2026)

Squeez: Task-Conditioned Tool-Output Pruning for Coding Agents
by: Kovács, Ádám
Published: (2026)

Applying an Agentic Coding Tool for Improving Published Algorithm Implementations
by: Suwannik, Worasait
Published: (2026)

Revisiting Software Engineering Education in the Era of Large Language Models: A Curriculum Adaptation and Academic Integrity Framework
by: Degerli, Mustafa
Published: (2026)

Automated Creation and Enrichment Framework for Improved Invocation of Enterprise APIs as Tools
by: Agarwal, Prerna, et al.
Published: (2025)

Repairing Tool Calls Using Post-tool Execution Reflection and RAG
by: Tsay, Jason, et al.
Published: (2025)

Solver-Aided Verification of Policy Compliance in Tool-Augmented LLM Agents
by: Winston, Cailin, et al.
Published: (2026)

DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use
by: Chen, Aili, et al.
Published: (2026)

Trajectory Supervision for Continual Tool-Use Learning in LLMs
by: Reddy, Vishnu Vardhan, et al.
Published: (2026)

Leveraging LLMs to support co-evolution between definitions and instances of textual DSLs: A Systematic Evaluation
by: Zhang, Weixing, et al.
Published: (2026)