:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Dang, Hy, Dao, Quang, Jiang, Meng
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence Software Engineering
Online Access:	https://arxiv.org/abs/2604.00137
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

A Self-Healing Framework for Reliable LLM-Based Autonomous Agents
by: Jeong, Cheonsu, et al.
Published: (2026)

OSS-UAgent: An Agent-based Usability Evaluation Framework for Open Source Software
by: Meng, Lingkai, et al.
Published: (2025)

The Cognitive Circuit Breaker: A Systems Engineering Framework for Intrinsic AI Reliability
by: Pan, Jonathan
Published: (2026)

JTPRO: A Joint Tool-Prompt Reflective Optimization Framework for Language Agents
by: Ghoshal, Sandip, et al.
Published: (2026)

iPanda: An LLM-based Agent for Automated Conformance Testing of Communication Protocols
by: Sun, Xikai, et al.
Published: (2025)

Open-Source AI-based SE Tools: Opportunities and Challenges of Collaborative Software Learning
by: Lin, Zhihao, et al.
Published: (2024)

SynthTools: A Framework for Scaling Synthetic Tools for Agent Development
by: Castellani, Tommaso, et al.
Published: (2025)

ToolFuzz -- Automated Agent Tool Testing
by: Milev, Ivan, et al.
Published: (2025)

An Empirical Study of Agent Developer Practices in AI Agent Frameworks
by: Wang, Yanlin, et al.
Published: (2025)

MathViz-E: A Case-study in Domain-Specialized Tool-Using Agents
by: Bulusu, Arya, et al.
Published: (2024)

Z-Space: A Multi-Agent Tool Orchestration Framework for Enterprise-Grade LLM Automation
by: He, Qingsong, et al.
Published: (2025)

AI-Driven Tools in Modern Software Quality Assurance: An Assessment of Benefits, Challenges, and Future Directions
by: Pysmennyi, Ihor, et al.
Published: (2025)

AgentMesh: A Cooperative Multi-Agent Generative AI Framework for Software Development Automation
by: Khanzadeh, Sourena
Published: (2025)

On the Adoption of AI Coding Agents in Open-source Android and iOS Development
by: Khan, Muhammad Ahmad, et al.
Published: (2026)

AgentHub: A Registry for Discoverable, Verifiable, and Reproducible AI Agents
by: Pautsch, Erik, et al.
Published: (2025)

An Executable Benchmarking Suite for Tool-Using Agents
by: Zhong, Zhiqing, et al.
Published: (2026)

AI-Generated Smells: An Analysis of Code and Architecture in LLM and Agent-Driven Development
by: Zhu, Yuecai, et al.
Published: (2026)

VulnAgent-X: A Layered Agentic Framework for Repository-Level Vulnerability Detection
by: Meng, Renwei, et al.
Published: (2026)

The A-R Behavioral Space: Execution-Level Profiling of Tool-Using Language Model Agents in Organizational Deployment
by: Yu, Shasha, et al.
Published: (2026)

Contractual Skills: A GovernSpec Design Framework for Enterprise AI Agents
by: Liu, Ting
Published: (2026)

Butterfly Effects in Toolchains: A Comprehensive Analysis of Failed Parameter Filling in LLM Tool-Agent Systems
by: Xiong, Qian, et al.
Published: (2025)

Repeton: Structured Bug Repair with ReAct-Guided Patch-and-Test Cycles
by: Vinh, Nguyen Phu, et al.
Published: (2025)

Rethinking Software Engineering in the Foundation Model Era: From Task-Driven AI Copilots to Goal-Driven AI Pair Programmers
by: Hassan, Ahmed E., et al.
Published: (2024)

Agent Behavioral Contracts: Formal Specification and Runtime Enforcement for Reliable Autonomous AI Agents
by: Bhardwaj, Varun Pratap
Published: (2026)

ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents
by: Li, Dawei, et al.
Published: (2026)

Is Open Source the Future of AI? A Data-Driven Approach
by: Vake, Domen, et al.
Published: (2025)

Toward a Science of Intent: Closure Gaps and Delegation Envelopes for Open-World AI Agents
by: Armesto, Maximiliano, et al.
Published: (2026)

OpenHands: An Open Platform for AI Software Developers as Generalist Agents
by: Wang, Xingyao, et al.
Published: (2024)

ParaTool: Shifting Tool Representations from Context to Parameters
by: Yu, Zekai, et al.
Published: (2026)

AgentGit: A Version Control Framework for Reliable and Scalable LLM-Powered Multi-Agent Systems
by: Li, Yang, et al.
Published: (2025)

Schema First Tool APIs for LLM Agents: A Controlled Study of Tool Misuse, Recovery, and Budgeted Performance
by: Sigdel, Akshey, et al.
Published: (2026)

Towards Reliable LLM-Driven Fuzz Testing: Vision and Road Ahead
by: Cheng, Yiran, et al.
Published: (2025)

Unified Software Engineering Agent as AI Software Engineer
by: Applis, Leonhard, et al.
Published: (2025)

TREAT: A Code LLMs Trustworthiness / Reliability Evaluation and Testing Framework
by: Gao, Shuzheng, et al.
Published: (2025)

OpenDerisk: An Industrial Framework for AI-Driven SRE, with Design, Implementation, and Case Studies
by: Di, Peng, et al.
Published: (2025)

Agent Design Pattern Catalogue: A Collection of Architectural Patterns for Foundation Model based Agents
by: Liu, Yue, et al.
Published: (2024)

How AI Coding Agents Communicate: A Study of Pull Request Description Characteristics and Human Review Responses
by: Watanabe, Kan, et al.
Published: (2026)

The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents
by: Wang, Xingyao, et al.
Published: (2025)

From Translation to Superset: Benchmark-Driven Evolution of a Production AI Agent from Rust to Python
by: Wang, Jinhua, et al.
Published: (2026)

Beyond Autonomy: A Dynamic Tiered AgentRunner Framework for Governable and Resilient Enterprise AI Execution
by: Pan, Kai, et al.
Published: (2026)