Saved in:
| Main Authors: | Dang, Hy, Dao, Quang, Jiang, Meng |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.00137 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Self-Healing Framework for Reliable LLM-Based Autonomous Agents
by: Jeong, Cheonsu, et al.
Published: (2026)
by: Jeong, Cheonsu, et al.
Published: (2026)
OSS-UAgent: An Agent-based Usability Evaluation Framework for Open Source Software
by: Meng, Lingkai, et al.
Published: (2025)
by: Meng, Lingkai, et al.
Published: (2025)
The Cognitive Circuit Breaker: A Systems Engineering Framework for Intrinsic AI Reliability
by: Pan, Jonathan
Published: (2026)
by: Pan, Jonathan
Published: (2026)
JTPRO: A Joint Tool-Prompt Reflective Optimization Framework for Language Agents
by: Ghoshal, Sandip, et al.
Published: (2026)
by: Ghoshal, Sandip, et al.
Published: (2026)
iPanda: An LLM-based Agent for Automated Conformance Testing of Communication Protocols
by: Sun, Xikai, et al.
Published: (2025)
by: Sun, Xikai, et al.
Published: (2025)
Open-Source AI-based SE Tools: Opportunities and Challenges of Collaborative Software Learning
by: Lin, Zhihao, et al.
Published: (2024)
by: Lin, Zhihao, et al.
Published: (2024)
SynthTools: A Framework for Scaling Synthetic Tools for Agent Development
by: Castellani, Tommaso, et al.
Published: (2025)
by: Castellani, Tommaso, et al.
Published: (2025)
ToolFuzz -- Automated Agent Tool Testing
by: Milev, Ivan, et al.
Published: (2025)
by: Milev, Ivan, et al.
Published: (2025)
An Empirical Study of Agent Developer Practices in AI Agent Frameworks
by: Wang, Yanlin, et al.
Published: (2025)
by: Wang, Yanlin, et al.
Published: (2025)
MathViz-E: A Case-study in Domain-Specialized Tool-Using Agents
by: Bulusu, Arya, et al.
Published: (2024)
by: Bulusu, Arya, et al.
Published: (2024)
Z-Space: A Multi-Agent Tool Orchestration Framework for Enterprise-Grade LLM Automation
by: He, Qingsong, et al.
Published: (2025)
by: He, Qingsong, et al.
Published: (2025)
AI-Driven Tools in Modern Software Quality Assurance: An Assessment of Benefits, Challenges, and Future Directions
by: Pysmennyi, Ihor, et al.
Published: (2025)
by: Pysmennyi, Ihor, et al.
Published: (2025)
AgentMesh: A Cooperative Multi-Agent Generative AI Framework for Software Development Automation
by: Khanzadeh, Sourena
Published: (2025)
by: Khanzadeh, Sourena
Published: (2025)
On the Adoption of AI Coding Agents in Open-source Android and iOS Development
by: Khan, Muhammad Ahmad, et al.
Published: (2026)
by: Khan, Muhammad Ahmad, et al.
Published: (2026)
AgentHub: A Registry for Discoverable, Verifiable, and Reproducible AI Agents
by: Pautsch, Erik, et al.
Published: (2025)
by: Pautsch, Erik, et al.
Published: (2025)
An Executable Benchmarking Suite for Tool-Using Agents
by: Zhong, Zhiqing, et al.
Published: (2026)
by: Zhong, Zhiqing, et al.
Published: (2026)
AI-Generated Smells: An Analysis of Code and Architecture in LLM and Agent-Driven Development
by: Zhu, Yuecai, et al.
Published: (2026)
by: Zhu, Yuecai, et al.
Published: (2026)
VulnAgent-X: A Layered Agentic Framework for Repository-Level Vulnerability Detection
by: Meng, Renwei, et al.
Published: (2026)
by: Meng, Renwei, et al.
Published: (2026)
The A-R Behavioral Space: Execution-Level Profiling of Tool-Using Language Model Agents in Organizational Deployment
by: Yu, Shasha, et al.
Published: (2026)
by: Yu, Shasha, et al.
Published: (2026)
Contractual Skills: A GovernSpec Design Framework for Enterprise AI Agents
by: Liu, Ting
Published: (2026)
by: Liu, Ting
Published: (2026)
Butterfly Effects in Toolchains: A Comprehensive Analysis of Failed Parameter Filling in LLM Tool-Agent Systems
by: Xiong, Qian, et al.
Published: (2025)
by: Xiong, Qian, et al.
Published: (2025)
Repeton: Structured Bug Repair with ReAct-Guided Patch-and-Test Cycles
by: Vinh, Nguyen Phu, et al.
Published: (2025)
by: Vinh, Nguyen Phu, et al.
Published: (2025)
Rethinking Software Engineering in the Foundation Model Era: From Task-Driven AI Copilots to Goal-Driven AI Pair Programmers
by: Hassan, Ahmed E., et al.
Published: (2024)
by: Hassan, Ahmed E., et al.
Published: (2024)
Agent Behavioral Contracts: Formal Specification and Runtime Enforcement for Reliable Autonomous AI Agents
by: Bhardwaj, Varun Pratap
Published: (2026)
by: Bhardwaj, Varun Pratap
Published: (2026)
ToolPRMBench: Evaluating and Advancing Process Reward Models for Tool-using Agents
by: Li, Dawei, et al.
Published: (2026)
by: Li, Dawei, et al.
Published: (2026)
Is Open Source the Future of AI? A Data-Driven Approach
by: Vake, Domen, et al.
Published: (2025)
by: Vake, Domen, et al.
Published: (2025)
Toward a Science of Intent: Closure Gaps and Delegation Envelopes for Open-World AI Agents
by: Armesto, Maximiliano, et al.
Published: (2026)
by: Armesto, Maximiliano, et al.
Published: (2026)
OpenHands: An Open Platform for AI Software Developers as Generalist Agents
by: Wang, Xingyao, et al.
Published: (2024)
by: Wang, Xingyao, et al.
Published: (2024)
ParaTool: Shifting Tool Representations from Context to Parameters
by: Yu, Zekai, et al.
Published: (2026)
by: Yu, Zekai, et al.
Published: (2026)
AgentGit: A Version Control Framework for Reliable and Scalable LLM-Powered Multi-Agent Systems
by: Li, Yang, et al.
Published: (2025)
by: Li, Yang, et al.
Published: (2025)
Schema First Tool APIs for LLM Agents: A Controlled Study of Tool Misuse, Recovery, and Budgeted Performance
by: Sigdel, Akshey, et al.
Published: (2026)
by: Sigdel, Akshey, et al.
Published: (2026)
Towards Reliable LLM-Driven Fuzz Testing: Vision and Road Ahead
by: Cheng, Yiran, et al.
Published: (2025)
by: Cheng, Yiran, et al.
Published: (2025)
Unified Software Engineering Agent as AI Software Engineer
by: Applis, Leonhard, et al.
Published: (2025)
by: Applis, Leonhard, et al.
Published: (2025)
TREAT: A Code LLMs Trustworthiness / Reliability Evaluation and Testing Framework
by: Gao, Shuzheng, et al.
Published: (2025)
by: Gao, Shuzheng, et al.
Published: (2025)
OpenDerisk: An Industrial Framework for AI-Driven SRE, with Design, Implementation, and Case Studies
by: Di, Peng, et al.
Published: (2025)
by: Di, Peng, et al.
Published: (2025)
Agent Design Pattern Catalogue: A Collection of Architectural Patterns for Foundation Model based Agents
by: Liu, Yue, et al.
Published: (2024)
by: Liu, Yue, et al.
Published: (2024)
How AI Coding Agents Communicate: A Study of Pull Request Description Characteristics and Human Review Responses
by: Watanabe, Kan, et al.
Published: (2026)
by: Watanabe, Kan, et al.
Published: (2026)
The OpenHands Software Agent SDK: A Composable and Extensible Foundation for Production Agents
by: Wang, Xingyao, et al.
Published: (2025)
by: Wang, Xingyao, et al.
Published: (2025)
From Translation to Superset: Benchmark-Driven Evolution of a Production AI Agent from Rust to Python
by: Wang, Jinhua, et al.
Published: (2026)
by: Wang, Jinhua, et al.
Published: (2026)
Beyond Autonomy: A Dynamic Tiered AgentRunner Framework for Governable and Resilient Enterprise AI Execution
by: Pan, Kai, et al.
Published: (2026)
by: Pan, Kai, et al.
Published: (2026)
Similar Items
-
A Self-Healing Framework for Reliable LLM-Based Autonomous Agents
by: Jeong, Cheonsu, et al.
Published: (2026) -
OSS-UAgent: An Agent-based Usability Evaluation Framework for Open Source Software
by: Meng, Lingkai, et al.
Published: (2025) -
The Cognitive Circuit Breaker: A Systems Engineering Framework for Intrinsic AI Reliability
by: Pan, Jonathan
Published: (2026) -
JTPRO: A Joint Tool-Prompt Reflective Optimization Framework for Language Agents
by: Ghoshal, Sandip, et al.
Published: (2026) -
iPanda: An LLM-based Agent for Automated Conformance Testing of Communication Protocols
by: Sun, Xikai, et al.
Published: (2025)