:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Lin, Liangtao, Zhu, Zhaomeng, Zhang, Tianwei, Wen, Yonggang
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence Software Engineering
Online Access:	https://arxiv.org/abs/2509.13704
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

SOPRAG: Multi-view Graph Experts Retrieval for Industrial Standard Operating Procedures
by: Lin, Liangtao, et al.
Published: (2026)

Agentic Business Process Management Systems
by: Dumas, Marlon, et al.
Published: (2026)

VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation
by: Han, Qijun, et al.
Published: (2026)

Agentic Frameworks for Reasoning Tasks: An Empirical Study
by: Rasheed, Zeeshan, et al.
Published: (2026)

GUITestScape: Towards Open-set Evaluation on Exploratory GUI Testing
by: Chen, Xiaoyi, et al.
Published: (2026)

LLM-Explorer: Towards Efficient and Affordable LLM-based Exploration for Mobile Apps
by: Zhao, Shanhui, et al.
Published: (2025)

GitGoodBench: A Novel Benchmark For Evaluating Agentic Performance On Git
by: Lindenbauer, Tobias, et al.
Published: (2025)

You Don't Know Until You Click:Automated GUI Testing for Production-Ready Software Evaluation
by: Bian, Yutong, et al.
Published: (2025)

ReusStdFlow: A Standardized Reusability Framework for Dynamic Workflow Construction in Agentic AI
by: Zhang, Gaoyang, et al.
Published: (2026)

AgenticTCAD: A LLM-based Multi-Agent Framework for Automated TCAD Code Generation and Device Optimization
by: Fan, Guangxi, et al.
Published: (2025)

WebVIA: A Web-based Vision-Language Agentic Framework for Interactive and Verifiable UI-to-Code Generation
by: Xu, Mingde, et al.
Published: (2025)

Impact-driven Context Filtering For Cross-file Code Completion
by: Li, Yanzhou, et al.
Published: (2025)

LiveFMBench: Unveiling the Power and Limits of Agentic Workflows in Specification Generation
by: Xu, Dong, et al.
Published: (2026)

LaQual: A Novel Framework for Automated Evaluation of LLM App Quality
by: Wang, Yan, et al.
Published: (2025)

Digital Twin and Agentic AI for Wild Fire Disaster Management: Intelligent Virtual Situation Room
by: Morsali, Mohammad, et al.
Published: (2026)

DiagEval: Trajectory-Conditioned Diagnosis for Reliable Software Evaluation with GUI Agents
by: Hong, Sirui, et al.
Published: (2026)

A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement
by: Zhang, Huan, et al.
Published: (2024)

SWEnergy: An Empirical Study on Energy Efficiency in Agentic Issue Resolution Frameworks with SLMs
by: Tripathy, Arihant, et al.
Published: (2025)

SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models
by: Xu, Jingxuan, et al.
Published: (2025)

Coding with Eyes: Visual Feedback Unlocks Reliable GUI Code Generating and Debugging
by: Liu, Zhilin, et al.
Published: (2026)

Fragmented Layer Grouping in GUI Designs Through Graph Learning Based on Multimodal Information
by: Chen, Yunnong, et al.
Published: (2024)

SafeGround: Know When to Trust GUI Grounding Models via Uncertainty Calibration
by: Wang, Qingni, et al.
Published: (2026)

AndroidControl-Curated: Revealing the True Potential of GUI Agents through Benchmark Purification
by: Leung, Ho Fai, et al.
Published: (2025)

VulnAgent-X: A Layered Agentic Framework for Repository-Level Vulnerability Detection
by: Meng, Renwei, et al.
Published: (2026)

Agentic Harness for Real-World Compilers
by: Zheng, Yingwei, et al.
Published: (2026)

Beyond the 'Diff': Addressing Agentic Entropy in Agentic Software Development
by: Casserini, Matteo, et al.
Published: (2026)

LLM Agents for Interactive Exploration of Historical Cadastre Data: Framework and Application to Venice
by: Karch, Tristan, et al.
Published: (2025)

Toward Reliable Design of LLM-Enabled Agentic Workflows: Optimizing Latency-Reliability-Cost Tradeoffs
by: Yang, Ya-Ting, et al.
Published: (2026)

An Empirical Exploration of ChatGPT's Ability to Support Problem Formulation Tasks for Mission Engineering and a Documentation of its Performance Variability
by: Ofsa, Max, et al.
Published: (2025)

Towards Conversational Development Environments: Using Theory-of-Mind and Multi-Agent Architectures for Requirements Refinement
by: Gallaba, Keheliya, et al.
Published: (2025)

app.build: A Production Framework for Scaling Agentic Prompt-to-App Generation with Environment Scaffolding
by: Kniazev, Evgenii, et al.
Published: (2025)

Tuning LLM-based Code Optimization via Meta-Prompting: An Industrial Perspective
by: Gong, Jingzhi, et al.
Published: (2025)

PentestEval: Benchmarking LLM-based Penetration Testing with Modular and Stage-Level Design
by: Yang, Ruozhao, et al.
Published: (2025)

Agentic Software Engineering: Foundational Pillars and a Research Roadmap
by: Hassan, Ahmed E., et al.
Published: (2025)

MaxMind: A Memory Loop Network to Enhance Software Productivity based on Large Language Models
by: Dong, Yuchen, et al.
Published: (2024)

DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use
by: Chen, Aili, et al.
Published: (2026)

On Generalization in Agentic Tool Calling: CoreThink Agentic Reasoner and MAVEN Dataset
by: Bhat, Vishvesh, et al.
Published: (2025)

FlowMind: Execute-Summarize for Structured Workflow Generation from LLM Reasoning
by: Liu, Yihao, et al.
Published: (2026)

Industrial LLM-based Code Optimization under Regulation: A Mixture-of-Agents Approach
by: Ashiga, Mari, et al.
Published: (2025)

Runtime-Structured Task Decomposition for Agentic Coding Systems
by: Asthana, Shubhi, et al.
Published: (2026)