Saved in:
| Main Authors: | Lin, Liangtao, Zhu, Zhaomeng, Zhang, Tianwei, Wen, Yonggang |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.13704 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SOPRAG: Multi-view Graph Experts Retrieval for Industrial Standard Operating Procedures
by: Lin, Liangtao, et al.
Published: (2026)
by: Lin, Liangtao, et al.
Published: (2026)
Agentic Business Process Management Systems
by: Dumas, Marlon, et al.
Published: (2026)
by: Dumas, Marlon, et al.
Published: (2026)
VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation
by: Han, Qijun, et al.
Published: (2026)
by: Han, Qijun, et al.
Published: (2026)
Agentic Frameworks for Reasoning Tasks: An Empirical Study
by: Rasheed, Zeeshan, et al.
Published: (2026)
by: Rasheed, Zeeshan, et al.
Published: (2026)
GUITestScape: Towards Open-set Evaluation on Exploratory GUI Testing
by: Chen, Xiaoyi, et al.
Published: (2026)
by: Chen, Xiaoyi, et al.
Published: (2026)
LLM-Explorer: Towards Efficient and Affordable LLM-based Exploration for Mobile Apps
by: Zhao, Shanhui, et al.
Published: (2025)
by: Zhao, Shanhui, et al.
Published: (2025)
GitGoodBench: A Novel Benchmark For Evaluating Agentic Performance On Git
by: Lindenbauer, Tobias, et al.
Published: (2025)
by: Lindenbauer, Tobias, et al.
Published: (2025)
You Don't Know Until You Click:Automated GUI Testing for Production-Ready Software Evaluation
by: Bian, Yutong, et al.
Published: (2025)
by: Bian, Yutong, et al.
Published: (2025)
ReusStdFlow: A Standardized Reusability Framework for Dynamic Workflow Construction in Agentic AI
by: Zhang, Gaoyang, et al.
Published: (2026)
by: Zhang, Gaoyang, et al.
Published: (2026)
AgenticTCAD: A LLM-based Multi-Agent Framework for Automated TCAD Code Generation and Device Optimization
by: Fan, Guangxi, et al.
Published: (2025)
by: Fan, Guangxi, et al.
Published: (2025)
WebVIA: A Web-based Vision-Language Agentic Framework for Interactive and Verifiable UI-to-Code Generation
by: Xu, Mingde, et al.
Published: (2025)
by: Xu, Mingde, et al.
Published: (2025)
Impact-driven Context Filtering For Cross-file Code Completion
by: Li, Yanzhou, et al.
Published: (2025)
by: Li, Yanzhou, et al.
Published: (2025)
LiveFMBench: Unveiling the Power and Limits of Agentic Workflows in Specification Generation
by: Xu, Dong, et al.
Published: (2026)
by: Xu, Dong, et al.
Published: (2026)
LaQual: A Novel Framework for Automated Evaluation of LLM App Quality
by: Wang, Yan, et al.
Published: (2025)
by: Wang, Yan, et al.
Published: (2025)
Digital Twin and Agentic AI for Wild Fire Disaster Management: Intelligent Virtual Situation Room
by: Morsali, Mohammad, et al.
Published: (2026)
by: Morsali, Mohammad, et al.
Published: (2026)
DiagEval: Trajectory-Conditioned Diagnosis for Reliable Software Evaluation with GUI Agents
by: Hong, Sirui, et al.
Published: (2026)
by: Hong, Sirui, et al.
Published: (2026)
A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement
by: Zhang, Huan, et al.
Published: (2024)
by: Zhang, Huan, et al.
Published: (2024)
SWEnergy: An Empirical Study on Energy Efficiency in Agentic Issue Resolution Frameworks with SLMs
by: Tripathy, Arihant, et al.
Published: (2025)
by: Tripathy, Arihant, et al.
Published: (2025)
SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models
by: Xu, Jingxuan, et al.
Published: (2025)
by: Xu, Jingxuan, et al.
Published: (2025)
Coding with Eyes: Visual Feedback Unlocks Reliable GUI Code Generating and Debugging
by: Liu, Zhilin, et al.
Published: (2026)
by: Liu, Zhilin, et al.
Published: (2026)
Fragmented Layer Grouping in GUI Designs Through Graph Learning Based on Multimodal Information
by: Chen, Yunnong, et al.
Published: (2024)
by: Chen, Yunnong, et al.
Published: (2024)
SafeGround: Know When to Trust GUI Grounding Models via Uncertainty Calibration
by: Wang, Qingni, et al.
Published: (2026)
by: Wang, Qingni, et al.
Published: (2026)
AndroidControl-Curated: Revealing the True Potential of GUI Agents through Benchmark Purification
by: Leung, Ho Fai, et al.
Published: (2025)
by: Leung, Ho Fai, et al.
Published: (2025)
VulnAgent-X: A Layered Agentic Framework for Repository-Level Vulnerability Detection
by: Meng, Renwei, et al.
Published: (2026)
by: Meng, Renwei, et al.
Published: (2026)
Agentic Harness for Real-World Compilers
by: Zheng, Yingwei, et al.
Published: (2026)
by: Zheng, Yingwei, et al.
Published: (2026)
Beyond the 'Diff': Addressing Agentic Entropy in Agentic Software Development
by: Casserini, Matteo, et al.
Published: (2026)
by: Casserini, Matteo, et al.
Published: (2026)
LLM Agents for Interactive Exploration of Historical Cadastre Data: Framework and Application to Venice
by: Karch, Tristan, et al.
Published: (2025)
by: Karch, Tristan, et al.
Published: (2025)
Toward Reliable Design of LLM-Enabled Agentic Workflows: Optimizing Latency-Reliability-Cost Tradeoffs
by: Yang, Ya-Ting, et al.
Published: (2026)
by: Yang, Ya-Ting, et al.
Published: (2026)
An Empirical Exploration of ChatGPT's Ability to Support Problem Formulation Tasks for Mission Engineering and a Documentation of its Performance Variability
by: Ofsa, Max, et al.
Published: (2025)
by: Ofsa, Max, et al.
Published: (2025)
Towards Conversational Development Environments: Using Theory-of-Mind and Multi-Agent Architectures for Requirements Refinement
by: Gallaba, Keheliya, et al.
Published: (2025)
by: Gallaba, Keheliya, et al.
Published: (2025)
app.build: A Production Framework for Scaling Agentic Prompt-to-App Generation with Environment Scaffolding
by: Kniazev, Evgenii, et al.
Published: (2025)
by: Kniazev, Evgenii, et al.
Published: (2025)
Tuning LLM-based Code Optimization via Meta-Prompting: An Industrial Perspective
by: Gong, Jingzhi, et al.
Published: (2025)
by: Gong, Jingzhi, et al.
Published: (2025)
PentestEval: Benchmarking LLM-based Penetration Testing with Modular and Stage-Level Design
by: Yang, Ruozhao, et al.
Published: (2025)
by: Yang, Ruozhao, et al.
Published: (2025)
Agentic Software Engineering: Foundational Pillars and a Research Roadmap
by: Hassan, Ahmed E., et al.
Published: (2025)
by: Hassan, Ahmed E., et al.
Published: (2025)
MaxMind: A Memory Loop Network to Enhance Software Productivity based on Large Language Models
by: Dong, Yuchen, et al.
Published: (2024)
by: Dong, Yuchen, et al.
Published: (2024)
DIVE: Scaling Diversity in Agentic Task Synthesis for Generalizable Tool Use
by: Chen, Aili, et al.
Published: (2026)
by: Chen, Aili, et al.
Published: (2026)
On Generalization in Agentic Tool Calling: CoreThink Agentic Reasoner and MAVEN Dataset
by: Bhat, Vishvesh, et al.
Published: (2025)
by: Bhat, Vishvesh, et al.
Published: (2025)
FlowMind: Execute-Summarize for Structured Workflow Generation from LLM Reasoning
by: Liu, Yihao, et al.
Published: (2026)
by: Liu, Yihao, et al.
Published: (2026)
Industrial LLM-based Code Optimization under Regulation: A Mixture-of-Agents Approach
by: Ashiga, Mari, et al.
Published: (2025)
by: Ashiga, Mari, et al.
Published: (2025)
Runtime-Structured Task Decomposition for Agentic Coding Systems
by: Asthana, Shubhi, et al.
Published: (2026)
by: Asthana, Shubhi, et al.
Published: (2026)
Similar Items
-
SOPRAG: Multi-view Graph Experts Retrieval for Industrial Standard Operating Procedures
by: Lin, Liangtao, et al.
Published: (2026) -
Agentic Business Process Management Systems
by: Dumas, Marlon, et al.
Published: (2026) -
VLAA-GUI: Knowing When to Stop, Recover, and Search, A Modular Framework for GUI Automation
by: Han, Qijun, et al.
Published: (2026) -
Agentic Frameworks for Reasoning Tasks: An Empirical Study
by: Rasheed, Zeeshan, et al.
Published: (2026) -
GUITestScape: Towards Open-set Evaluation on Exploratory GUI Testing
by: Chen, Xiaoyi, et al.
Published: (2026)