:: Library Catalog

Bibliographic Details
Main Authors:	Seroul, Alan; Fagnoni, Théo; Adnani, Inès; Mohamed, Dana O.; Kingston, Phillip
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence; Software Engineering
Online Access:	https://arxiv.org/abs/2511.04220

Similar Items

Opus: A Prompt Intention Framework for Complex Workflow Generation
by: Fagnoni, Théo, et al.
Published: (2025)

Opus: A Workflow Intention Framework for Complex Workflow Generation
by: Kingston, Phillip, et al.
Published: (2025)

Opus: A Large Work Model for Complex Workflow Generation
by: Fagnoni, Théo, et al.
Published: (2024)

Poisoned Identifiers Survive LLM Deobfuscation: A Case Study on Claude Opus 4.6
by: Lorenzo, Luis Guzmán
Published: (2026)

An LLM-based Quantitative Framework for Evaluating High-Stealthy Backdoor Risks in OSS Supply Chains
by: Yan, Zihe, et al.
Published: (2025)

ReusStdFlow: A Standardized Reusability Framework for Dynamic Workflow Construction in Agentic AI
by: Zhang, Gaoyang, et al.
Published: (2026)

Workflow for Safe-AI
by: Veljanovska, Suzana, et al.
Published: (2025)

Workflows vs Agents for Code Translation
by: Gray, Henry, et al.
Published: (2025)

Blueprint First, Model Second: A Framework for Deterministic LLM Workflow
by: Qiu, Libin, et al.
Published: (2025)

LogBabylon: A Unified Framework for Cross-Log File Integration and Analysis
by: Karanjai, Rabimba, et al.
Published: (2024)

World of Workflows: A Benchmark for Bringing World Models to Enterprise Systems
by: Gupta, Lakshya, et al.
Published: (2026)

Planning-Driven Programming: A Large Language Model Programming Workflow
by: Lei, Chao, et al.
Published: (2024)

FLOW-BENCH: Towards Conversational Generation of Enterprise Workflows
by: Duesterwald, Evelyn, et al.
Published: (2025)

VTS-Guided AI Interaction Workflow for Business Insights
by: Ding, Sun, et al.
Published: (2025)

Demystifying the Lifecycle of Failures in Platform-Orchestrated Agentic Workflows
by: Ma, Xuyan, et al.
Published: (2025)

Model-based Workflow for the Automated Generation of PDDL Descriptions
by: Nabizada, Hamied, et al.
Published: (2024)

REVERE: Reflective Evolving Research Engineer for Scientific Workflows
by: Gangireddi, Balaji Dinesh, et al.
Published: (2026)

A Process Mining-Based System For The Analysis and Prediction of Software Development Workflows
by: Dorado, Antía, et al.
Published: (2025)

Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows
by: Li, Chenxin, et al.
Published: (2026)

Engineering AI Agents for Clinical Workflows: A Case Study in Architecture, MLOps, and Governance
by: Lopes, Cláudio Lúcio do Val, et al.
Published: (2026)

LiveFMBench: Unveiling the Power and Limits of Agentic Workflows in Specification Generation
by: Xu, Dong, et al.
Published: (2026)

MAS-Algorithm: A Workflow for Solving Algorithmic Programming Problems with a Multi-Agent System
by: Xu, Yuliang, et al.
Published: (2026)

WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models
by: Fan, Shengda, et al.
Published: (2024)

Automating Complex Document Workflows via Stepwise and Rollback-Enabled Operation Orchestration
by: Zhang, Yanbin, et al.
Published: (2025)

QualityFlow: An Agentic Workflow for Program Synthesis Controlled by LLM Quality Checks
by: Hu, Yaojie, et al.
Published: (2025)

Generating a Low-code Complete Workflow via Task Decomposition and RAG
by: Ayala, Orlando Marquez, et al.
Published: (2024)

FlowMind: Execute-Summarize for Structured Workflow Generation from LLM Reasoning
by: Liu, Yihao, et al.
Published: (2026)

A Defect Classification Framework for AI-Based Software Systems (AI-ODC)
by: Alannsary, Mohammed O.
Published: (2025)

ModiGen: A Large Language Model-Based Workflow for Multi-Task Modelica Code Generation
by: Xiang, Jiahui, et al.
Published: (2025)

The Foundations of Computational Management: A Systematic Approach to Task Automation for the Integration of Artificial Intelligence into Existing Workflows
by: Jadad-Garcia, Tamen, et al.
Published: (2024)

TREAT: A Code LLMs Trustworthiness / Reliability Evaluation and Testing Framework
by: Gao, Shuzheng, et al.
Published: (2025)

DataGovBench: Benchmarking LLM Agents for Real-World Data Governance Workflows
by: Liu, Zhou, et al.
Published: (2025)

LLM-Powered Workflow Optimization for Multidisciplinary Software Development: An Automotive Industry Case Study
by: Wang, Shuai, et al.
Published: (2026)

LaQual: A Novel Framework for Automated Evaluation of LLM App Quality
by: Wang, Yan, et al.
Published: (2025)

Toward Reliable Design of LLM-Enabled Agentic Workflows: Optimizing Latency-Reliability-Cost Tradeoffs
by: Yang, Ya-Ting, et al.
Published: (2026)

CIRCLE: A Framework for Evaluating AI from a Real-World Lens
by: Schwartz, Reva, et al.
Published: (2026)

Beyond Isolated Tasks: A Framework for Evaluating Coding Agents on Sequential Software Evolution
by: Shastry, KN Ajay, et al.
Published: (2026)

An Empirical Framework for Evaluating Semantic Preservation Using Hugging Face
by: Jia, Nan, et al.
Published: (2025)

VulnLLMEval: A Framework for Evaluating Large Language Models in Software Vulnerability Detection and Patching
by: Zibaeirad, Arastoo, et al.
Published: (2024)

Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
by: Zhang, Xing, et al.
Published: (2025)