Saved in:
| Main Authors: | Seroul, Alan, Fagnoni, Théo, Adnani, Inès, Mohamed, Dana O., Kingston, Phillip |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.04220 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Opus: A Prompt Intention Framework for Complex Workflow Generation
by: Fagnoni, Théo, et al.
Published: (2025)
by: Fagnoni, Théo, et al.
Published: (2025)
Opus: A Workflow Intention Framework for Complex Workflow Generation
by: Kingston, Phillip, et al.
Published: (2025)
by: Kingston, Phillip, et al.
Published: (2025)
Opus: A Large Work Model for Complex Workflow Generation
by: Fagnoni, Théo, et al.
Published: (2024)
by: Fagnoni, Théo, et al.
Published: (2024)
Poisoned Identifiers Survive LLM Deobfuscation: A Case Study on Claude Opus 4.6
by: Lorenzo, Luis Guzmán
Published: (2026)
by: Lorenzo, Luis Guzmán
Published: (2026)
An LLM-based Quantitative Framework for Evaluating High-Stealthy Backdoor Risks in OSS Supply Chains
by: Yan, Zihe, et al.
Published: (2025)
by: Yan, Zihe, et al.
Published: (2025)
ReusStdFlow: A Standardized Reusability Framework for Dynamic Workflow Construction in Agentic AI
by: Zhang, Gaoyang, et al.
Published: (2026)
by: Zhang, Gaoyang, et al.
Published: (2026)
Workflow for Safe-AI
by: Veljanovska, Suzana, et al.
Published: (2025)
by: Veljanovska, Suzana, et al.
Published: (2025)
Workflows vs Agents for Code Translation
by: Gray, Henry, et al.
Published: (2025)
by: Gray, Henry, et al.
Published: (2025)
Blueprint First, Model Second: A Framework for Deterministic LLM Workflow
by: Qiu, Libin, et al.
Published: (2025)
by: Qiu, Libin, et al.
Published: (2025)
LogBabylon: A Unified Framework for Cross-Log File Integration and Analysis
by: Karanjai, Rabimba, et al.
Published: (2024)
by: Karanjai, Rabimba, et al.
Published: (2024)
World of Workflows: A Benchmark for Bringing World Models to Enterprise Systems
by: Gupta, Lakshya, et al.
Published: (2026)
by: Gupta, Lakshya, et al.
Published: (2026)
Planning-Driven Programming: A Large Language Model Programming Workflow
by: Lei, Chao, et al.
Published: (2024)
by: Lei, Chao, et al.
Published: (2024)
FLOW-BENCH: Towards Conversational Generation of Enterprise Workflows
by: Duesterwald, Evelyn, et al.
Published: (2025)
by: Duesterwald, Evelyn, et al.
Published: (2025)
VTS-Guided AI Interaction Workflow for Business Insights
by: Ding, Sun, et al.
Published: (2025)
by: Ding, Sun, et al.
Published: (2025)
Demystifying the Lifecycle of Failures in Platform-Orchestrated Agentic Workflows
by: Ma, Xuyan, et al.
Published: (2025)
by: Ma, Xuyan, et al.
Published: (2025)
Model-based Workflow for the Automated Generation of PDDL Descriptions
by: Nabizada, Hamied, et al.
Published: (2024)
by: Nabizada, Hamied, et al.
Published: (2024)
REVERE: Reflective Evolving Research Engineer for Scientific Workflows
by: Gangireddi, Balaji Dinesh, et al.
Published: (2026)
by: Gangireddi, Balaji Dinesh, et al.
Published: (2026)
A Process Mining-Based System For The Analysis and Prediction of Software Development Workflows
by: Dorado, Antía, et al.
Published: (2025)
by: Dorado, Antía, et al.
Published: (2025)
Claw-Eval-Live: A Live Agent Benchmark for Evolving Real-World Workflows
by: Li, Chenxin, et al.
Published: (2026)
by: Li, Chenxin, et al.
Published: (2026)
Engineering AI Agents for Clinical Workflows: A Case Study in Architecture,MLOps, and Governance
by: Lopes, Cláudio Lúcio do Val, et al.
Published: (2026)
by: Lopes, Cláudio Lúcio do Val, et al.
Published: (2026)
LiveFMBench: Unveiling the Power and Limits of Agentic Workflows in Specification Generation
by: Xu, Dong, et al.
Published: (2026)
by: Xu, Dong, et al.
Published: (2026)
MAS-Algorithm: A Workflow for Solving Algorithmic Programming Problems with a Multi-Agent System
by: Xu, Yuliang, et al.
Published: (2026)
by: Xu, Yuliang, et al.
Published: (2026)
WorkflowLLM: Enhancing Workflow Orchestration Capability of Large Language Models
by: Fan, Shengda, et al.
Published: (2024)
by: Fan, Shengda, et al.
Published: (2024)
Automating Complex Document Workflows via Stepwise and Rollback-Enabled Operation Orchestration
by: Zhang, Yanbin, et al.
Published: (2025)
by: Zhang, Yanbin, et al.
Published: (2025)
QualityFlow: An Agentic Workflow for Program Synthesis Controlled by LLM Quality Checks
by: Hu, Yaojie, et al.
Published: (2025)
by: Hu, Yaojie, et al.
Published: (2025)
Generating a Low-code Complete Workflow via Task Decomposition and RAG
by: Ayala, Orlando Marquez, et al.
Published: (2024)
by: Ayala, Orlando Marquez, et al.
Published: (2024)
FlowMind: Execute-Summarize for Structured Workflow Generation from LLM Reasoning
by: Liu, Yihao, et al.
Published: (2026)
by: Liu, Yihao, et al.
Published: (2026)
A Defect Classification Framework for AI-Based Software Systems (AI-ODC)
by: Alannsary, Mohammed O.
Published: (2025)
by: Alannsary, Mohammed O.
Published: (2025)
ModiGen: A Large Language Model-Based Workflow for Multi-Task Modelica Code Generation
by: Xiang, Jiahui, et al.
Published: (2025)
by: Xiang, Jiahui, et al.
Published: (2025)
The Foundations of Computational Management: A Systematic Approach to Task Automation for the Integration of Artificial Intelligence into Existing Workflows
by: Jadad-Garcia, Tamen, et al.
Published: (2024)
by: Jadad-Garcia, Tamen, et al.
Published: (2024)
TREAT: A Code LLMs Trustworthiness / Reliability Evaluation and Testing Framework
by: Gao, Shuzheng, et al.
Published: (2025)
by: Gao, Shuzheng, et al.
Published: (2025)
DataGovBench: Benchmarking LLM Agents for Real-World Data Governance Workflows
by: Liu, Zhou, et al.
Published: (2025)
by: Liu, Zhou, et al.
Published: (2025)
LLM-Powered Workflow Optimization for Multidisciplinary Software Development: An Automotive Industry Case Study
by: Wang, Shuai, et al.
Published: (2026)
by: Wang, Shuai, et al.
Published: (2026)
LaQual: A Novel Framework for Automated Evaluation of LLM App Quality
by: Wang, Yan, et al.
Published: (2025)
by: Wang, Yan, et al.
Published: (2025)
Toward Reliable Design of LLM-Enabled Agentic Workflows: Optimizing Latency-Reliability-Cost Tradeoffs
by: Yang, Ya-Ting, et al.
Published: (2026)
by: Yang, Ya-Ting, et al.
Published: (2026)
CIRCLE: A Framework for Evaluating AI from a Real-World Lens
by: Schwartz, Reva, et al.
Published: (2026)
by: Schwartz, Reva, et al.
Published: (2026)
Beyond Isolated Tasks: A Framework for Evaluating Coding Agents on Sequential Software Evolution
by: Shastry, KN Ajay, et al.
Published: (2026)
by: Shastry, KN Ajay, et al.
Published: (2026)
An Empirical Framework for Evaluating Semantic Preservation Using Hugging Face
by: Jia, Nan, et al.
Published: (2025)
by: Jia, Nan, et al.
Published: (2025)
VulnLLMEval: A Framework for Evaluating Large Language Models in Software Vulnerability Detection and Patching
by: Zibaeirad, Arastoo, et al.
Published: (2024)
by: Zibaeirad, Arastoo, et al.
Published: (2024)
Skeleton-Guided-Translation: A Benchmarking Framework for Code Repository Translation with Fine-Grained Quality Evaluation
by: Zhang, Xing, et al.
Published: (2025)
by: Zhang, Xing, et al.
Published: (2025)
Similar Items
-
Opus: A Prompt Intention Framework for Complex Workflow Generation
by: Fagnoni, Théo, et al.
Published: (2025) -
Opus: A Workflow Intention Framework for Complex Workflow Generation
by: Kingston, Phillip, et al.
Published: (2025) -
Opus: A Large Work Model for Complex Workflow Generation
by: Fagnoni, Théo, et al.
Published: (2024) -
Poisoned Identifiers Survive LLM Deobfuscation: A Case Study on Claude Opus 4.6
by: Lorenzo, Luis Guzmán
Published: (2026) -
An LLM-based Quantitative Framework for Evaluating High-Stealthy Backdoor Risks in OSS Supply Chains
by: Yan, Zihe, et al.
Published: (2025)