Saved in:
Bibliographic Details
Main Authors: Zhang, Yanbin, Ye, Hanhui, Bai, Yue, Zhang, Qiming, Xiang, Liao, Mianzhi, Wu, Hu, Renjun
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2512.04445
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914179650158592
author Zhang, Yanbin
Ye, Hanhui
Bai, Yue
Zhang, Qiming
Xiang, Liao
Mianzhi, Wu
Hu, Renjun
author_facet Zhang, Yanbin
Ye, Hanhui
Bai, Yue
Zhang, Qiming
Xiang, Liao
Mianzhi, Wu
Hu, Renjun
contents Workflow automation promises substantial productivity gains in everyday document-related tasks. While prior agentic systems can execute isolated instructions, they struggle with automating multi-step, session-level workflows due to limited control over the operational process. To this end, we introduce AutoDW, a novel execution framework that enables stepwise, rollback-enabled operation orchestration. AutoDW incrementally plans API actions conditioned on user instructions, intent-filtered API candidates, and the evolving states of the document. It further employs robust rollback mechanisms at both the argument and API levels, enabling dynamic correction and fault tolerance. These designs together ensure that the execution trajectory of AutoDW remains aligned with user intent and document context across long-horizon workflows. To assess its effectiveness, we construct a comprehensive benchmark of 250 sessions and 1,708 human-annotated instructions, reflecting realistic document processing scenarios with interdependent instructions. AutoDW achieves 90% and 62% completion rates on instruction- and session-level tasks, respectively, outperforming strong baselines by 40% and 76%. Moreover, AutoDW also remains robust for the decision of backbone LLMs and on tasks with varying difficulty. Code and data will be open-sourced. Code: https://github.com/YJett/AutoDW
format Preprint
id arxiv_https___arxiv_org_abs_2512_04445
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Automating Complex Document Workflows via Stepwise and Rollback-Enabled Operation Orchestration
Zhang, Yanbin
Ye, Hanhui
Bai, Yue
Zhang, Qiming
Xiang, Liao
Mianzhi, Wu
Hu, Renjun
Software Engineering
Artificial Intelligence
Workflow automation promises substantial productivity gains in everyday document-related tasks. While prior agentic systems can execute isolated instructions, they struggle with automating multi-step, session-level workflows due to limited control over the operational process. To this end, we introduce AutoDW, a novel execution framework that enables stepwise, rollback-enabled operation orchestration. AutoDW incrementally plans API actions conditioned on user instructions, intent-filtered API candidates, and the evolving states of the document. It further employs robust rollback mechanisms at both the argument and API levels, enabling dynamic correction and fault tolerance. These designs together ensure that the execution trajectory of AutoDW remains aligned with user intent and document context across long-horizon workflows. To assess its effectiveness, we construct a comprehensive benchmark of 250 sessions and 1,708 human-annotated instructions, reflecting realistic document processing scenarios with interdependent instructions. AutoDW achieves 90% and 62% completion rates on instruction- and session-level tasks, respectively, outperforming strong baselines by 40% and 76%. Moreover, AutoDW also remains robust for the decision of backbone LLMs and on tasks with varying difficulty. Code and data will be open-sourced. Code: https://github.com/YJett/AutoDW
title Automating Complex Document Workflows via Stepwise and Rollback-Enabled Operation Orchestration
topic Software Engineering
Artificial Intelligence
url https://arxiv.org/abs/2512.04445