Staff View: DocSpiral: A Platform for Integrated Assistive Document Annotation through Human-in-the-Spiral :: Library Catalog

Bibliographic Details
Main Authors:	Sun, Qiang; Li, Sirui; Bi, Tingting; Huynh, Du; Reynolds, Mark; Luo, Yuanyi; Liu, Wei
Format:	Preprint
Published:	2025
Subjects:	Software Engineering; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2505.03214

_version_	1866918011204534272
author	Sun, Qiang; Li, Sirui; Bi, Tingting; Huynh, Du; Reynolds, Mark; Luo, Yuanyi; Liu, Wei
author_facet	Sun, Qiang; Li, Sirui; Bi, Tingting; Huynh, Du; Reynolds, Mark; Luo, Yuanyi; Liu, Wei
contents	Acquiring structured data from domain-specific, image-based documents such as scanned reports is crucial for many downstream tasks but remains challenging due to document variability. Many of these documents exist as images rather than as machine-readable text, which requires human annotation to train automated extraction systems. We present DocSpiral, the first Human-in-the-Spiral assistive document annotation platform, designed to address the challenge of extracting structured information from domain-specific, image-based document collections. Our spiral design establishes an iterative cycle in which human annotations train models that progressively require less manual intervention. DocSpiral integrates document format normalization, comprehensive annotation interfaces, an evaluation metrics dashboard, and API endpoints for the development of AI/ML models into a unified workflow. Experiments demonstrate that our framework reduces annotation time by at least 41% while showing consistent performance gains across three iterations during model training. By making this annotation platform freely accessible, we aim to lower barriers to AI/ML model development in document processing, facilitating the adoption of large language models in image-based, document-intensive fields such as geoscience and healthcare. The system is freely available at: https://app.ai4wa.com. A demonstration video is available at: https://app.ai4wa.com/docs/docspiral/demo.
format	Preprint
id	arxiv_https___arxiv_org_abs_2505_03214
institution	arXiv
publishDate	2025
record_format	arxiv
title	DocSpiral: A Platform for Integrated Assistive Document Annotation through Human-in-the-Spiral
topic	Software Engineering; Artificial Intelligence
url	https://arxiv.org/abs/2505.03214
