Enregistré dans:
Détails bibliographiques
Auteurs principaux: Xu, Yunqing, Ma, Xinbei, Qiu, Jiyang, Zhao, Hai
Format: Preprint
Publié: 2025
Sujets:
Accès en ligne:https://arxiv.org/abs/2502.15412
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866916624827678720
author Xu, Yunqing
Ma, Xinbei
Qiu, Jiyang
Zhao, Hai
author_facet Xu, Yunqing
Ma, Xinbei
Qiu, Jiyang
Zhao, Hai
contents Generating presentation slides is a time-consuming task that urgently requires automation. Due to their limited flexibility and lack of automated refinement mechanisms, existing autonomous LLM-based agents face constraints in real-world applicability. We decompose the task of generating missing presentation slides into two key components: content generation and layout generation, aligning with the typical process of creating academic slides. First, we introduce a content generation approach that enhances coherence and relevance by incorporating context from surrounding slides and leveraging section retrieval strategies. For layout generation, we propose a textual-to-visual self-verification process using a LLM-based Reviewer + Refiner workflow, transforming complex textual layouts into intuitive visual formats. This modality transformation simplifies the task, enabling accurate and human-like review and refinement. Experiments show that our approach significantly outperforms baseline methods in terms of alignment, logical flow, visual appeal, and readability.
format Preprint
id arxiv_https___arxiv_org_abs_2502_15412
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Textual-to-Visual Iterative Self-Verification for Slide Generation
Xu, Yunqing
Ma, Xinbei
Qiu, Jiyang
Zhao, Hai
Computation and Language
Generating presentation slides is a time-consuming task that urgently requires automation. Due to their limited flexibility and lack of automated refinement mechanisms, existing autonomous LLM-based agents face constraints in real-world applicability. We decompose the task of generating missing presentation slides into two key components: content generation and layout generation, aligning with the typical process of creating academic slides. First, we introduce a content generation approach that enhances coherence and relevance by incorporating context from surrounding slides and leveraging section retrieval strategies. For layout generation, we propose a textual-to-visual self-verification process using a LLM-based Reviewer + Refiner workflow, transforming complex textual layouts into intuitive visual formats. This modality transformation simplifies the task, enabling accurate and human-like review and refinement. Experiments show that our approach significantly outperforms baseline methods in terms of alignment, logical flow, visual appeal, and readability.
title Textual-to-Visual Iterative Self-Verification for Slide Generation
topic Computation and Language
url https://arxiv.org/abs/2502.15412