Saved in:
Bibliographic Details
Main Authors: Huang, Chenyang, Zhou, Hao, Jen, Cameron, Zheng, Kangjie, Zaïane, Osmar R., Mou, Lili
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.04535
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Length-control summarization aims to condense long texts into a short one within a certain length limit. Previous approaches often use autoregressive (AR) models and treat the length requirement as a soft constraint, which may not always be satisfied. In this study, we propose a novel length-control decoding algorithm based on the Directed Acyclic Transformer (DAT). Our approach allows for multiple plausible sequence fragments and predicts a \emph{path} to connect them. In addition, we propose a Sequence Maximum a Posteriori (SeqMAP) decoding algorithm that marginalizes different possible paths and finds the most probable summary satisfying the length budget. Our algorithm is based on beam search, which further facilitates a reranker for performance improvement. Experimental results on the Gigaword and DUC2004 datasets demonstrate our state-of-the-art performance for length-control summarization.