Saved in:
Bibliographic Details
Main Authors: Ravaut, Mathieu, Joty, Shafiq, Chen, Nancy
Format: Preprint
Published: 2022
Subjects:
Online Access:https://arxiv.org/abs/2212.09593
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915018452238336
author Ravaut, Mathieu
Joty, Shafiq
Chen, Nancy
author_facet Ravaut, Mathieu
Joty, Shafiq
Chen, Nancy
contents With the rise of task-specific pre-training objectives, abstractive summarization models like PEGASUS offer appealing zero-shot performance on downstream summarization tasks. However, the performance of such unsupervised models still lags significantly behind their supervised counterparts. Similarly to the supervised setup, we notice a very high variance in quality among summary candidates from these models while only one candidate is kept as the summary output. In this paper, we propose to re-rank summary candidates in an unsupervised manner, aiming to close the performance gap between unsupervised and supervised models. Our approach improves the unsupervised PEGASUS by up to 7.27% and ChatGPT by up to 6.86% relative mean ROUGE across four widely-adopted summarization benchmarks ; and achieves relative gains of 7.51% (up to 23.73% from XSum to WikiHow) averaged over 30 zero-shot transfer setups (finetuning on a dataset, evaluating on another).
format Preprint
id arxiv_https___arxiv_org_abs_2212_09593
institution arXiv
publishDate 2022
record_format arxiv
spellingShingle Unsupervised Summarization Re-ranking
Ravaut, Mathieu
Joty, Shafiq
Chen, Nancy
Computation and Language
With the rise of task-specific pre-training objectives, abstractive summarization models like PEGASUS offer appealing zero-shot performance on downstream summarization tasks. However, the performance of such unsupervised models still lags significantly behind their supervised counterparts. Similarly to the supervised setup, we notice a very high variance in quality among summary candidates from these models while only one candidate is kept as the summary output. In this paper, we propose to re-rank summary candidates in an unsupervised manner, aiming to close the performance gap between unsupervised and supervised models. Our approach improves the unsupervised PEGASUS by up to 7.27% and ChatGPT by up to 6.86% relative mean ROUGE across four widely-adopted summarization benchmarks ; and achieves relative gains of 7.51% (up to 23.73% from XSum to WikiHow) averaged over 30 zero-shot transfer setups (finetuning on a dataset, evaluating on another).
title Unsupervised Summarization Re-ranking
topic Computation and Language
url https://arxiv.org/abs/2212.09593