Enregistré dans:
Détails bibliographiques
Auteurs principaux: Lee, Seolhwa, Yang, Kisu, Park, Chanjun, Sedoc, João, Lim, Heuiseok
Format: Preprint
Publié: 2021
Sujets:
Accès en ligne:https://arxiv.org/abs/2109.14199
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866914989701332992
author Lee, Seolhwa
Yang, Kisu
Park, Chanjun
Sedoc, João
Lim, Heuiseok
author_facet Lee, Seolhwa
Yang, Kisu
Park, Chanjun
Sedoc, João
Lim, Heuiseok
contents Abstractive dialogue summarization is a challenging task for several reasons. First, most of the important pieces of information in a conversation are scattered across utterances through multi-party interactions with different textual styles. Second, dialogues are often informal structures, wherein different individuals express personal perspectives, unlike text summarization, tasks that usually target formal documents such as news articles. To address these issues, we focused on the association between utterances from individual speakers and unique syntactic structures. Speakers have unique textual styles that can contain linguistic information, such as voiceprint. Therefore, we constructed a syntax-aware model by leveraging linguistic information (i.e., POS tagging), which alleviates the above issues by inherently distinguishing sentences uttered from individual speakers. We employed multi-task learning of both syntax-aware information and dialogue summarization. To the best of our knowledge, our approach is the first method to apply multi-task learning to the dialogue summarization task. Experiments on a SAMSum corpus (a large-scale dialogue summarization corpus) demonstrated that our method improved upon the vanilla model. We further analyze the costs and benefits of our approach relative to baseline models.
format Preprint
id arxiv_https___arxiv_org_abs_2109_14199
institution arXiv
publishDate 2021
record_format arxiv
spellingShingle Who speaks like a style of Vitamin: Towards Syntax-Aware DialogueSummarization using Multi-task Learning
Lee, Seolhwa
Yang, Kisu
Park, Chanjun
Sedoc, João
Lim, Heuiseok
Computation and Language
Abstractive dialogue summarization is a challenging task for several reasons. First, most of the important pieces of information in a conversation are scattered across utterances through multi-party interactions with different textual styles. Second, dialogues are often informal structures, wherein different individuals express personal perspectives, unlike text summarization, tasks that usually target formal documents such as news articles. To address these issues, we focused on the association between utterances from individual speakers and unique syntactic structures. Speakers have unique textual styles that can contain linguistic information, such as voiceprint. Therefore, we constructed a syntax-aware model by leveraging linguistic information (i.e., POS tagging), which alleviates the above issues by inherently distinguishing sentences uttered from individual speakers. We employed multi-task learning of both syntax-aware information and dialogue summarization. To the best of our knowledge, our approach is the first method to apply multi-task learning to the dialogue summarization task. Experiments on a SAMSum corpus (a large-scale dialogue summarization corpus) demonstrated that our method improved upon the vanilla model. We further analyze the costs and benefits of our approach relative to baseline models.
title Who speaks like a style of Vitamin: Towards Syntax-Aware DialogueSummarization using Multi-task Learning
topic Computation and Language
url https://arxiv.org/abs/2109.14199