Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Vossen, Piek, Santamaría, Selene Báez, Bajčetić, Lenka, Belluci, Thomas
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2412.18364
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929646370553856
author	Vossen, Piek Santamaría, Selene Báez Bajčetić, Lenka Belluci, Thomas
author_facet	Vossen, Piek Santamaría, Selene Báez Bajčetić, Lenka Belluci, Thomas
contents	Obtaining an explicit understanding of communication within a Hybrid Intelligence collaboration is essential to create controllable and transparent agents. In this paper, we describe a number of Natural Language Understanding models that extract explicit symbolic triples from social conversation. Triple extraction has mostly been developed and tested for Knowledge Base Completion using Wikipedia text and data for training and testing. However, social conversation is very different as a genre in which interlocutors exchange information in sequences of utterances that involve statements, questions, and answers. Phenomena such as co-reference, ellipsis, coordination, and implicit and explicit negation or confirmation are more prominent in conversation than in Wikipedia text. We therefore describe an attempt to fill this gap by releasing data sets for training and testing triple extraction from social conversation. We also created five triple extraction models and tested them in our evaluation data. The highest precision is 51.14 for complete triples and 69.32 for triple elements when tested on single utterances. However, scores for conversational triples that span multiple turns are much lower, showing that extracting knowledge from true conversational data is much more challenging.
format	Preprint
id	arxiv_https___arxiv_org_abs_2412_18364
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Extracting triples from dialogues for conversational social agents Vossen, Piek Santamaría, Selene Báez Bajčetić, Lenka Belluci, Thomas Computation and Language Obtaining an explicit understanding of communication within a Hybrid Intelligence collaboration is essential to create controllable and transparent agents. In this paper, we describe a number of Natural Language Understanding models that extract explicit symbolic triples from social conversation. Triple extraction has mostly been developed and tested for Knowledge Base Completion using Wikipedia text and data for training and testing. However, social conversation is very different as a genre in which interlocutors exchange information in sequences of utterances that involve statements, questions, and answers. Phenomena such as co-reference, ellipsis, coordination, and implicit and explicit negation or confirmation are more prominent in conversation than in Wikipedia text. We therefore describe an attempt to fill this gap by releasing data sets for training and testing triple extraction from social conversation. We also created five triple extraction models and tested them in our evaluation data. The highest precision is 51.14 for complete triples and 69.32 for triple elements when tested on single utterances. However, scores for conversational triples that span multiple turns are much lower, showing that extracting knowledge from true conversational data is much more challenging.
title	Extracting triples from dialogues for conversational social agents
topic	Computation and Language
url	https://arxiv.org/abs/2412.18364

Similar Items