Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Nguyen, Tuan-Phong, Razniewski, Simon, Weikum, Gerhard
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2402.10689
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917730637053952
author	Nguyen, Tuan-Phong Razniewski, Simon Weikum, Gerhard
author_facet	Nguyen, Tuan-Phong Razniewski, Simon Weikum, Gerhard
contents	Despite recent progress, large language models (LLMs) still face the challenge of appropriately reacting to the intricacies of social and cultural conventions. This paper presents MANGO, a methodology for distilling high-accuracy, high-recall assertions of cultural knowledge. We judiciously and iteratively prompt LLMs for this purpose from two entry points, concepts and cultures. Outputs are consolidated via clustering and generative summarization. Running the MANGO method with GPT-3.5 as underlying LLM yields 167K high-accuracy assertions for 30K concepts and 11K cultures, surpassing prior resources by a large margin in quality and size. In an extrinsic evaluation for intercultural dialogues, we explore augmenting dialogue systems with cultural knowledge assertions. Notably, despite LLMs inherently possessing cultural knowledge, we find that adding knowledge from MANGO improves the overall quality, specificity, and cultural sensitivity of dialogue responses, as judged by human annotators. Data and code are available for download.
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_10689
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Cultural Commonsense Knowledge for Intercultural Dialogues Nguyen, Tuan-Phong Razniewski, Simon Weikum, Gerhard Computation and Language Despite recent progress, large language models (LLMs) still face the challenge of appropriately reacting to the intricacies of social and cultural conventions. This paper presents MANGO, a methodology for distilling high-accuracy, high-recall assertions of cultural knowledge. We judiciously and iteratively prompt LLMs for this purpose from two entry points, concepts and cultures. Outputs are consolidated via clustering and generative summarization. Running the MANGO method with GPT-3.5 as underlying LLM yields 167K high-accuracy assertions for 30K concepts and 11K cultures, surpassing prior resources by a large margin in quality and size. In an extrinsic evaluation for intercultural dialogues, we explore augmenting dialogue systems with cultural knowledge assertions. Notably, despite LLMs inherently possessing cultural knowledge, we find that adding knowledge from MANGO improves the overall quality, specificity, and cultural sensitivity of dialogue responses, as judged by human annotators. Data and code are available for download.
title	Cultural Commonsense Knowledge for Intercultural Dialogues
topic	Computation and Language
url	https://arxiv.org/abs/2402.10689

Similar Items