Staff View: Fighting Fire with Fire: Adversarial Prompting to Generate a Misinformation Detection Dataset

Bibliographic Details
Main Authors:	Satapara, Shrey; Mehta, Parth; Ganguly, Debasis; Modha, Sandip
Format:	Preprint
Published:	2024
Subjects:	Computation and Language; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2401.04481

_version_	1866909066528292864
author	Satapara, Shrey; Mehta, Parth; Ganguly, Debasis; Modha, Sandip
author_facet	Satapara, Shrey; Mehta, Parth; Ganguly, Debasis; Modha, Sandip
contents	The recent success in language generation capabilities of large language models (LLMs), such as GPT, Bard, and Llama, raises concerns about their possible misuse in inducing mass agitation and communal hatred by generating fake news and spreading misinformation. Traditional means of developing a misinformation ground-truth dataset do not scale well because of the extensive manual effort required to annotate the data. In this paper, we propose an LLM-based approach to creating silver-standard ground-truth datasets for identifying misinformation. Specifically, given a trusted news article, our proposed approach involves prompting LLMs to automatically generate a summarised version of the original article. The prompts in our proposed approach act as a controlling mechanism to generate specific types of factual incorrectness in the generated summaries, e.g., incorrect quantities and false attributions. To investigate the usefulness of this dataset, we conduct a set of experiments in which we train a range of supervised models for the task of misinformation detection.
format	Preprint
id	arxiv_https___arxiv_org_abs_2401_04481
institution	arXiv
publishDate	2024
record_format	arxiv
title	Fighting Fire with Fire: Adversarial Prompting to Generate a Misinformation Detection Dataset
topic	Computation and Language; Artificial Intelligence
url	https://arxiv.org/abs/2401.04481

Similar Items