Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Fraser, Kathleen C., Dawkins, Hillary, Kiritchenko, Svetlana
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Computers and Society
Online Access:	https://arxiv.org/abs/2406.15583
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912326407421952
author	Fraser, Kathleen C. Dawkins, Hillary Kiritchenko, Svetlana
author_facet	Fraser, Kathleen C. Dawkins, Hillary Kiritchenko, Svetlana
contents	Large language models (LLMs) have advanced to a point that even humans have difficulty discerning whether a text was generated by another human, or by a computer. However, knowing whether a text was produced by human or artificial intelligence (AI) is important to determining its trustworthiness, and has applications in many domains including detecting fraud and academic dishonesty, as well as combating the spread of misinformation and political propaganda. The task of AI-generated text (AIGT) detection is therefore both very challenging, and highly critical. In this survey, we summarize state-of-the art approaches to AIGT detection, including watermarking, statistical and stylistic analysis, and machine learning classification. We also provide information about existing datasets for this task. Synthesizing the research findings, we aim to provide insight into the salient factors that combine to determine how "detectable" AIGT text is under different scenarios, and to make practical recommendations for future work towards this significant technical and societal challenge.
format	Preprint
id	arxiv_https___arxiv_org_abs_2406_15583
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods Fraser, Kathleen C. Dawkins, Hillary Kiritchenko, Svetlana Computation and Language Computers and Society Large language models (LLMs) have advanced to a point that even humans have difficulty discerning whether a text was generated by another human, or by a computer. However, knowing whether a text was produced by human or artificial intelligence (AI) is important to determining its trustworthiness, and has applications in many domains including detecting fraud and academic dishonesty, as well as combating the spread of misinformation and political propaganda. The task of AI-generated text (AIGT) detection is therefore both very challenging, and highly critical. In this survey, we summarize state-of-the art approaches to AIGT detection, including watermarking, statistical and stylistic analysis, and machine learning classification. We also provide information about existing datasets for this task. Synthesizing the research findings, we aim to provide insight into the salient factors that combine to determine how "detectable" AIGT text is under different scenarios, and to make practical recommendations for future work towards this significant technical and societal challenge.
title	Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods
topic	Computation and Language Computers and Society
url	https://arxiv.org/abs/2406.15583

Similar Items