Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Gromadzki, Michał, Wróblewska, Anna, Kaliska, Agnieszka
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Artificial Intelligence I.2.7
Online Access:	https://arxiv.org/abs/2601.20006
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910003012567040
author	Gromadzki, Michał Wróblewska, Anna Kaliska, Agnieszka
author_facet	Gromadzki, Michał Wróblewska, Anna Kaliska, Agnieszka
contents	The rapid progress of large language models has enabled the generation of text that closely resembles human writing, creating challenges for authenticity verification in education, publishing, and digital security. Detecting AI-generated text has therefore become a crucial technical and ethical issue. This paper presents a comprehensive study of AI-generated text detection based on large-scale corpora and novel training strategies. We introduce a 1-billion-token corpus of human-authored texts spanning multiple genres and a 1.9-billion-token corpus of AI-generated texts produced by prompting a variety of LLMs across diverse domains. Using these resources, we develop and evaluate numerous detection models and propose two novel training paradigms: Per LLM and Per LLM family fine-tuning. Across a 100-million-token benchmark covering 21 large language models, our best fine-tuned detector achieves up to $99.6\%$ token-level accuracy, substantially outperforming existing open-source baselines.
format	Preprint
id	arxiv_https___arxiv_org_abs_2601_20006
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	On the Effectiveness of LLM-Specific Fine-Tuning for Detecting AI-Generated Text Gromadzki, Michał Wróblewska, Anna Kaliska, Agnieszka Computation and Language Artificial Intelligence I.2.7 The rapid progress of large language models has enabled the generation of text that closely resembles human writing, creating challenges for authenticity verification in education, publishing, and digital security. Detecting AI-generated text has therefore become a crucial technical and ethical issue. This paper presents a comprehensive study of AI-generated text detection based on large-scale corpora and novel training strategies. We introduce a 1-billion-token corpus of human-authored texts spanning multiple genres and a 1.9-billion-token corpus of AI-generated texts produced by prompting a variety of LLMs across diverse domains. Using these resources, we develop and evaluate numerous detection models and propose two novel training paradigms: Per LLM and Per LLM family fine-tuning. Across a 100-million-token benchmark covering 21 large language models, our best fine-tuned detector achieves up to $99.6\%$ token-level accuracy, substantially outperforming existing open-source baselines.
title	On the Effectiveness of LLM-Specific Fine-Tuning for Detecting AI-Generated Text
topic	Computation and Language Artificial Intelligence I.2.7
url	https://arxiv.org/abs/2601.20006

Similar Items