Bibliographic Details
Main Authors: Gromadzki, Michał, Wróblewska, Anna, Kaliska, Agnieszka
Format: Preprint
Published: 2026
Subjects: Computation and Language; Artificial Intelligence; I.2.7
Online Access: https://arxiv.org/abs/2601.20006
_version_ 1866910003012567040
author Gromadzki, Michał
Wróblewska, Anna
Kaliska, Agnieszka
author_facet Gromadzki, Michał
Wróblewska, Anna
Kaliska, Agnieszka
contents The rapid progress of large language models has enabled the generation of text that closely resembles human writing, creating challenges for authenticity verification in education, publishing, and digital security. Detecting AI-generated text has therefore become a crucial technical and ethical issue. This paper presents a comprehensive study of AI-generated text detection based on large-scale corpora and novel training strategies. We introduce a 1-billion-token corpus of human-authored texts spanning multiple genres and a 1.9-billion-token corpus of AI-generated texts produced by prompting a variety of LLMs across diverse domains. Using these resources, we develop and evaluate numerous detection models and propose two novel training paradigms: Per LLM and Per LLM family fine-tuning. Across a 100-million-token benchmark covering 21 large language models, our best fine-tuned detector achieves up to 99.6% token-level accuracy, substantially outperforming existing open-source baselines.
format Preprint
id arxiv_https___arxiv_org_abs_2601_20006
institution arXiv
publishDate 2026
record_format arxiv
title On the Effectiveness of LLM-Specific Fine-Tuning for Detecting AI-Generated Text
topic Computation and Language
Artificial Intelligence
I.2.7
url https://arxiv.org/abs/2601.20006