Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zhao, Xuandong, Liao, Chenwen, Wang, Yu-Xiang, Li, Lei
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2410.03600
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915338520625152
author	Zhao, Xuandong Liao, Chenwen Wang, Yu-Xiang Li, Lei
author_facet	Zhao, Xuandong Liao, Chenwen Wang, Yu-Xiang Li, Lei
contents	Text watermarks in large language models (LLMs) are increasingly used to detect synthetic text, mitigating misuse cases like fake news and academic dishonesty. While existing watermarking detection techniques primarily focus on classifying entire documents as watermarked or not, they often neglect the common scenario of identifying individual watermark segments within longer, mixed-source documents. Drawing inspiration from plagiarism detection systems, we propose two novel methods for partial watermark detection. First, we develop a geometry cover detection framework aimed at determining whether there is a watermark segment in long text. Second, we introduce an adaptive online learning algorithm to pinpoint the precise location of watermark segments within the text. Evaluated on three popular watermarking techniques (KGW-Watermark, Unigram-Watermark, and Gumbel-Watermark), our approach achieves high accuracy, significantly outperforming baseline methods. Moreover, our framework is adaptable to other watermarking techniques, offering new insights for precise watermark detection. Our code is publicly available at https://github.com/XuandongZhao/llm-watermark-location
format	Preprint
id	arxiv_https___arxiv_org_abs_2410_03600
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Efficiently Identifying Watermarked Segments in Mixed-Source Texts Zhao, Xuandong Liao, Chenwen Wang, Yu-Xiang Li, Lei Computation and Language Text watermarks in large language models (LLMs) are increasingly used to detect synthetic text, mitigating misuse cases like fake news and academic dishonesty. While existing watermarking detection techniques primarily focus on classifying entire documents as watermarked or not, they often neglect the common scenario of identifying individual watermark segments within longer, mixed-source documents. Drawing inspiration from plagiarism detection systems, we propose two novel methods for partial watermark detection. First, we develop a geometry cover detection framework aimed at determining whether there is a watermark segment in long text. Second, we introduce an adaptive online learning algorithm to pinpoint the precise location of watermark segments within the text. Evaluated on three popular watermarking techniques (KGW-Watermark, Unigram-Watermark, and Gumbel-Watermark), our approach achieves high accuracy, significantly outperforming baseline methods. Moreover, our framework is adaptable to other watermarking techniques, offering new insights for precise watermark detection. Our code is publicly available at https://github.com/XuandongZhao/llm-watermark-location
title	Efficiently Identifying Watermarked Segments in Mixed-Source Texts
topic	Computation and Language
url	https://arxiv.org/abs/2410.03600

Similar Items