Saved in:
Bibliographic Details
Main Authors: Zhao, Xuandong, Liao, Chenwen, Wang, Yu-Xiang, Li, Lei
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.03600
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915338520625152
author Zhao, Xuandong
Liao, Chenwen
Wang, Yu-Xiang
Li, Lei
author_facet Zhao, Xuandong
Liao, Chenwen
Wang, Yu-Xiang
Li, Lei
contents Text watermarks in large language models (LLMs) are increasingly used to detect synthetic text, mitigating misuse cases like fake news and academic dishonesty. While existing watermarking detection techniques primarily focus on classifying entire documents as watermarked or not, they often neglect the common scenario of identifying individual watermark segments within longer, mixed-source documents. Drawing inspiration from plagiarism detection systems, we propose two novel methods for partial watermark detection. First, we develop a geometry cover detection framework aimed at determining whether there is a watermark segment in long text. Second, we introduce an adaptive online learning algorithm to pinpoint the precise location of watermark segments within the text. Evaluated on three popular watermarking techniques (KGW-Watermark, Unigram-Watermark, and Gumbel-Watermark), our approach achieves high accuracy, significantly outperforming baseline methods. Moreover, our framework is adaptable to other watermarking techniques, offering new insights for precise watermark detection. Our code is publicly available at https://github.com/XuandongZhao/llm-watermark-location
format Preprint
id arxiv_https___arxiv_org_abs_2410_03600
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Efficiently Identifying Watermarked Segments in Mixed-Source Texts
Zhao, Xuandong
Liao, Chenwen
Wang, Yu-Xiang
Li, Lei
Computation and Language
Text watermarks in large language models (LLMs) are increasingly used to detect synthetic text, mitigating misuse cases like fake news and academic dishonesty. While existing watermarking detection techniques primarily focus on classifying entire documents as watermarked or not, they often neglect the common scenario of identifying individual watermark segments within longer, mixed-source documents. Drawing inspiration from plagiarism detection systems, we propose two novel methods for partial watermark detection. First, we develop a geometry cover detection framework aimed at determining whether there is a watermark segment in long text. Second, we introduce an adaptive online learning algorithm to pinpoint the precise location of watermark segments within the text. Evaluated on three popular watermarking techniques (KGW-Watermark, Unigram-Watermark, and Gumbel-Watermark), our approach achieves high accuracy, significantly outperforming baseline methods. Moreover, our framework is adaptable to other watermarking techniques, offering new insights for precise watermark detection. Our code is publicly available at https://github.com/XuandongZhao/llm-watermark-location
title Efficiently Identifying Watermarked Segments in Mixed-Source Texts
topic Computation and Language
url https://arxiv.org/abs/2410.03600