Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Su, Weijie, Wang, Ruodu, Zhao, Zinan
Format: Preprint
Veröffentlicht: 2026
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2602.14286
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866911582098817024
author Su, Weijie
Wang, Ruodu
Zhao, Zinan
author_facet Su, Weijie
Wang, Ruodu
Zhao, Zinan
contents Watermarking for large language models (LLMs) has emerged as an effective tool for distinguishing AI-generated text from human-written content. Statistically, watermark schemes induce dependence between generated tokens and a pseudo-random sequence, reducing watermark detection to a hypothesis testing problem on independence. We develop a unified framework for LLM watermark detection based on e-processes, providing anytime-valid guarantees for online testing. We propose various methods to construct empirically adaptive e-processes that can enhance the detection power. The proposed methods are applicable to any sequential testing problem where independent pivotal statistics are available. In addition, theoretical results are established to characterize the power properties of the proposed procedures. Some experiments demonstrate that the proposed framework achieves competitive performance compared to existing watermark detection methods.
format Preprint
id arxiv_https___arxiv_org_abs_2602_14286
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Online LLM watermark detection via e-processes
Su, Weijie
Wang, Ruodu
Zhao, Zinan
Methodology
Machine Learning
Watermarking for large language models (LLMs) has emerged as an effective tool for distinguishing AI-generated text from human-written content. Statistically, watermark schemes induce dependence between generated tokens and a pseudo-random sequence, reducing watermark detection to a hypothesis testing problem on independence. We develop a unified framework for LLM watermark detection based on e-processes, providing anytime-valid guarantees for online testing. We propose various methods to construct empirically adaptive e-processes that can enhance the detection power. The proposed methods are applicable to any sequential testing problem where independent pivotal statistics are available. In addition, theoretical results are established to characterize the power properties of the proposed procedures. Some experiments demonstrate that the proposed framework achieves competitive performance compared to existing watermark detection methods.
title Online LLM watermark detection via e-processes
topic Methodology
Machine Learning
url https://arxiv.org/abs/2602.14286