Saved in:
Bibliographic Details
Main Authors: Sun, Ke, Bao, Guangsheng, Cui, Han, Zhang, Yue
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.01240
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912866884386816
author Sun, Ke
Bao, Guangsheng
Cui, Han
Zhang, Yue
author_facet Sun, Ke
Bao, Guangsheng
Cui, Han
Zhang, Yue
contents Zero-shot methods detect LLM-generated text by computing statistical signatures using a surrogate model. Existing approaches typically employ a fixed surrogate for all inputs regardless of the unknown source. We systematically examine this design and find that detection performance varies substantially depending on surrogate-source alignment. We observe that while no single surrogate achieves optimal performance universally, a well-matched surrogate typically exists within a diverse pool for any given input. This finding transforms robust detection into a routing problem: selecting the most appropriate surrogate for each input. We propose DetectRouter, a prototype-based framework that learns text-detector affinity through two-stage training. The first stage constructs discriminative prototypes from white-box models; the second generalizes to black-box sources by aligning geometric distances with observed detection scores. Experiments on EvoBench and MAGE benchmarks demonstrate consistent improvements across multiple detection criteria and model families.
format Preprint
id arxiv_https___arxiv_org_abs_2602_01240
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Minimizing Mismatch Risk: A Prototype-Based Routing Framework for Zero-shot LLM-generated Text Detection
Sun, Ke
Bao, Guangsheng
Cui, Han
Zhang, Yue
Computation and Language
Zero-shot methods detect LLM-generated text by computing statistical signatures using a surrogate model. Existing approaches typically employ a fixed surrogate for all inputs regardless of the unknown source. We systematically examine this design and find that detection performance varies substantially depending on surrogate-source alignment. We observe that while no single surrogate achieves optimal performance universally, a well-matched surrogate typically exists within a diverse pool for any given input. This finding transforms robust detection into a routing problem: selecting the most appropriate surrogate for each input. We propose DetectRouter, a prototype-based framework that learns text-detector affinity through two-stage training. The first stage constructs discriminative prototypes from white-box models; the second generalizes to black-box sources by aligning geometric distances with observed detection scores. Experiments on EvoBench and MAGE benchmarks demonstrate consistent improvements across multiple detection criteria and model families.
title Minimizing Mismatch Risk: A Prototype-Based Routing Framework for Zero-shot LLM-generated Text Detection
topic Computation and Language
url https://arxiv.org/abs/2602.01240