MARC21: :: Library Catalog

Salvato in:

Dettagli Bibliografici
Autori principali:	Yi, Jiatong, Li, Yanyang
Natura:	Preprint
Pubblicazione:	2026
Soggetti:	Computation and Language
Accesso online:	https://arxiv.org/abs/2601.11314
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

_version_	1866908770657894400
author	Yi, Jiatong Li, Yanyang
author_facet	Yi, Jiatong Li, Yanyang
contents	Membership Inference Attacks (MIAs) act as a crucial auditing tool for the opaque training data of Large Language Models (LLMs). However, existing techniques predominantly rely on inaccessible model internals (e.g., logits) or suffer from poor generalization across domains in strict black-box settings where only generated text is available. In this work, we propose SimMIA, a robust MIA framework tailored for this text-only regime by leveraging an advanced sampling strategy and scoring mechanism. Furthermore, we present WikiMIA-25, a new benchmark curated to evaluate MIA performance on modern proprietary LLMs. Experiments demonstrate that SimMIA achieves state-of-the-art results in the black-box setting, rivaling baselines that exploit internal model information.
format	Preprint
id	arxiv_https___arxiv_org_abs_2601_11314
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Membership Inference on LLMs in the Wild Yi, Jiatong Li, Yanyang Computation and Language Membership Inference Attacks (MIAs) act as a crucial auditing tool for the opaque training data of Large Language Models (LLMs). However, existing techniques predominantly rely on inaccessible model internals (e.g., logits) or suffer from poor generalization across domains in strict black-box settings where only generated text is available. In this work, we propose SimMIA, a robust MIA framework tailored for this text-only regime by leveraging an advanced sampling strategy and scoring mechanism. Furthermore, we present WikiMIA-25, a new benchmark curated to evaluate MIA performance on modern proprietary LLMs. Experiments demonstrate that SimMIA achieves state-of-the-art results in the black-box setting, rivaling baselines that exploit internal model information.
title	Membership Inference on LLMs in the Wild
topic	Computation and Language
url	https://arxiv.org/abs/2601.11314

Documenti analoghi