Internformat: :: Library Catalog

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Zhang, Hanxiu, Zheng, Yue
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Cryptography and Security Artificial Intelligence Computation and Language Machine Learning
Online-Zugang:	https://arxiv.org/abs/2512.03620
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

_version_	1866915651178725376
author	Zhang, Hanxiu Zheng, Yue
author_facet	Zhang, Hanxiu Zheng, Yue
contents	The protection of Intellectual Property (IP) in Large Language Models (LLMs) represents a critical challenge in contemporary AI research. While fingerprinting techniques have emerged as a fundamental mechanism for detecting unauthorized model usage, existing methods -- whether behavior-based or structural -- suffer from vulnerabilities such as false claim attacks or susceptible to weight manipulations. To overcome these limitations, we propose SELF, a novel intrinsic weight-based fingerprinting scheme that eliminates dependency on input and inherently resists false claims. SELF achieves robust IP protection through two key innovations: 1) unique, scalable and transformation-invariant fingerprint extraction via singular value and eigenvalue decomposition of LLM attention weights, and 2) effective neural network-based fingerprint similarity comparison based on few-shot learning and data augmentation. Experimental results demonstrate SELF maintains high IP infringement detection accuracy while showing strong robustness against various downstream modifications, including quantization, pruning, and fine-tuning attacks. Our code is available at https://github.com/HanxiuZhang/SELF_v2.
format	Preprint
id	arxiv_https___arxiv_org_abs_2512_03620
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	SELF: A Robust Singular Value and Eigenvalue Approach for LLM Fingerprinting Zhang, Hanxiu Zheng, Yue Cryptography and Security Artificial Intelligence Computation and Language Machine Learning The protection of Intellectual Property (IP) in Large Language Models (LLMs) represents a critical challenge in contemporary AI research. While fingerprinting techniques have emerged as a fundamental mechanism for detecting unauthorized model usage, existing methods -- whether behavior-based or structural -- suffer from vulnerabilities such as false claim attacks or susceptible to weight manipulations. To overcome these limitations, we propose SELF, a novel intrinsic weight-based fingerprinting scheme that eliminates dependency on input and inherently resists false claims. SELF achieves robust IP protection through two key innovations: 1) unique, scalable and transformation-invariant fingerprint extraction via singular value and eigenvalue decomposition of LLM attention weights, and 2) effective neural network-based fingerprint similarity comparison based on few-shot learning and data augmentation. Experimental results demonstrate SELF maintains high IP infringement detection accuracy while showing strong robustness against various downstream modifications, including quantization, pruning, and fine-tuning attacks. Our code is available at https://github.com/HanxiuZhang/SELF_v2.
title	SELF: A Robust Singular Value and Eigenvalue Approach for LLM Fingerprinting
topic	Cryptography and Security Artificial Intelligence Computation and Language Machine Learning
url	https://arxiv.org/abs/2512.03620

Ähnliche Einträge