Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Shen, Bolin, Cheng, Zhan, Gong, Neil Zhenqiang, Yao, Fan, Dong, Yushun
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2602.20419
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912922893025280
author	Shen, Bolin Cheng, Zhan Gong, Neil Zhenqiang Yao, Fan Dong, Yushun
author_facet	Shen, Bolin Cheng, Zhan Gong, Neil Zhenqiang Yao, Fan Dong, Yushun
contents	Machine Learning as a Service (MLaaS) has emerged as a widely adopted paradigm for providing access to deep neural network (DNN) models, enabling users to conveniently leverage these models through standardized APIs. However, such services are highly vulnerable to Model Extraction Attacks (MEAs), where an adversary repeatedly queries a target model to collect input-output pairs and uses them to train a surrogate model that closely replicates its functionality. While numerous defense strategies have been proposed, verifying the ownership of a suspicious model with strict theoretical guarantees remains a challenging task. To address this gap, we introduce CREDIT, a certified ownership verification against MEAs. Specifically, we employ mutual information to quantify the similarity between DNN models, propose a practical verification threshold, and provide rigorous theoretical guarantees for ownership verification based on this threshold. We extensively evaluate our approach on several mainstream datasets across different domains and tasks, achieving state-of-the-art performance. Our implementation is publicly available at: https://github.com/LabRAI/CREDIT.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_20419
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	CREDIT: Certified Ownership Verification of Deep Neural Networks Against Model Extraction Attacks Shen, Bolin Cheng, Zhan Gong, Neil Zhenqiang Yao, Fan Dong, Yushun Machine Learning Machine Learning as a Service (MLaaS) has emerged as a widely adopted paradigm for providing access to deep neural network (DNN) models, enabling users to conveniently leverage these models through standardized APIs. However, such services are highly vulnerable to Model Extraction Attacks (MEAs), where an adversary repeatedly queries a target model to collect input-output pairs and uses them to train a surrogate model that closely replicates its functionality. While numerous defense strategies have been proposed, verifying the ownership of a suspicious model with strict theoretical guarantees remains a challenging task. To address this gap, we introduce CREDIT, a certified ownership verification against MEAs. Specifically, we employ mutual information to quantify the similarity between DNN models, propose a practical verification threshold, and provide rigorous theoretical guarantees for ownership verification based on this threshold. We extensively evaluate our approach on several mainstream datasets across different domains and tasks, achieving state-of-the-art performance. Our implementation is publicly available at: https://github.com/LabRAI/CREDIT.
title	CREDIT: Certified Ownership Verification of Deep Neural Networks Against Model Extraction Attacks
topic	Machine Learning
url	https://arxiv.org/abs/2602.20419

Similar Items