Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Qiu, Han, Zeng, Yi, Guo, Shangwei, Zhang, Tianwei, Qiu, Meikang, Thuraisingham, Bhavani
Format: Preprint
Veröffentlicht: 2020
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2012.07006
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866911750663700480
author Qiu, Han
Zeng, Yi
Guo, Shangwei
Zhang, Tianwei
Qiu, Meikang
Thuraisingham, Bhavani
author_facet Qiu, Han
Zeng, Yi
Guo, Shangwei
Zhang, Tianwei
Qiu, Meikang
Thuraisingham, Bhavani
contents Public resources and services (e.g., datasets, training platforms, pre-trained models) have been widely adopted to ease the development of Deep Learning-based applications. However, if the third-party providers are untrusted, they can inject poisoned samples into the datasets or embed backdoors in those models. Such an integrity breach can cause severe consequences, especially in safety- and security-critical applications. Various backdoor attack techniques have been proposed for higher effectiveness and stealthiness. Unfortunately, existing defense solutions are not practical to thwart those attacks in a comprehensive way. In this paper, we investigate the effectiveness of data augmentation techniques in mitigating backdoor attacks and enhancing DL models' robustness. An evaluation framework is introduced to achieve this goal. Specifically, we consider a unified defense solution, which (1) adopts a data augmentation policy to fine-tune the infected model and eliminate the effects of the embedded backdoor; (2) uses another augmentation policy to preprocess input samples and invalidate the triggers during inference. We propose a systematic approach to discover the optimal policies for defending against different backdoor attacks by comprehensively evaluating 71 state-of-the-art data augmentation functions. Extensive experiments show that our identified policy can effectively mitigate eight different kinds of backdoor attacks and outperform five existing defense methods. We envision this framework can be a good benchmark tool to advance future DNN backdoor studies.
format Preprint
id arxiv_https___arxiv_org_abs_2012_07006
institution arXiv
publishDate 2020
record_format arxiv
spellingShingle DeepSweep: An Evaluation Framework for Mitigating DNN Backdoor Attacks using Data Augmentation
Qiu, Han
Zeng, Yi
Guo, Shangwei
Zhang, Tianwei
Qiu, Meikang
Thuraisingham, Bhavani
Cryptography and Security
Machine Learning
Public resources and services (e.g., datasets, training platforms, pre-trained models) have been widely adopted to ease the development of Deep Learning-based applications. However, if the third-party providers are untrusted, they can inject poisoned samples into the datasets or embed backdoors in those models. Such an integrity breach can cause severe consequences, especially in safety- and security-critical applications. Various backdoor attack techniques have been proposed for higher effectiveness and stealthiness. Unfortunately, existing defense solutions are not practical to thwart those attacks in a comprehensive way. In this paper, we investigate the effectiveness of data augmentation techniques in mitigating backdoor attacks and enhancing DL models' robustness. An evaluation framework is introduced to achieve this goal. Specifically, we consider a unified defense solution, which (1) adopts a data augmentation policy to fine-tune the infected model and eliminate the effects of the embedded backdoor; (2) uses another augmentation policy to preprocess input samples and invalidate the triggers during inference. We propose a systematic approach to discover the optimal policies for defending against different backdoor attacks by comprehensively evaluating 71 state-of-the-art data augmentation functions. Extensive experiments show that our identified policy can effectively mitigate eight different kinds of backdoor attacks and outperform five existing defense methods. We envision this framework can be a good benchmark tool to advance future DNN backdoor studies.
title DeepSweep: An Evaluation Framework for Mitigating DNN Backdoor Attacks using Data Augmentation
topic Cryptography and Security
Machine Learning
url https://arxiv.org/abs/2012.07006