Saved in:
Bibliographic Details
Main Authors: Yang, He, Lv, Dongyi, Ma, Song, Xi, Wei, Wang, Zhi, Gu, Hanlin, Wang, Yajie
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.28092
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908920528764928
author Yang, He
Lv, Dongyi
Ma, Song
Xi, Wei
Wang, Zhi
Gu, Hanlin
Wang, Yajie
author_facet Yang, He
Lv, Dongyi
Ma, Song
Xi, Wei
Wang, Zhi
Gu, Hanlin
Wang, Yajie
contents Dataset Condensation (DC) is a data-efficient learning paradigm that synthesizes small yet informative datasets, enabling models to match the performance of full-data training. However, recent work exposes a critical vulnerability of DC to backdoor attacks, where malicious patterns (\textit{e.g.}, triggers) are implanted into the condensation dataset, inducing targeted misclassification on specific inputs. Existing attacks always prioritize attack effectiveness and model utility, overlooking the crucial dimension of stealthiness. To bridge this gap, we propose InkDrop, which enhances the imperceptibility of malicious manipulation without degrading attack effectiveness and model utility. InkDrop leverages the inherent uncertainty near model decision boundaries, where minor input perturbations can induce semantic shifts, to construct a stealthy and effective backdoor attack. Specifically, InkDrop first selects candidate samples near the target decision boundary that exhibit latent semantic affinity to the target class. It then learns instance-dependent perturbations constrained by perceptual and spatial consistency, embedding targeted malicious behavior into the condensed dataset. Extensive experiments across diverse datasets validate the overall effectiveness of InkDrop, demonstrating its ability to integrate adversarial intent into condensed datasets while preserving model utility and minimizing detectability. Our code is available at https://github.com/lvdongyi/InkDrop.
format Preprint
id arxiv_https___arxiv_org_abs_2603_28092
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle InkDrop: Invisible Backdoor Attacks Against Dataset Condensation
Yang, He
Lv, Dongyi
Ma, Song
Xi, Wei
Wang, Zhi
Gu, Hanlin
Wang, Yajie
Machine Learning
Dataset Condensation (DC) is a data-efficient learning paradigm that synthesizes small yet informative datasets, enabling models to match the performance of full-data training. However, recent work exposes a critical vulnerability of DC to backdoor attacks, where malicious patterns (\textit{e.g.}, triggers) are implanted into the condensation dataset, inducing targeted misclassification on specific inputs. Existing attacks always prioritize attack effectiveness and model utility, overlooking the crucial dimension of stealthiness. To bridge this gap, we propose InkDrop, which enhances the imperceptibility of malicious manipulation without degrading attack effectiveness and model utility. InkDrop leverages the inherent uncertainty near model decision boundaries, where minor input perturbations can induce semantic shifts, to construct a stealthy and effective backdoor attack. Specifically, InkDrop first selects candidate samples near the target decision boundary that exhibit latent semantic affinity to the target class. It then learns instance-dependent perturbations constrained by perceptual and spatial consistency, embedding targeted malicious behavior into the condensed dataset. Extensive experiments across diverse datasets validate the overall effectiveness of InkDrop, demonstrating its ability to integrate adversarial intent into condensed datasets while preserving model utility and minimizing detectability. Our code is available at https://github.com/lvdongyi/InkDrop.
title InkDrop: Invisible Backdoor Attacks Against Dataset Condensation
topic Machine Learning
url https://arxiv.org/abs/2603.28092