Saved in:
| Main Authors: | Huang, Kai, Wang, Haoming, Gao, Wei |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.17472 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CP-Guard+: A New Paradigm for Malicious Agent Detection and Defense in Collaborative Perception
by: Hu, Senkang, et al.
Published: (2025)
by: Hu, Senkang, et al.
Published: (2025)
A Probabilistic Fluctuation based Membership Inference Attack for Diffusion Models
by: Fu, Wenjie, et al.
Published: (2023)
by: Fu, Wenjie, et al.
Published: (2023)
Diffusion Soup: Model Merging for Text-to-Image Diffusion Models
by: Biggs, Benjamin, et al.
Published: (2024)
by: Biggs, Benjamin, et al.
Published: (2024)
Hypnopaedia-Aware Machine Unlearning via Psychometrics of Artificial Mental Imagery
by: Chang, Ching-Chun, et al.
Published: (2024)
by: Chang, Ching-Chun, et al.
Published: (2024)
Real-world Adversarial Defense against Patch Attacks based on Diffusion Model
by: Wei, Xingxing, et al.
Published: (2024)
by: Wei, Xingxing, et al.
Published: (2024)
Training Data Protection with Compositional Diffusion Models
by: Golatkar, Aditya, et al.
Published: (2023)
by: Golatkar, Aditya, et al.
Published: (2023)
REFINE: Inversion-Free Backdoor Defense via Model Reprogramming
by: Chen, Yukun, et al.
Published: (2025)
by: Chen, Yukun, et al.
Published: (2025)
PointNCBW: Towards Dataset Ownership Verification for Point Clouds via Negative Clean-label Backdoor Watermark
by: Wei, Cheng, et al.
Published: (2024)
by: Wei, Cheng, et al.
Published: (2024)
OmniGuard: Unified Omni-Modal Guardrails with Deliberate Reasoning
by: Zhu, Boyu, et al.
Published: (2025)
by: Zhu, Boyu, et al.
Published: (2025)
SWA-LDM: Toward Stealthy Watermarks for Latent Diffusion Models
by: Yang, Zhonghao, et al.
Published: (2025)
by: Yang, Zhonghao, et al.
Published: (2025)
Towards Black-Box Membership Inference Attack for Diffusion Models
by: Li, Jingwei, et al.
Published: (2024)
by: Li, Jingwei, et al.
Published: (2024)
Redesigning Traffic Signs to Mitigate Machine-Learning Patch Attacks
by: Shua, Tsufit, et al.
Published: (2024)
by: Shua, Tsufit, et al.
Published: (2024)
DIFFender: Diffusion-Based Adversarial Defense against Patch Attacks
by: Kang, Caixin, et al.
Published: (2023)
by: Kang, Caixin, et al.
Published: (2023)
What Lurks Within? Concept Auditing for Shared Diffusion Models at Scale
by: Yuan, Xiaoyong, et al.
Published: (2025)
by: Yuan, Xiaoyong, et al.
Published: (2025)
Towards Resilient Safety-driven Unlearning for Diffusion Models against Downstream Fine-tuning
by: Li, Boheng, et al.
Published: (2025)
by: Li, Boheng, et al.
Published: (2025)
Selective Amnesia: On Efficient, High-Fidelity and Blind Suppression of Backdoor Effects in Trojaned Machine Learning Models
by: Zhu, Rui, et al.
Published: (2022)
by: Zhu, Rui, et al.
Published: (2022)
SPQR: A Standardized Benchmark for Modern Safety Alignment Methods in Text-to-Image Diffusion Models
by: Alam, Mohammed Talha, et al.
Published: (2025)
by: Alam, Mohammed Talha, et al.
Published: (2025)
IDEA: An Inverse Domain Expert Adaptation Based Active DNN IP Protection Method
by: Xu, Chaohui, et al.
Published: (2024)
by: Xu, Chaohui, et al.
Published: (2024)
Kill it with FIRE: On Leveraging Latent Space Directions for Runtime Backdoor Mitigation in Deep Neural Networks
by: Ahlers, Enrico, et al.
Published: (2026)
by: Ahlers, Enrico, et al.
Published: (2026)
A Novel Defense Against Poisoning Attacks on Federated Learning: LayerCAM Augmented with Autoencoder
by: Zheng, Jingjing, et al.
Published: (2024)
by: Zheng, Jingjing, et al.
Published: (2024)
Efficient Backdoor Defense in Multimodal Contrastive Learning: A Token-Level Unlearning Method for Mitigating Threats
by: Liu, Kuanrong, et al.
Published: (2024)
by: Liu, Kuanrong, et al.
Published: (2024)
Red-Teaming Text-to-Image Systems by Rule-based Preference Modeling
by: Cao, Yichuan, et al.
Published: (2025)
by: Cao, Yichuan, et al.
Published: (2025)
Efficient Black-box Adversarial Attacks via Bayesian Optimization Guided by a Function Prior
by: Cheng, Shuyu, et al.
Published: (2024)
by: Cheng, Shuyu, et al.
Published: (2024)
Improving Robustness to Model Inversion Attacks via Sparse Coding Architectures
by: Dibbo, Sayanton V., et al.
Published: (2024)
by: Dibbo, Sayanton V., et al.
Published: (2024)
REFORGE: Multi-modal Attacks Reveal Vulnerable Concept Unlearning in Image Generation Models
by: Zou, Yong, et al.
Published: (2026)
by: Zou, Yong, et al.
Published: (2026)
Towards Sample-specific Backdoor Attack with Clean Labels via Attribute Trigger
by: Zhu, Mingyan, et al.
Published: (2023)
by: Zhu, Mingyan, et al.
Published: (2023)
Exploring User-level Gradient Inversion with a Diffusion Prior
by: Li, Zhuohang, et al.
Published: (2024)
by: Li, Zhuohang, et al.
Published: (2024)
PubDef: Defending Against Transfer Attacks From Public Models
by: Sitawarin, Chawin, et al.
Published: (2023)
by: Sitawarin, Chawin, et al.
Published: (2023)
MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks
by: Ha, Hyeonjeong, et al.
Published: (2025)
by: Ha, Hyeonjeong, et al.
Published: (2025)
FC-Attack: Jailbreaking Multimodal Large Language Models via Auto-Generated Flowcharts
by: Zhang, Ziyi, et al.
Published: (2025)
by: Zhang, Ziyi, et al.
Published: (2025)
Transferable Black-Box One-Shot Forging of Watermarks via Image Preference Models
by: Souček, Tomáš, et al.
Published: (2025)
by: Souček, Tomáš, et al.
Published: (2025)
CLIP-Inspector: Model-Level Backdoor Detection for Prompt-Tuned CLIP via OOD Trigger Inversion
by: Jindal, Akshit, et al.
Published: (2026)
by: Jindal, Akshit, et al.
Published: (2026)
DarkLLM: Learning Language-Driven Adversarial Attacks with Large Language Models
by: Sun, Ye, et al.
Published: (2026)
by: Sun, Ye, et al.
Published: (2026)
CGI-DM: Digital Copyright Authentication for Diffusion Models via Contrasting Gradient Inversion
by: Wu, Xiaoyu, et al.
Published: (2024)
by: Wu, Xiaoyu, et al.
Published: (2024)
PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models
by: Yuan, Lingzhi, et al.
Published: (2025)
by: Yuan, Lingzhi, et al.
Published: (2025)
A Survey on Physical Adversarial Attacks against Face Recognition Systems
by: Wang, Mingsi, et al.
Published: (2024)
by: Wang, Mingsi, et al.
Published: (2024)
Better Safe than Sorry: Pre-training CLIP against Targeted Data Poisoning and Backdoor Attacks
by: Yang, Wenhan, et al.
Published: (2023)
by: Yang, Wenhan, et al.
Published: (2023)
PrivFedTalk: Privacy-Aware Federated Diffusion with Identity-Stable Adapters for Personalized Talking-Head Generation
by: Mazumdar, Soumya, et al.
Published: (2026)
by: Mazumdar, Soumya, et al.
Published: (2026)
Leave My Images Alone: Preventing Multi-Modal Large Language Models from Analyzing Images via Visual Prompt Injection
by: Shao, Zedian, et al.
Published: (2026)
by: Shao, Zedian, et al.
Published: (2026)
Rapid Plug-in Defenders
by: Wu, Kai, et al.
Published: (2023)
by: Wu, Kai, et al.
Published: (2023)
Similar Items
-
CP-Guard+: A New Paradigm for Malicious Agent Detection and Defense in Collaborative Perception
by: Hu, Senkang, et al.
Published: (2025) -
A Probabilistic Fluctuation based Membership Inference Attack for Diffusion Models
by: Fu, Wenjie, et al.
Published: (2023) -
Diffusion Soup: Model Merging for Text-to-Image Diffusion Models
by: Biggs, Benjamin, et al.
Published: (2024) -
Hypnopaedia-Aware Machine Unlearning via Psychometrics of Artificial Mental Imagery
by: Chang, Ching-Chun, et al.
Published: (2024) -
Real-world Adversarial Defense against Patch Attacks based on Diffusion Model
by: Wei, Xingxing, et al.
Published: (2024)