Saved in:
| Main Authors: | Huster, Todd, Lin, Peter, Stefanescu, Razvan, Ekwedike, Emmanuel, Chadha, Ritu |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2411.03445 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CatchBackdoor: Backdoor Detection via Critical Trojan Neural Path Fuzzing
by: Jin, Haibo, et al.
Published: (2021)
by: Jin, Haibo, et al.
Published: (2021)
Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking
by: Chen, Junxi, et al.
Published: (2025)
by: Chen, Junxi, et al.
Published: (2025)
An Experimental Study of Trojan Vulnerabilities in UAV Autonomous Landing
by: Ahmari, Reza, et al.
Published: (2025)
by: Ahmari, Reza, et al.
Published: (2025)
DeepSight: An All-in-One LM Safety Toolkit
by: Zhang, Bo, et al.
Published: (2026)
by: Zhang, Bo, et al.
Published: (2026)
Event Trojan: Asynchronous Event-based Backdoor Attacks
by: Wang, Ruofei, et al.
Published: (2024)
by: Wang, Ruofei, et al.
Published: (2024)
A Survey of Trojan Attacks and Defenses to Deep Neural Networks
by: Jin, Lingxin, et al.
Published: (2024)
by: Jin, Lingxin, et al.
Published: (2024)
Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety
by: Ma, Xingjun, et al.
Published: (2025)
by: Ma, Xingjun, et al.
Published: (2025)
Navigation as Attackers Wish? Towards Building Robust Embodied Agents under Federated Learning
by: Zhang, Yunchao, et al.
Published: (2022)
by: Zhang, Yunchao, et al.
Published: (2022)
Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems
by: Wang, Guangjing, et al.
Published: (2023)
by: Wang, Guangjing, et al.
Published: (2023)
PII-VisBench: Evaluating Personally Identifiable Information Safety in Vision Language Models Along a Continuum of Visibility
by: Shahariar, G M, et al.
Published: (2026)
by: Shahariar, G M, et al.
Published: (2026)
SafeGen: Mitigating Sexually Explicit Content Generation in Text-to-Image Models
by: Li, Xinfeng, et al.
Published: (2024)
by: Li, Xinfeng, et al.
Published: (2024)
Rethinking Machine Unlearning in Image Generation Models
by: Liu, Renyang, et al.
Published: (2025)
by: Liu, Renyang, et al.
Published: (2025)
VLMs Can Aggregate Scattered Training Patches
by: Zhou, Zhanhui, et al.
Published: (2025)
by: Zhou, Zhanhui, et al.
Published: (2025)
Jailbreaking Safeguarded Text-to-Image Models via Large Language Models
by: Jiang, Zhengyuan, et al.
Published: (2025)
by: Jiang, Zhengyuan, et al.
Published: (2025)
VLSBench: Unveiling Visual Leakage in Multimodal Safety
by: Hu, Xuhao, et al.
Published: (2024)
by: Hu, Xuhao, et al.
Published: (2024)
SSL-Cleanse: Trojan Detection and Mitigation in Self-Supervised Learning
by: Zheng, Mengxin, et al.
Published: (2023)
by: Zheng, Mengxin, et al.
Published: (2023)
Effective and Efficient Adversarial Detection for Vision-Language Models via A Single Vector
by: Huang, Youcheng, et al.
Published: (2024)
by: Huang, Youcheng, et al.
Published: (2024)
Recovering the Pre-Fine-Tuning Weights of Generative Models
by: Horwitz, Eliahu, et al.
Published: (2024)
by: Horwitz, Eliahu, et al.
Published: (2024)
Effective Fine-Tuning of Vision Transformers with Low-Rank Adaptation for Privacy-Preserving Image Classification
by: Lin, Haiwei, et al.
Published: (2025)
by: Lin, Haiwei, et al.
Published: (2025)
Selective Amnesia: On Efficient, High-Fidelity and Blind Suppression of Backdoor Effects in Trojaned Machine Learning Models
by: Zhu, Rui, et al.
Published: (2022)
by: Zhu, Rui, et al.
Published: (2022)
Chain-of-Jailbreak Attack for Image Generation Models via Editing Step by Step
by: Wang, Wenxuan, et al.
Published: (2024)
by: Wang, Wenxuan, et al.
Published: (2024)
Is Artificial Intelligence Generated Image Detection a Solved Problem?
by: Li, Ziqiang, et al.
Published: (2025)
by: Li, Ziqiang, et al.
Published: (2025)
Transformers and Large Language Models for Efficient Intrusion Detection Systems: A Comprehensive Survey
by: Kheddar, Hamza
Published: (2024)
by: Kheddar, Hamza
Published: (2024)
CAPTCHA Solving for Native GUI Agents: Automated Reasoning-Action Data Generation and Self-Corrective Training
by: Chen, Yuxi, et al.
Published: (2026)
by: Chen, Yuxi, et al.
Published: (2026)
AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security
by: Liu, Dongrui, et al.
Published: (2026)
by: Liu, Dongrui, et al.
Published: (2026)
Are GUI Agents Focused Enough? Automated Distraction via Semantic-level UI Element Injection
by: Yang, Wenkui, et al.
Published: (2026)
by: Yang, Wenkui, et al.
Published: (2026)
Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images
by: Naseh, Ali, et al.
Published: (2024)
by: Naseh, Ali, et al.
Published: (2024)
IAG: Input-aware Backdoor Attack on VLM-based Visual Grounding
by: Li, Junxian, et al.
Published: (2025)
by: Li, Junxian, et al.
Published: (2025)
Image-Based Geolocation Using Large Vision-Language Models
by: Liu, Yi, et al.
Published: (2024)
by: Liu, Yi, et al.
Published: (2024)
MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance
by: Pi, Renjie, et al.
Published: (2024)
by: Pi, Renjie, et al.
Published: (2024)
Privacy-Preserving Federated Learning with Verifiable Fairness Guarantees
by: Ali, Mohammed Himayath, et al.
Published: (2026)
by: Ali, Mohammed Himayath, et al.
Published: (2026)
Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection
by: Miao, Ziqi, et al.
Published: (2025)
by: Miao, Ziqi, et al.
Published: (2025)
SlowBA: An efficiency backdoor attack towards VLM-based GUI agents
by: Li, Junxian, et al.
Published: (2026)
by: Li, Junxian, et al.
Published: (2026)
Generating Synthetic Data with Formal Privacy Guarantees: State of the Art and the Road Ahead
by: Schlegel, Viktor, et al.
Published: (2025)
by: Schlegel, Viktor, et al.
Published: (2025)
Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models
by: Ding, Yi, et al.
Published: (2025)
by: Ding, Yi, et al.
Published: (2025)
Contextual Image Attack: How Visual Context Exposes Multimodal Safety Vulnerabilities
by: Xiong, Yuan, et al.
Published: (2025)
by: Xiong, Yuan, et al.
Published: (2025)
Are Vision-Language Models Safe in the Wild? A Meme-Based Benchmark Study
by: Lee, DongGeon, et al.
Published: (2025)
by: Lee, DongGeon, et al.
Published: (2025)
Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios
by: Qu, Jingen, et al.
Published: (2025)
by: Qu, Jingen, et al.
Published: (2025)
Doubly-Universal Adversarial Perturbations: Deceiving Vision-Language Models Across Both Images and Text with a Single Perturbation
by: Kim, Hee-Seon, et al.
Published: (2024)
by: Kim, Hee-Seon, et al.
Published: (2024)
BSPA: Exploring Black-box Stealthy Prompt Attacks against Image Generators
by: Tian, Yu, et al.
Published: (2024)
by: Tian, Yu, et al.
Published: (2024)
Similar Items
-
CatchBackdoor: Backdoor Detection via Critical Trojan Neural Path Fuzzing
by: Jin, Haibo, et al.
Published: (2021) -
Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking
by: Chen, Junxi, et al.
Published: (2025) -
An Experimental Study of Trojan Vulnerabilities in UAV Autonomous Landing
by: Ahmari, Reza, et al.
Published: (2025) -
DeepSight: An All-in-One LM Safety Toolkit
by: Zhang, Bo, et al.
Published: (2026) -
Event Trojan: Asynchronous Event-based Backdoor Attacks
by: Wang, Ruofei, et al.
Published: (2024)