:: Library Catalog

Bibliographic Details
Main Authors:	Huster, Todd; Lin, Peter; Stefanescu, Razvan; Ekwedike, Emmanuel; Chadha, Ritu
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Artificial Intelligence; Computation and Language; Cryptography and Security; Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2411.03445

Similar Items

CatchBackdoor: Backdoor Detection via Critical Trojan Neural Path Fuzzing
by: Jin, Haibo, et al.
Published: (2021)

Mind the Trojan Horse: Image Prompt Adapter Enabling Scalable and Deceptive Jailbreaking
by: Chen, Junxi, et al.
Published: (2025)

An Experimental Study of Trojan Vulnerabilities in UAV Autonomous Landing
by: Ahmari, Reza, et al.
Published: (2025)

DeepSight: An All-in-One LM Safety Toolkit
by: Zhang, Bo, et al.
Published: (2026)

Event Trojan: Asynchronous Event-based Backdoor Attacks
by: Wang, Ruofei, et al.
Published: (2024)

A Survey of Trojan Attacks and Defenses to Deep Neural Networks
by: Jin, Lingxin, et al.
Published: (2024)

Safety at Scale: A Comprehensive Survey of Large Model and Agent Safety
by: Ma, Xingjun, et al.
Published: (2025)

Navigation as Attackers Wish? Towards Building Robust Embodied Agents under Federated Learning
by: Zhang, Yunchao, et al.
Published: (2022)

Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems
by: Wang, Guangjing, et al.
Published: (2023)

PII-VisBench: Evaluating Personally Identifiable Information Safety in Vision Language Models Along a Continuum of Visibility
by: Shahariar, G M, et al.
Published: (2026)

SafeGen: Mitigating Sexually Explicit Content Generation in Text-to-Image Models
by: Li, Xinfeng, et al.
Published: (2024)

Rethinking Machine Unlearning in Image Generation Models
by: Liu, Renyang, et al.
Published: (2025)

VLMs Can Aggregate Scattered Training Patches
by: Zhou, Zhanhui, et al.
Published: (2025)

Jailbreaking Safeguarded Text-to-Image Models via Large Language Models
by: Jiang, Zhengyuan, et al.
Published: (2025)

VLSBench: Unveiling Visual Leakage in Multimodal Safety
by: Hu, Xuhao, et al.
Published: (2024)

SSL-Cleanse: Trojan Detection and Mitigation in Self-Supervised Learning
by: Zheng, Mengxin, et al.
Published: (2023)

Effective and Efficient Adversarial Detection for Vision-Language Models via A Single Vector
by: Huang, Youcheng, et al.
Published: (2024)

Recovering the Pre-Fine-Tuning Weights of Generative Models
by: Horwitz, Eliahu, et al.
Published: (2024)

Effective Fine-Tuning of Vision Transformers with Low-Rank Adaptation for Privacy-Preserving Image Classification
by: Lin, Haiwei, et al.
Published: (2025)

Selective Amnesia: On Efficient, High-Fidelity and Blind Suppression of Backdoor Effects in Trojaned Machine Learning Models
by: Zhu, Rui, et al.
Published: (2022)

Chain-of-Jailbreak Attack for Image Generation Models via Editing Step by Step
by: Wang, Wenxuan, et al.
Published: (2024)

Is Artificial Intelligence Generated Image Detection a Solved Problem?
by: Li, Ziqiang, et al.
Published: (2025)

Transformers and Large Language Models for Efficient Intrusion Detection Systems: A Comprehensive Survey
by: Kheddar, Hamza
Published: (2024)

CAPTCHA Solving for Native GUI Agents: Automated Reasoning-Action Data Generation and Self-Corrective Training
by: Chen, Yuxi, et al.
Published: (2026)

AgentDoG 1.5: A Lightweight and Scalable Alignment Framework for AI Agent Safety and Security
by: Liu, Dongrui, et al.
Published: (2026)

Are GUI Agents Focused Enough? Automated Distraction via Semantic-level UI Element Injection
by: Yang, Wenkui, et al.
Published: (2026)

Iteratively Prompting Multimodal LLMs to Reproduce Natural and AI-Generated Images
by: Naseh, Ali, et al.
Published: (2024)

IAG: Input-aware Backdoor Attack on VLM-based Visual Grounding
by: Li, Junxian, et al.
Published: (2025)

Image-Based Geolocation Using Large Vision-Language Models
by: Liu, Yi, et al.
Published: (2024)

MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance
by: Pi, Renjie, et al.
Published: (2024)

Privacy-Preserving Federated Learning with Verifiable Fairness Guarantees
by: Ali, Mohammed Himayath, et al.
Published: (2026)

Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection
by: Miao, Ziqi, et al.
Published: (2025)

SlowBA: An efficiency backdoor attack towards VLM-based GUI agents
by: Li, Junxian, et al.
Published: (2026)

Generating Synthetic Data with Formal Privacy Guarantees: State of the Art and the Road Ahead
by: Schlegel, Viktor, et al.
Published: (2025)

Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models
by: Ding, Yi, et al.
Published: (2025)

Contextual Image Attack: How Visual Context Exposes Multimodal Safety Vulnerabilities
by: Xiong, Yuan, et al.
Published: (2025)

Are Vision-Language Models Safe in the Wild? A Meme-Based Benchmark Study
by: Lee, DongGeon, et al.
Published: (2025)

Self-adaptive Dataset Construction for Real-World Multimodal Safety Scenarios
by: Qu, Jingen, et al.
Published: (2025)

Doubly-Universal Adversarial Perturbations: Deceiving Vision-Language Models Across Both Images and Text with a Single Perturbation
by: Kim, Hee-Seon, et al.
Published: (2024)

BSPA: Exploring Black-box Stealthy Prompt Attacks against Image Generators
by: Tian, Yu, et al.
Published: (2024)