:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Walter, Mathew J., Barrett, Aaron, Tam, Kimberly
Format:	Preprint
Published:	2025
Subjects:	Cryptography and Security Artificial Intelligence
Online Access:	https://arxiv.org/abs/2505.21609
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Behavior-Aware and Generalizable Defense Against Black-Box Adversarial Attacks for ML-Based IDS
by: Ennaji, Sabrine, et al.
Published: (2025)

DiffAttack: Evasion Attacks Against Diffusion-Based Adversarial Purification
by: Kang, Mintong, et al.
Published: (2023)

Mitigation of Camouflaged Adversarial Attacks in Autonomous Vehicles--A Case Study Using CARLA Simulator
by: Martinez, Yago Romano, et al.
Published: (2025)

Integrated Simulation Framework for Adversarial Attacks on Autonomous Vehicles
by: Anagnostopoulos, Christos, et al.
Published: (2025)

AttackPilot: Autonomous Inference Attacks Against ML Services With LLM-Based Agents
by: Wu, Yixin, et al.
Published: (2025)

LLMs as Hackers: Autonomous Linux Privilege Escalation Attacks
by: Happe, Andreas, et al.
Published: (2023)

Signed-Prompt: A New Approach to Prevent Prompt Injection Attacks Against LLM-Integrated Applications
by: Suo, Xuchen
Published: (2024)

Securing AI Agents Against Prompt Injection Attacks
by: Ramakrishnan, Badrinath, et al.
Published: (2025)

Scam Shield: Multi-Model Voting and Fine-Tuned LLMs Against Adversarial Attacks
by: Chang, Chen-Wei, et al.
Published: (2025)

Adversarial Attacks Against Automated Fact-Checking: A Survey
by: Liu, Fanzhen, et al.
Published: (2025)

CivicShield: A Cross-Domain Defense-in-Depth Framework for Securing Government-Facing AI Chatbots Against Multi-Turn Adversarial Attacks
by: Patil, KrishnaSaiReddy
Published: (2026)

WASP: Benchmarking Web Agent Security Against Prompt Injection Attacks
by: Evtimov, Ivan, et al.
Published: (2025)

Adversarial Tuning: Defending Against Jailbreak Attacks for LLMs
by: Liu, Fan, et al.
Published: (2024)

Toward Trustworthy Agentic AI: A Multimodal Framework for Preventing Prompt Injection Attacks
by: Syed, Toqeer Ali, et al.
Published: (2025)

A White-Box Adversarial Attack Against a Digital Twin
by: Patterson, Wilson, et al.
Published: (2022)

Preventing Non-intrusive Load Monitoring Privacy Invasion: A Precise Adversarial Attack Scheme for Networked Smart Meters
by: He, Jialing, et al.
Published: (2024)

Design and Implementation of a Secure RAG-Enhanced AI Chatbot for Smart Tourism Customer Service: Defending Against Prompt Injection Attacks -- A Case Study of Hsinchu, Taiwan
by: Shih, Yu-Kai, et al.
Published: (2025)

MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
by: Zhu, Kaijie, et al.
Published: (2025)

Benchmarking Misuse Mitigation Against Covert Adversaries
by: Brown, Davis, et al.
Published: (2025)

Enhancing TinyML Security: Study of Adversarial Attack Transferability
by: Shah, Parin, et al.
Published: (2024)

MATRA: Modeling the Attack Surface of Agentic AI Systems -- OpenClaw Case Study
by: Van hamme, Tim, et al.
Published: (2026)

Jailbreaking Prompt Attack: A Controllable Adversarial Attack against Diffusion Models
by: Ma, Jiachen, et al.
Published: (2024)

Can Adversarial Code Comments Fool AI Security Reviewers -- Large-Scale Empirical Study of Comment-Based Attacks and Defenses Against LLM Code Analysis
by: Thornton, Scott
Published: (2026)

Cascade: Composing Software-Hardware Attack Gadgets for Adversarial Threat Amplification in Compound AI Systems
by: Banerjee, Sarbartha, et al.
Published: (2026)

Architecting Secure AI Agents: Perspectives on System-Level Defenses Against Indirect Prompt Injection Attacks
by: Xiang, Chong, et al.
Published: (2026)

A Model Stealing Attack Against Multi-Exit Networks
by: Pan, Li, et al.
Published: (2023)

Attacking LLMs and AI Agents: Advertisement Embedding Attacks Against Large Language Models
by: Guo, Qiming, et al.
Published: (2025)

How stealthy is stealthy? Studying the Efficacy of Black-Box Adversarial Attacks in the Real World
by: Panebianco, Francesco, et al.
Published: (2025)

Membership Inference Attacks Against Vision-Language Models
by: Hu, Yuke, et al.
Published: (2025)

Analysis of LLMs Against Prompt Injection and Jailbreak Attacks
by: Jaiswal, Piyush, et al.
Published: (2026)

Defenses Against Prompt Attacks Learn Surface Heuristics
by: Li, Shawn, et al.
Published: (2026)

Poster: ClawdGo: Endogenous Security Awareness Training for Autonomous AI Agents
by: Li, Jiaqi, et al.
Published: (2026)

SLEIGHT-Bench: A Benchmark of Evasion Attacks Against Agent Monitors
by: Najt, Elle, et al.
Published: (2026)

Adversarial Attack-Defense Co-Evolution for LLM Safety Alignment via Tree-Group Dual-Aware Search and Optimization
by: Li, Xurui, et al.
Published: (2025)

Adversarial Machine Learning: Attacks, Defenses, and Open Challenges
by: Jha, Pranav K
Published: (2025)

Defending Against Beta Poisoning Attacks in Machine Learning Models
by: Gulciftci, Nilufer, et al.
Published: (2025)

No Free Lunch for Defending Against Prefilling Attack by In-Context Learning
by: Xue, Zhiyu, et al.
Published: (2024)

Energy-Latency Attacks: A New Adversarial Threat to Deep Learning
by: Meftah, Hanene F. Z. Brachemi, et al.
Published: (2025)

Adversarial Attacks on Multimodal Large Language Models: A Comprehensive Survey
by: Jain, Bhavuk, et al.
Published: (2026)

TT-SEAL: TTD-Aware Selective Encryption for Adversarially-Robust and Low-Latency Edge AI
by: Min, Kyeongpil, et al.
Published: (2026)