| Main Authors: | Walter, Mathew J., Barrett, Aaron, Tam, Kimberly |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2505.21609 |
Similar Items
Behavior-Aware and Generalizable Defense Against Black-Box Adversarial Attacks for ML-Based IDS
by: Ennaji, Sabrine, et al.
Published: (2025)
DiffAttack: Evasion Attacks Against Diffusion-Based Adversarial Purification
by: Kang, Mintong, et al.
Published: (2023)
Mitigation of Camouflaged Adversarial Attacks in Autonomous Vehicles--A Case Study Using CARLA Simulator
by: Martinez, Yago Romano, et al.
Published: (2025)
Integrated Simulation Framework for Adversarial Attacks on Autonomous Vehicles
by: Anagnostopoulos, Christos, et al.
Published: (2025)
AttackPilot: Autonomous Inference Attacks Against ML Services With LLM-Based Agents
by: Wu, Yixin, et al.
Published: (2025)
LLMs as Hackers: Autonomous Linux Privilege Escalation Attacks
by: Happe, Andreas, et al.
Published: (2023)
Signed-Prompt: A New Approach to Prevent Prompt Injection Attacks Against LLM-Integrated Applications
by: Suo, Xuchen
Published: (2024)
Securing AI Agents Against Prompt Injection Attacks
by: Ramakrishnan, Badrinath, et al.
Published: (2025)
Scam Shield: Multi-Model Voting and Fine-Tuned LLMs Against Adversarial Attacks
by: Chang, Chen-Wei, et al.
Published: (2025)
Adversarial Attacks Against Automated Fact-Checking: A Survey
by: Liu, Fanzhen, et al.
Published: (2025)
CivicShield: A Cross-Domain Defense-in-Depth Framework for Securing Government-Facing AI Chatbots Against Multi-Turn Adversarial Attacks
by: Patil, KrishnaSaiReddy
Published: (2026)
WASP: Benchmarking Web Agent Security Against Prompt Injection Attacks
by: Evtimov, Ivan, et al.
Published: (2025)
Adversarial Tuning: Defending Against Jailbreak Attacks for LLMs
by: Liu, Fan, et al.
Published: (2024)
Toward Trustworthy Agentic AI: A Multimodal Framework for Preventing Prompt Injection Attacks
by: Syed, Toqeer Ali, et al.
Published: (2025)
A White-Box Adversarial Attack Against a Digital Twin
by: Patterson, Wilson, et al.
Published: (2022)
Preventing Non-intrusive Load Monitoring Privacy Invasion: A Precise Adversarial Attack Scheme for Networked Smart Meters
by: He, Jialing, et al.
Published: (2024)
Design and Implementation of a Secure RAG-Enhanced AI Chatbot for Smart Tourism Customer Service: Defending Against Prompt Injection Attacks -- A Case Study of Hsinchu, Taiwan
by: Shih, Yu-Kai, et al.
Published: (2025)
MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
by: Zhu, Kaijie, et al.
Published: (2025)
Benchmarking Misuse Mitigation Against Covert Adversaries
by: Brown, Davis, et al.
Published: (2025)
Enhancing TinyML Security: Study of Adversarial Attack Transferability
by: Shah, Parin, et al.
Published: (2024)
MATRA: Modeling the Attack Surface of Agentic AI Systems -- OpenClaw Case Study
by: Van hamme, Tim, et al.
Published: (2026)
Jailbreaking Prompt Attack: A Controllable Adversarial Attack against Diffusion Models
by: Ma, Jiachen, et al.
Published: (2024)
Can Adversarial Code Comments Fool AI Security Reviewers -- Large-Scale Empirical Study of Comment-Based Attacks and Defenses Against LLM Code Analysis
by: Thornton, Scott
Published: (2026)
Cascade: Composing Software-Hardware Attack Gadgets for Adversarial Threat Amplification in Compound AI Systems
by: Banerjee, Sarbartha, et al.
Published: (2026)
Architecting Secure AI Agents: Perspectives on System-Level Defenses Against Indirect Prompt Injection Attacks
by: Xiang, Chong, et al.
Published: (2026)
A Model Stealing Attack Against Multi-Exit Networks
by: Pan, Li, et al.
Published: (2023)
Attacking LLMs and AI Agents: Advertisement Embedding Attacks Against Large Language Models
by: Guo, Qiming, et al.
Published: (2025)
How stealthy is stealthy? Studying the Efficacy of Black-Box Adversarial Attacks in the Real World
by: Panebianco, Francesco, et al.
Published: (2025)
Membership Inference Attacks Against Vision-Language Models
by: Hu, Yuke, et al.
Published: (2025)
Analysis of LLMs Against Prompt Injection and Jailbreak Attacks
by: Jaiswal, Piyush, et al.
Published: (2026)
Defenses Against Prompt Attacks Learn Surface Heuristics
by: Li, Shawn, et al.
Published: (2026)
Poster: ClawdGo: Endogenous Security Awareness Training for Autonomous AI Agents
by: Li, Jiaqi, et al.
Published: (2026)
SLEIGHT-Bench: A Benchmark of Evasion Attacks Against Agent Monitors
by: Najt, Elle, et al.
Published: (2026)
Adversarial Attack-Defense Co-Evolution for LLM Safety Alignment via Tree-Group Dual-Aware Search and Optimization
by: Li, Xurui, et al.
Published: (2025)
Adversarial Machine Learning: Attacks, Defenses, and Open Challenges
by: Jha, Pranav K
Published: (2025)
Defending Against Beta Poisoning Attacks in Machine Learning Models
by: Gulciftci, Nilufer, et al.
Published: (2025)
No Free Lunch for Defending Against Prefilling Attack by In-Context Learning
by: Xue, Zhiyu, et al.
Published: (2024)
Energy-Latency Attacks: A New Adversarial Threat to Deep Learning
by: Meftah, Hanene F. Z. Brachemi, et al.
Published: (2025)
Adversarial Attacks on Multimodal Large Language Models: A Comprehensive Survey
by: Jain, Bhavuk, et al.
Published: (2026)
TT-SEAL: TTD-Aware Selective Encryption for Adversarially-Robust and Low-Latency Edge AI
by: Min, Kyeongpil, et al.
Published: (2026)