Saved in:
| Main Authors: | Deep, Priyal, Emmons, Shane, Fox, Amy, Bacon, Kyle, McAllister, Kelley, Ortiz, Peter, Flautner, Krisztian |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.23887 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
A Critical Evaluation of Defenses against Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2025)
by: Jia, Yuqi, et al.
Published: (2025)
The Defense Trilemma: Why Prompt Injection Defense Wrappers Fail?
by: Bhatt, Manish, et al.
Published: (2026)
by: Bhatt, Manish, et al.
Published: (2026)
System Prompt Extraction Attacks and Defenses in Large Language Models
by: Das, Badhan Chandra, et al.
Published: (2025)
by: Das, Badhan Chandra, et al.
Published: (2025)
Prompt Injection as an Emerging Threat: Evaluating the Resilience of Large Language Models
by: Ganiuly, Daniyal, et al.
Published: (2025)
by: Ganiuly, Daniyal, et al.
Published: (2025)
Backdoor-Powered Prompt Injection Attacks Nullify Defense Methods
by: Chen, Yulin, et al.
Published: (2025)
by: Chen, Yulin, et al.
Published: (2025)
Defending Against Prompt Injection With a Few DefensiveTokens
by: Chen, Sizhe, et al.
Published: (2025)
by: Chen, Sizhe, et al.
Published: (2025)
Defense Against Prompt Injection Attack by Leveraging Attack Techniques
by: Chen, Yulin, et al.
Published: (2024)
by: Chen, Yulin, et al.
Published: (2024)
PromptArmor: Simple yet Effective Prompt Injection Defenses
by: Shi, Tianneng, et al.
Published: (2025)
by: Shi, Tianneng, et al.
Published: (2025)
A Novel Evaluation Framework for Assessing Resilience Against Prompt Injection Attacks in Large Language Models
by: Yip, Daniel Wankit, et al.
Published: (2024)
by: Yip, Daniel Wankit, et al.
Published: (2024)
Securing Large Language Models (LLMs) from Prompt Injection Attacks
by: Suri, Omar Farooq Khan, et al.
Published: (2025)
by: Suri, Omar Farooq Khan, et al.
Published: (2025)
Multimodal Prompt Injection Attacks: Risks and Defenses for Modern LLMs
by: Yeo, Andrew, et al.
Published: (2025)
by: Yeo, Andrew, et al.
Published: (2025)
AegisAgent: An Autonomous Defense Agent Against Prompt Injection Attacks in LLM-HARs
by: Wang, Yihan, et al.
Published: (2025)
by: Wang, Yihan, et al.
Published: (2025)
Prompt Control-Flow Integrity: A Priority-Aware Runtime Defense Against Prompt Injection in LLM Systems
by: Alam, Md Takrim Ul, et al.
Published: (2026)
by: Alam, Md Takrim Ul, et al.
Published: (2026)
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
by: Debenedetti, Edoardo, et al.
Published: (2024)
by: Debenedetti, Edoardo, et al.
Published: (2024)
System-Level Defense against Indirect Prompt Injection Attacks: An Information Flow Control Perspective
by: Wu, Fangzhou, et al.
Published: (2024)
by: Wu, Fangzhou, et al.
Published: (2024)
PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection Defenses
by: Yin, Chenlong, et al.
Published: (2026)
by: Yin, Chenlong, et al.
Published: (2026)
WARD: Adversarially Robust Defense of Web Agents Against Prompt Injections
by: Cao, Tri, et al.
Published: (2026)
by: Cao, Tri, et al.
Published: (2026)
Prompt Injection Attacks on Large Language Models in Oncology
by: Clusmann, Jan, et al.
Published: (2024)
by: Clusmann, Jan, et al.
Published: (2024)
Jatmo: Prompt Injection Defense by Task-Specific Finetuning
by: Piet, Julien, et al.
Published: (2023)
by: Piet, Julien, et al.
Published: (2023)
Defenses at Odds: Measuring and Explaining Defense Conflicts in Large Language Models
by: Meng, Xiangtao, et al.
Published: (2026)
by: Meng, Xiangtao, et al.
Published: (2026)
ESLD (External Surrogate Latent Defense): A Latent-Space Architecture for Faster, Stronger Prompt-Injection Defense
by: Narendra, Yash
Published: (2026)
by: Narendra, Yash
Published: (2026)
MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
by: Zhu, Kaijie, et al.
Published: (2025)
by: Zhu, Kaijie, et al.
Published: (2025)
A Multi-Agent LLM Defense Pipeline Against Prompt Injection Attacks
by: Hossain, S M Asif, et al.
Published: (2025)
by: Hossain, S M Asif, et al.
Published: (2025)
ICON: Indirect Prompt Injection Defense for Agents based on Inference-Time Correction
by: Wang, Che, et al.
Published: (2026)
by: Wang, Che, et al.
Published: (2026)
Formalizing and Benchmarking Prompt Injection Attacks and Defenses
by: Liu, Yupei, et al.
Published: (2023)
by: Liu, Yupei, et al.
Published: (2023)
An Early Categorization of Prompt Injection Attacks on Large Language Models
by: Rossi, Sippo, et al.
Published: (2024)
by: Rossi, Sippo, et al.
Published: (2024)
AttackEval: A Systematic Empirical Study of Prompt Injection Attack Effectiveness Against Large Language Models
by: Wang, Jackson
Published: (2026)
by: Wang, Jackson
Published: (2026)
Misleading Large Language Models used (or misused) in Scientific Peer-Reviewing via Hidden Prompt-Injection Attacks
by: Collu, Matteo Gioele, et al.
Published: (2025)
by: Collu, Matteo Gioele, et al.
Published: (2025)
Evaluating Prompt Injection Defenses for Educational LLM Tutors: Security-Usability-Latency Trade-offs
by: Maiorano, Alexandre Cristovão
Published: (2026)
by: Maiorano, Alexandre Cristovão
Published: (2026)
Invisible Injections: Exploiting Vision-Language Models Through Steganographic Prompt Embedding
by: Pathade, Chetan
Published: (2025)
by: Pathade, Chetan
Published: (2025)
System Prompt Poisoning: Persistent Attacks on Large Language Models Beyond User Injection
by: Li, Zongze, et al.
Published: (2025)
by: Li, Zongze, et al.
Published: (2025)
Backdoored Retrievers for Prompt Injection Attacks on Retrieval Augmented Generation of Large Language Models
by: Clop, Cody, et al.
Published: (2024)
by: Clop, Cody, et al.
Published: (2024)
FATH: Authentication-based Test-time Defense against Indirect Prompt Injection Attacks
by: Wang, Jiongxiao, et al.
Published: (2024)
by: Wang, Jiongxiao, et al.
Published: (2024)
Agent Privilege Separation in OpenClaw: A Structural Defense Against Prompt Injection
by: Cheng, Darren, et al.
Published: (2026)
by: Cheng, Darren, et al.
Published: (2026)
Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents
by: Zhan, Qiusi, et al.
Published: (2025)
by: Zhan, Qiusi, et al.
Published: (2025)
LocalAlign: Enabling Generalizable Prompt Injection Defense via Generation of Near-Target Adversarial Examples for Alignment Training
by: Gong, Yuyang, et al.
Published: (2026)
by: Gong, Yuyang, et al.
Published: (2026)
Dynamic Probabilistic Noise Injection for Membership Inference Defense
by: Forough, Javad, et al.
Published: (2025)
by: Forough, Javad, et al.
Published: (2025)
Goal-guided Generative Prompt Injection Attack on Large Language Models
by: Zhang, Chong, et al.
Published: (2024)
by: Zhang, Chong, et al.
Published: (2024)
Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection
by: Yan, Jun, et al.
Published: (2023)
by: Yan, Jun, et al.
Published: (2023)
Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models
by: Xiong, Junjie, et al.
Published: (2025)
by: Xiong, Junjie, et al.
Published: (2025)
Similar Items
-
A Critical Evaluation of Defenses against Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2025) -
The Defense Trilemma: Why Prompt Injection Defense Wrappers Fail?
by: Bhatt, Manish, et al.
Published: (2026) -
System Prompt Extraction Attacks and Defenses in Large Language Models
by: Das, Badhan Chandra, et al.
Published: (2025) -
Prompt Injection as an Emerging Threat: Evaluating the Resilience of Large Language Models
by: Ganiuly, Daniyal, et al.
Published: (2025) -
Backdoor-Powered Prompt Injection Attacks Nullify Defense Methods
by: Chen, Yulin, et al.
Published: (2025)