Saved in:
| Main Authors: | Guo, Xuyang, Huang, Zekai, Song, Zhao, Zhang, Jiahao |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.13214 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
PromptArmor: Simple yet Effective Prompt Injection Defenses
by: Shi, Tianneng, et al.
Published: (2025)
by: Shi, Tianneng, et al.
Published: (2025)
PROMPTFUZZ: Harnessing Fuzzing Techniques for Robust Testing of Prompt Injection in LLMs
by: Yu, Jiahao, et al.
Published: (2024)
by: Yu, Jiahao, et al.
Published: (2024)
Fooling the Watchers: Breaking AIGC Detectors via Semantic Prompt Attacks
by: Hao, Run, et al.
Published: (2025)
by: Hao, Run, et al.
Published: (2025)
Assessing Prompt Injection Risks in 200+ Custom GPTs
by: Yu, Jiahao, et al.
Published: (2023)
by: Yu, Jiahao, et al.
Published: (2023)
Analysis of LLMs Against Prompt Injection and Jailbreak Attacks
by: Jaiswal, Piyush, et al.
Published: (2026)
by: Jaiswal, Piyush, et al.
Published: (2026)
SequentialBreak: Large Language Models Can be Fooled by Embedding Jailbreak Prompts into Sequential Prompt Chains
by: Saiem, Bijoy Ahmed, et al.
Published: (2024)
by: Saiem, Bijoy Ahmed, et al.
Published: (2024)
Multimodal Prompt Injection Attacks: Risks and Defenses for Modern LLMs
by: Yeo, Andrew, et al.
Published: (2025)
by: Yeo, Andrew, et al.
Published: (2025)
Dagger Behind Smile: Fool LLMs with a Happy Ending Story
by: Song, Xurui, et al.
Published: (2025)
by: Song, Xurui, et al.
Published: (2025)
AdapTools: Adaptive Tool-based Indirect Prompt Injection Attacks on Agentic LLMs
by: Wang, Che, et al.
Published: (2026)
by: Wang, Che, et al.
Published: (2026)
SnapGuard: Lightweight Prompt Injection Detection for Screenshot-Based Web Agents
by: Du, Mengyao, et al.
Published: (2026)
by: Du, Mengyao, et al.
Published: (2026)
ShadowCode: Towards (Automatic) External Prompt Injection Attack against Code LLMs
by: Yang, Yuchen, et al.
Published: (2024)
by: Yang, Yuchen, et al.
Published: (2024)
DRIP: Defending Prompt Injection via Token-wise Representation Editing and Residual Instruction Fusion
by: Liu, Ruofan, et al.
Published: (2025)
by: Liu, Ruofan, et al.
Published: (2025)
AgentVigil: Generic Black-Box Red-teaming for Indirect Prompt Injection against LLM Agents
by: Wang, Zhun, et al.
Published: (2025)
by: Wang, Zhun, et al.
Published: (2025)
Bypassing Prompt Injection Detectors through Evasive Injections
by: Rahman, Md Jahedur, et al.
Published: (2026)
by: Rahman, Md Jahedur, et al.
Published: (2026)
PromptLocate: Localizing Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2025)
by: Jia, Yuqi, et al.
Published: (2025)
Defeating Prompt Injections by Design
by: Debenedetti, Edoardo, et al.
Published: (2025)
by: Debenedetti, Edoardo, et al.
Published: (2025)
Decoding Latent Attack Surfaces in LLMs: Prompt Injection via HTML in Web Summarization
by: Verma, Ishaan, et al.
Published: (2025)
by: Verma, Ishaan, et al.
Published: (2025)
Breaking the Protocol: Security Analysis of the Model Context Protocol Specification and Prompt Injection Vulnerabilities in Tool-Integrated LLM Agents
by: Maloyan, Narek, et al.
Published: (2026)
by: Maloyan, Narek, et al.
Published: (2026)
Attention Tracker: Detecting Prompt Injection Attacks in LLMs
by: Hung, Kuo-Han, et al.
Published: (2024)
by: Hung, Kuo-Han, et al.
Published: (2024)
To Protect the LLM Agent Against the Prompt Injection Attack with Polymorphic Prompt
by: Wang, Zhilong, et al.
Published: (2025)
by: Wang, Zhilong, et al.
Published: (2025)
A Critical Evaluation of Defenses against Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2025)
by: Jia, Yuqi, et al.
Published: (2025)
DataSentinel: A Game-Theoretic Detection of Prompt Injection Attacks
by: Liu, Yupei, et al.
Published: (2025)
by: Liu, Yupei, et al.
Published: (2025)
ICON: Indirect Prompt Injection Defense for Agents based on Inference-Time Correction
by: Wang, Che, et al.
Published: (2026)
by: Wang, Che, et al.
Published: (2026)
WASP: Benchmarking Web Agent Security Against Prompt Injection Attacks
by: Evtimov, Ivan, et al.
Published: (2025)
by: Evtimov, Ivan, et al.
Published: (2025)
Optimization-based Prompt Injection Attack to LLM-as-a-Judge
by: Shi, Jiawen, et al.
Published: (2024)
by: Shi, Jiawen, et al.
Published: (2024)
System Prompt Poisoning: Persistent Attacks on Large Language Models Beyond User Injection
by: Li, Zongze, et al.
Published: (2025)
by: Li, Zongze, et al.
Published: (2025)
LivePI: More Realistic Benchmarking of Agents Against Indirect Prompt Injection
by: Zhao, Lei, et al.
Published: (2026)
by: Zhao, Lei, et al.
Published: (2026)
MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
by: Zhu, Kaijie, et al.
Published: (2025)
by: Zhu, Kaijie, et al.
Published: (2025)
The Defense Trilemma: Why Prompt Injection Defense Wrappers Fail?
by: Bhatt, Manish, et al.
Published: (2026)
by: Bhatt, Manish, et al.
Published: (2026)
Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks
by: Chen, Sizhe, et al.
Published: (2025)
by: Chen, Sizhe, et al.
Published: (2025)
Know Thy Enemy: Securing LLMs Against Prompt Injection via Diverse Data Synthesis and Instruction-Level Chain-of-Thought Learning
by: Chang, Zhiyuan, et al.
Published: (2026)
by: Chang, Zhiyuan, et al.
Published: (2026)
Enhancing Jailbreak Attacks on LLMs via Persona Prompts
by: Zhang, Zheng, et al.
Published: (2025)
by: Zhang, Zheng, et al.
Published: (2025)
ClawGuard: A Runtime Security Framework for Tool-Augmented LLM Agents Against Indirect Prompt Injection
by: Zhao, Wei, et al.
Published: (2026)
by: Zhao, Wei, et al.
Published: (2026)
QueryIPI: Query-agnostic Indirect Prompt Injection on Coding Agents
by: Xie, Yuchong, et al.
Published: (2025)
by: Xie, Yuchong, et al.
Published: (2025)
Prompt Injection 2.0: Hybrid AI Threats
by: McHugh, Jeremy, et al.
Published: (2025)
by: McHugh, Jeremy, et al.
Published: (2025)
Securing AI Agents Against Prompt Injection Attacks
by: Ramakrishnan, Badrinath, et al.
Published: (2025)
by: Ramakrishnan, Badrinath, et al.
Published: (2025)
Defending against Indirect Prompt Injection by Instruction Detection
by: Wen, Tongyu, et al.
Published: (2025)
by: Wen, Tongyu, et al.
Published: (2025)
Evaluation of Prompt Injection Defenses in Large Language Models
by: Deep, Priyal, et al.
Published: (2026)
by: Deep, Priyal, et al.
Published: (2026)
Soft Begging: Modular and Efficient Shielding of LLMs against Prompt Injection and Jailbreaking based on Prompt Tuning
by: Ostermann, Simon, et al.
Published: (2024)
by: Ostermann, Simon, et al.
Published: (2024)
PIDP-Attack: Combining Prompt Injection with Database Poisoning Attacks on Retrieval-Augmented Generation Systems
by: Wang, Haozhen, et al.
Published: (2026)
by: Wang, Haozhen, et al.
Published: (2026)
Similar Items
-
PromptArmor: Simple yet Effective Prompt Injection Defenses
by: Shi, Tianneng, et al.
Published: (2025) -
PROMPTFUZZ: Harnessing Fuzzing Techniques for Robust Testing of Prompt Injection in LLMs
by: Yu, Jiahao, et al.
Published: (2024) -
Fooling the Watchers: Breaking AIGC Detectors via Semantic Prompt Attacks
by: Hao, Run, et al.
Published: (2025) -
Assessing Prompt Injection Risks in 200+ Custom GPTs
by: Yu, Jiahao, et al.
Published: (2023) -
Analysis of LLMs Against Prompt Injection and Jailbreak Attacks
by: Jaiswal, Piyush, et al.
Published: (2026)