Saved in:
| Main Author: | Corll, J Alex |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.11875 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Peak + Accumulation: A Proxy-Level Scoring Formula for Multi-Turn LLM Attack Detection
by: Corll, J Alex
Published: (2026)
by: Corll, J Alex
Published: (2026)
Defeating Prompt Injections by Design
by: Debenedetti, Edoardo, et al.
Published: (2025)
by: Debenedetti, Edoardo, et al.
Published: (2025)
DataSentinel: A Game-Theoretic Detection of Prompt Injection Attacks
by: Liu, Yupei, et al.
Published: (2025)
by: Liu, Yupei, et al.
Published: (2025)
Defending against Indirect Prompt Injection by Instruction Detection
by: Wen, Tongyu, et al.
Published: (2025)
by: Wen, Tongyu, et al.
Published: (2025)
SecInfer: Preventing Prompt Injection via Inference-time Scaling
by: Liu, Yupei, et al.
Published: (2025)
by: Liu, Yupei, et al.
Published: (2025)
Unified Threat Detection and Mitigation Framework (UTDMF): Combating Prompt Injection, Deception, and Bias in Enterprise-Scale Transformers
by: KumarRavindran, Santhosh
Published: (2025)
by: KumarRavindran, Santhosh
Published: (2025)
Bypassing Prompt Injection Detectors through Evasive Injections
by: Rahman, Md Jahedur, et al.
Published: (2026)
by: Rahman, Md Jahedur, et al.
Published: (2026)
PromptLocate: Localizing Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2025)
by: Jia, Yuqi, et al.
Published: (2025)
How Not to Detect Prompt Injections with an LLM
by: Choudhary, Sarthak, et al.
Published: (2025)
by: Choudhary, Sarthak, et al.
Published: (2025)
Evaluation of Prompt Injection Defenses in Large Language Models
by: Deep, Priyal, et al.
Published: (2026)
by: Deep, Priyal, et al.
Published: (2026)
SnapGuard: Lightweight Prompt Injection Detection for Screenshot-Based Web Agents
by: Du, Mengyao, et al.
Published: (2026)
by: Du, Mengyao, et al.
Published: (2026)
PromptArmor: Simple yet Effective Prompt Injection Defenses
by: Shi, Tianneng, et al.
Published: (2025)
by: Shi, Tianneng, et al.
Published: (2025)
IPI-proxy: An Intercepting Proxy for Red-Teaming Web-Browsing AI Agents Against Indirect Prompt Injection
by: Chia-Pei, et al.
Published: (2026)
by: Chia-Pei, et al.
Published: (2026)
To Protect the LLM Agent Against the Prompt Injection Attack with Polymorphic Prompt
by: Wang, Zhilong, et al.
Published: (2025)
by: Wang, Zhilong, et al.
Published: (2025)
Prompt Injection as an Emerging Threat: Evaluating the Resilience of Large Language Models
by: Ganiuly, Daniyal, et al.
Published: (2025)
by: Ganiuly, Daniyal, et al.
Published: (2025)
Attention Tracker: Detecting Prompt Injection Attacks in LLMs
by: Hung, Kuo-Han, et al.
Published: (2024)
by: Hung, Kuo-Han, et al.
Published: (2024)
CASCADE: A Cascaded Hybrid Defense Architecture for Prompt Injection Detection in MCP-Based Systems
by: Turgut, İpek Abasıkeleş, et al.
Published: (2026)
by: Turgut, İpek Abasıkeleş, et al.
Published: (2026)
F2A: An Innovative Approach for Prompt Injection by Utilizing Feign Security Detection Agents
by: Ren, Yupeng
Published: (2024)
by: Ren, Yupeng
Published: (2024)
How Vulnerable Are AI Agents to Indirect Prompt Injections? Insights from a Large-Scale Public Competition
by: Dziemian, Mateusz, et al.
Published: (2026)
by: Dziemian, Mateusz, et al.
Published: (2026)
Analysis of LLMs Against Prompt Injection and Jailbreak Attacks
by: Jaiswal, Piyush, et al.
Published: (2026)
by: Jaiswal, Piyush, et al.
Published: (2026)
Prompt Injection 2.0: Hybrid AI Threats
by: McHugh, Jeremy, et al.
Published: (2025)
by: McHugh, Jeremy, et al.
Published: (2025)
Assessing Prompt Injection Risks in 200+ Custom GPTs
by: Yu, Jiahao, et al.
Published: (2023)
by: Yu, Jiahao, et al.
Published: (2023)
Securing AI Agents Against Prompt Injection Attacks
by: Ramakrishnan, Badrinath, et al.
Published: (2025)
by: Ramakrishnan, Badrinath, et al.
Published: (2025)
Prompt Injection Attacks on Large Language Models in Oncology
by: Clusmann, Jan, et al.
Published: (2024)
by: Clusmann, Jan, et al.
Published: (2024)
WAInjectBench: Benchmarking Prompt Injection Detections for Web Agents
by: Liu, Yinuo, et al.
Published: (2025)
by: Liu, Yinuo, et al.
Published: (2025)
System Prompt Poisoning: Persistent Attacks on Large Language Models Beyond User Injection
by: Li, Zongze, et al.
Published: (2025)
by: Li, Zongze, et al.
Published: (2025)
The Defense Trilemma: Why Prompt Injection Defense Wrappers Fail?
by: Bhatt, Manish, et al.
Published: (2026)
by: Bhatt, Manish, et al.
Published: (2026)
Multimodal Prompt Injection Attacks: Risks and Defenses for Modern LLMs
by: Yeo, Andrew, et al.
Published: (2025)
by: Yeo, Andrew, et al.
Published: (2025)
A Critical Evaluation of Defenses against Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2025)
by: Jia, Yuqi, et al.
Published: (2025)
Optimization-based Prompt Injection Attack to LLM-as-a-Judge
by: Shi, Jiawen, et al.
Published: (2024)
by: Shi, Jiawen, et al.
Published: (2024)
CourtGuard: A Local, Multiagent Prompt Injection Classifier
by: Wu, Isaac, et al.
Published: (2025)
by: Wu, Isaac, et al.
Published: (2025)
LLMail-Inject: A Dataset from a Realistic Adaptive Prompt Injection Challenge
by: Abdelnabi, Sahar, et al.
Published: (2025)
by: Abdelnabi, Sahar, et al.
Published: (2025)
Prompt Injection as Role Confusion
by: Ye, Charles, et al.
Published: (2026)
by: Ye, Charles, et al.
Published: (2026)
Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models
by: Xiong, Junjie, et al.
Published: (2025)
by: Xiong, Junjie, et al.
Published: (2025)
WebSentinel: Detecting and Localizing Prompt Injection Attacks for Web Agents
by: Wang, Xilong, et al.
Published: (2026)
by: Wang, Xilong, et al.
Published: (2026)
Adversarial Reinforcement Learning for Detecting False Data Injection Attacks in Vehicular Routing
by: Eghtesad, Taha, et al.
Published: (2026)
by: Eghtesad, Taha, et al.
Published: (2026)
Signed-Prompt: A New Approach to Prevent Prompt Injection Attacks Against LLM-Integrated Applications
by: Suo, Xuchen
Published: (2024)
by: Suo, Xuchen
Published: (2024)
Enhancing SQL Injection Detection and Prevention Using Generative Models
by: Dasari, Naga Sai, et al.
Published: (2025)
by: Dasari, Naga Sai, et al.
Published: (2025)
Overcoming the Retrieval Barrier: Indirect Prompt Injection in the Wild for LLM Systems
by: Chang, Hongyan, et al.
Published: (2026)
by: Chang, Hongyan, et al.
Published: (2026)
WARD: Adversarially Robust Defense of Web Agents Against Prompt Injections
by: Cao, Tri, et al.
Published: (2026)
by: Cao, Tri, et al.
Published: (2026)
Similar Items
-
Peak + Accumulation: A Proxy-Level Scoring Formula for Multi-Turn LLM Attack Detection
by: Corll, J Alex
Published: (2026) -
Defeating Prompt Injections by Design
by: Debenedetti, Edoardo, et al.
Published: (2025) -
DataSentinel: A Game-Theoretic Detection of Prompt Injection Attacks
by: Liu, Yupei, et al.
Published: (2025) -
Defending against Indirect Prompt Injection by Instruction Detection
by: Wen, Tongyu, et al.
Published: (2025) -
SecInfer: Preventing Prompt Injection via Inference-time Scaling
by: Liu, Yupei, et al.
Published: (2025)