Saved in:
| Main Authors: | Zou, Wei, Liu, Yupei, Wang, Yanting, Chen, Ying, Gong, Neil, Jia, Jinyuan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.14005 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Formalizing and Benchmarking Prompt Injection Attacks and Defenses
by: Liu, Yupei, et al.
Published: (2023)
by: Liu, Yupei, et al.
Published: (2023)
SecInfer: Preventing Prompt Injection via Inference-time Scaling
by: Liu, Yupei, et al.
Published: (2025)
by: Liu, Yupei, et al.
Published: (2025)
PromptLocate: Localizing Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2025)
by: Jia, Yuqi, et al.
Published: (2025)
DataSentinel: A Game-Theoretic Detection of Prompt Injection Attacks
by: Liu, Yupei, et al.
Published: (2025)
by: Liu, Yupei, et al.
Published: (2025)
PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection Defenses
by: Yin, Chenlong, et al.
Published: (2026)
by: Yin, Chenlong, et al.
Published: (2026)
PIArena: A Platform for Prompt Injection Evaluation
by: Geng, Runpeng, et al.
Published: (2026)
by: Geng, Runpeng, et al.
Published: (2026)
PISanitizer: Preventing Prompt Injection to Long-Context LLMs via Prompt Sanitization
by: Geng, Runpeng, et al.
Published: (2025)
by: Geng, Runpeng, et al.
Published: (2025)
A Critical Evaluation of Defenses against Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2025)
by: Jia, Yuqi, et al.
Published: (2025)
TracLLM: A Generic Framework for Attributing Long Context LLMs
by: Wang, Yanting, et al.
Published: (2025)
by: Wang, Yanting, et al.
Published: (2025)
CleanBase: Detecting Malicious Documents in RAG Knowledge Databases
by: Jin, Weifei, et al.
Published: (2026)
by: Jin, Weifei, et al.
Published: (2026)
AgentWatcher: A Rule-based Prompt Injection Monitor
by: Wang, Yanting, et al.
Published: (2026)
by: Wang, Yanting, et al.
Published: (2026)
TrojanDec: Data-free Detection of Trojan Inputs in Self-supervised Learning
by: Liu, Yupei, et al.
Published: (2025)
by: Liu, Yupei, et al.
Published: (2025)
FlashRT: Towards Computationally and Memory Efficient Red-Teaming for Prompt Injection and Knowledge Corruption
by: Wang, Yanting, et al.
Published: (2026)
by: Wang, Yanting, et al.
Published: (2026)
AttnTrace: Contextual Attribution of Prompt Injection and Knowledge Corruption
by: Wang, Yanting, et al.
Published: (2025)
by: Wang, Yanting, et al.
Published: (2025)
Enhancing Prompt Injection Attacks to LLMs via Poisoning Alignment
by: Shao, Zedian, et al.
Published: (2024)
by: Shao, Zedian, et al.
Published: (2024)
Measuring Real-World Prompt Injection Attacks in LLM-based Resume Screening
by: Zhang, Mohan, et al.
Published: (2026)
by: Zhang, Mohan, et al.
Published: (2026)
Evaluating LLM-based Personal Information Extraction and Countermeasures
by: Liu, Yupei, et al.
Published: (2024)
by: Liu, Yupei, et al.
Published: (2024)
PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
by: Zou, Wei, et al.
Published: (2024)
by: Zou, Wei, et al.
Published: (2024)
ObliInjection: Order-Oblivious Prompt Injection Attack to LLM Agents with Multi-source Data
by: Wang, Reachal, et al.
Published: (2025)
by: Wang, Reachal, et al.
Published: (2025)
CorruptEncoder: Data Poisoning based Backdoor Attacks to Contrastive Learning
by: Zhang, Jinghuai, et al.
Published: (2022)
by: Zhang, Jinghuai, et al.
Published: (2022)
Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents
by: Zhan, Qiusi, et al.
Published: (2025)
by: Zhan, Qiusi, et al.
Published: (2025)
AlignSentinel: Alignment-Aware Detection of Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2026)
by: Jia, Yuqi, et al.
Published: (2026)
Attention Tracker: Detecting Prompt Injection Attacks in LLMs
by: Hung, Kuo-Han, et al.
Published: (2024)
by: Hung, Kuo-Han, et al.
Published: (2024)
A Multi-Agent LLM Defense Pipeline Against Prompt Injection Attacks
by: Hossain, S M Asif, et al.
Published: (2025)
by: Hossain, S M Asif, et al.
Published: (2025)
FCert: Certifiably Robust Few-Shot Classification in the Era of Foundation Models
by: Wang, Yanting, et al.
Published: (2024)
by: Wang, Yanting, et al.
Published: (2024)
How Not to Detect Prompt Injections with an LLM
by: Choudhary, Sarthak, et al.
Published: (2025)
by: Choudhary, Sarthak, et al.
Published: (2025)
Competitive Advantage Attacks to Decentralized Federated Learning
by: Jia, Yuqi, et al.
Published: (2023)
by: Jia, Yuqi, et al.
Published: (2023)
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
by: Debenedetti, Edoardo, et al.
Published: (2024)
by: Debenedetti, Edoardo, et al.
Published: (2024)
TrojFM: Resource-efficient Backdoor Attacks against Very Large Foundation Models
by: Nie, Yuzhou., et al.
Published: (2024)
by: Nie, Yuzhou., et al.
Published: (2024)
MMCert: Provable Defense against Adversarial Attacks to Multi-modal Models
by: Wang, Yanting, et al.
Published: (2024)
by: Wang, Yanting, et al.
Published: (2024)
Poisoning the Watchtower: Prompt Injection Attacks Against LLM-Augmented Security Operations Through Adversarial Log Content
by: Pandey, Rohan, et al.
Published: (2026)
by: Pandey, Rohan, et al.
Published: (2026)
Prompt Injection Attack to Tool Selection in LLM Agents
by: Shi, Jiawen, et al.
Published: (2025)
by: Shi, Jiawen, et al.
Published: (2025)
GenTel-Safe: A Unified Benchmark and Shielding Framework for Defending Against Prompt Injection Attacks
by: Li, Rongchang, et al.
Published: (2024)
by: Li, Rongchang, et al.
Published: (2024)
Neural Exec: Learning (and Learning from) Execution Triggers for Prompt Injection Attacks
by: Pasquini, Dario, et al.
Published: (2024)
by: Pasquini, Dario, et al.
Published: (2024)
Design Patterns for Securing LLM Agents against Prompt Injections
by: Beurer-Kellner, Luca, et al.
Published: (2025)
by: Beurer-Kellner, Luca, et al.
Published: (2025)
PLeak: Prompt Leaking Attacks against Large Language Model Applications
by: Hui, Bo, et al.
Published: (2024)
by: Hui, Bo, et al.
Published: (2024)
Prompt Injection Attacks on Large Language Models in Oncology
by: Clusmann, Jan, et al.
Published: (2024)
by: Clusmann, Jan, et al.
Published: (2024)
Defending Against Indirect Prompt Injection Attacks With Spotlighting
by: Hines, Keegan, et al.
Published: (2024)
by: Hines, Keegan, et al.
Published: (2024)
Backdoored Retrievers for Prompt Injection Attacks on Retrieval Augmented Generation of Large Language Models
by: Clop, Cody, et al.
Published: (2024)
by: Clop, Cody, et al.
Published: (2024)
EnsembleSHAP: Faithful and Certifiably Robust Attribution for Random Subspace Method
by: Wang, Yanting, et al.
Published: (2026)
by: Wang, Yanting, et al.
Published: (2026)
Similar Items
-
Formalizing and Benchmarking Prompt Injection Attacks and Defenses
by: Liu, Yupei, et al.
Published: (2023) -
SecInfer: Preventing Prompt Injection via Inference-time Scaling
by: Liu, Yupei, et al.
Published: (2025) -
PromptLocate: Localizing Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2025) -
DataSentinel: A Game-Theoretic Detection of Prompt Injection Attacks
by: Liu, Yupei, et al.
Published: (2025) -
PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection Defenses
by: Yin, Chenlong, et al.
Published: (2026)