:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zou, Wei, Liu, Yupei, Wang, Yanting, Chen, Ying, Gong, Neil, Jia, Jinyuan
Format:	Preprint
Published:	2025
Subjects:	Cryptography and Security Machine Learning
Online Access:	https://arxiv.org/abs/2510.14005
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Formalizing and Benchmarking Prompt Injection Attacks and Defenses
by: Liu, Yupei, et al.
Published: (2023)

SecInfer: Preventing Prompt Injection via Inference-time Scaling
by: Liu, Yupei, et al.
Published: (2025)

PromptLocate: Localizing Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2025)

DataSentinel: A Game-Theoretic Detection of Prompt Injection Attacks
by: Liu, Yupei, et al.
Published: (2025)

PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection Defenses
by: Yin, Chenlong, et al.
Published: (2026)

PIArena: A Platform for Prompt Injection Evaluation
by: Geng, Runpeng, et al.
Published: (2026)

PISanitizer: Preventing Prompt Injection to Long-Context LLMs via Prompt Sanitization
by: Geng, Runpeng, et al.
Published: (2025)

A Critical Evaluation of Defenses against Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2025)

TracLLM: A Generic Framework for Attributing Long Context LLMs
by: Wang, Yanting, et al.
Published: (2025)

CleanBase: Detecting Malicious Documents in RAG Knowledge Databases
by: Jin, Weifei, et al.
Published: (2026)

AgentWatcher: A Rule-based Prompt Injection Monitor
by: Wang, Yanting, et al.
Published: (2026)

TrojanDec: Data-free Detection of Trojan Inputs in Self-supervised Learning
by: Liu, Yupei, et al.
Published: (2025)

FlashRT: Towards Computationally and Memory Efficient Red-Teaming for Prompt Injection and Knowledge Corruption
by: Wang, Yanting, et al.
Published: (2026)

AttnTrace: Contextual Attribution of Prompt Injection and Knowledge Corruption
by: Wang, Yanting, et al.
Published: (2025)

Enhancing Prompt Injection Attacks to LLMs via Poisoning Alignment
by: Shao, Zedian, et al.
Published: (2024)

Measuring Real-World Prompt Injection Attacks in LLM-based Resume Screening
by: Zhang, Mohan, et al.
Published: (2026)

Evaluating LLM-based Personal Information Extraction and Countermeasures
by: Liu, Yupei, et al.
Published: (2024)

PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
by: Zou, Wei, et al.
Published: (2024)

ObliInjection: Order-Oblivious Prompt Injection Attack to LLM Agents with Multi-source Data
by: Wang, Reachal, et al.
Published: (2025)

CorruptEncoder: Data Poisoning based Backdoor Attacks to Contrastive Learning
by: Zhang, Jinghuai, et al.
Published: (2022)

Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents
by: Zhan, Qiusi, et al.
Published: (2025)

AlignSentinel: Alignment-Aware Detection of Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2026)

Attention Tracker: Detecting Prompt Injection Attacks in LLMs
by: Hung, Kuo-Han, et al.
Published: (2024)

A Multi-Agent LLM Defense Pipeline Against Prompt Injection Attacks
by: Hossain, S M Asif, et al.
Published: (2025)

FCert: Certifiably Robust Few-Shot Classification in the Era of Foundation Models
by: Wang, Yanting, et al.
Published: (2024)

How Not to Detect Prompt Injections with an LLM
by: Choudhary, Sarthak, et al.
Published: (2025)

Competitive Advantage Attacks to Decentralized Federated Learning
by: Jia, Yuqi, et al.
Published: (2023)

AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
by: Debenedetti, Edoardo, et al.
Published: (2024)

TrojFM: Resource-efficient Backdoor Attacks against Very Large Foundation Models
by: Nie, Yuzhou., et al.
Published: (2024)

MMCert: Provable Defense against Adversarial Attacks to Multi-modal Models
by: Wang, Yanting, et al.
Published: (2024)

Poisoning the Watchtower: Prompt Injection Attacks Against LLM-Augmented Security Operations Through Adversarial Log Content
by: Pandey, Rohan, et al.
Published: (2026)

Prompt Injection Attack to Tool Selection in LLM Agents
by: Shi, Jiawen, et al.
Published: (2025)

GenTel-Safe: A Unified Benchmark and Shielding Framework for Defending Against Prompt Injection Attacks
by: Li, Rongchang, et al.
Published: (2024)

Neural Exec: Learning (and Learning from) Execution Triggers for Prompt Injection Attacks
by: Pasquini, Dario, et al.
Published: (2024)

Design Patterns for Securing LLM Agents against Prompt Injections
by: Beurer-Kellner, Luca, et al.
Published: (2025)

PLeak: Prompt Leaking Attacks against Large Language Model Applications
by: Hui, Bo, et al.
Published: (2024)

Prompt Injection Attacks on Large Language Models in Oncology
by: Clusmann, Jan, et al.
Published: (2024)

Defending Against Indirect Prompt Injection Attacks With Spotlighting
by: Hines, Keegan, et al.
Published: (2024)

Backdoored Retrievers for Prompt Injection Attacks on Retrieval Augmented Generation of Large Language Models
by: Clop, Cody, et al.
Published: (2024)

EnsembleSHAP: Faithful and Certifiably Robust Attribution for Random Subspace Method
by: Wang, Yanting, et al.
Published: (2026)