:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Pandey, Rohan, Bhujang, Archit
Format:	Preprint
Published:	2026
Subjects:	Cryptography and Security Machine Learning
Online Access:	https://arxiv.org/abs/2605.24421
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Adaptive Attacks Break Defenses Against Indirect Prompt Injection Attacks on LLM Agents
by: Zhan, Qiusi, et al.
Published: (2025)

A Multi-Agent LLM Defense Pipeline Against Prompt Injection Attacks
by: Hossain, S M Asif, et al.
Published: (2025)

SecureLearn -- An Attack-agnostic Defense for Multiclass Machine Learning Against Data Poisoning Attacks
by: Paracha, Anum, et al.
Published: (2025)

Secure Retrieval-Augmented Generation against Poisoning Attacks
by: Cheng, Zirui, et al.
Published: (2025)

Defending Against Indirect Prompt Injection Attacks With Spotlighting
by: Hines, Keegan, et al.
Published: (2024)

Enhancing Prompt Injection Attacks to LLMs via Poisoning Alignment
by: Shao, Zedian, et al.
Published: (2024)

Design Patterns for Securing LLM Agents against Prompt Injections
by: Beurer-Kellner, Luca, et al.
Published: (2025)

Adversarial Prompt Evaluation: Systematic Benchmarking of Guardrails Against Prompt Input Attacks on LLMs
by: Zizzo, Giulio, et al.
Published: (2025)

Poisoned-MRAG: Knowledge Poisoning Attacks to Multimodal Retrieval Augmented Generation
by: Liu, Yinuo, et al.
Published: (2025)

PIShield: Detecting Prompt Injection Attacks via Intrinsic LLM Features
by: Zou, Wei, et al.
Published: (2025)

Comments on "Privacy-Enhanced Federated Learning Against Poisoning Adversaries"
by: Schneider, Thomas, et al.
Published: (2024)

Backdoored Retrievers for Prompt Injection Attacks on Retrieval Augmented Generation of Large Language Models
by: Clop, Cody, et al.
Published: (2024)

The Attacker Moves Second: Stronger Adaptive Attacks Bypass Defenses Against Llm Jailbreaks and Prompt Injections
by: Nasr, Milad, et al.
Published: (2025)

GenTel-Safe: A Unified Benchmark and Shielding Framework for Defending Against Prompt Injection Attacks
by: Li, Rongchang, et al.
Published: (2024)

A Systematic Review of Poisoning Attacks Against Large Language Models
by: Fendley, Neil, et al.
Published: (2025)

Securing Large Language Models (LLMs) from Prompt Injection Attacks
by: Suri, Omar Farooq Khan, et al.
Published: (2025)

LogJack: Indirect Prompt Injection Through Cloud Logs Against LLM Debugging Agents
by: Shah, Harsh
Published: (2026)

Exploring Secure Machine Learning Through Payload Injection and FGSM Attacks on ResNet-50
by: Yadav, Umesh, et al.
Published: (2025)

PoisonedParrot: Subtle Data Poisoning Attacks to Elicit Copyright-Infringing Content from Large Language Models
by: Panaitescu-Liess, Michael-Andrei, et al.
Published: (2025)

SleeperNets: Universal Backdoor Poisoning Attacks Against Reinforcement Learning Agents
by: Rathbun, Ethan, et al.
Published: (2024)

Provable Robustness of (Graph) Neural Networks Against Data Poisoning and Backdoor Attacks
by: Gosch, Lukas, et al.
Published: (2024)

Defending Against Sophisticated Poisoning Attacks with RL-based Aggregation in Federated Learning
by: Wang, Yujing, et al.
Published: (2024)

Online Poisoning Attack Against Reinforcement Learning under Black-box Environments
by: Li, Jianhui, et al.
Published: (2024)

AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
by: Debenedetti, Edoardo, et al.
Published: (2024)

SecAlign: Defending Against Prompt Injection with Preference Optimization
by: Chen, Sizhe, et al.
Published: (2024)

Lessons from Defending Gemini Against Indirect Prompt Injections
by: Shi, Chongyang, et al.
Published: (2025)

LeakSealer: A Semisupervised Defense for LLMs Against Prompt Injection and Leakage Attacks
by: Panebianco, Francesco, et al.
Published: (2025)

Traceback of Poisoning Attacks to Retrieval-Augmented Generation
by: Zhang, Baolei, et al.
Published: (2025)

Robustness Against Adversarial Attacks via Learning Confined Adversarial Polytopes
by: Hamidi, Shayan Mohajer, et al.
Published: (2024)

Be Kind, Rewrite: Benign Projections via Rewriting Defend Against LLM Data Poisoning Attacks
by: Halloran, John T., et al.
Published: (2026)

Universal Black-Box Reward Poisoning Attack against Offline Reinforcement Learning
by: Xu, Yinglun, et al.
Published: (2024)

Practical Poisoning Attacks against Retrieval-Augmented Generation
by: Zhang, Baolei, et al.
Published: (2025)

Benchmarking Poisoning Attacks against Retrieval-Augmented Generation
by: Zhang, Baolei, et al.
Published: (2025)

Transferable Availability Poisoning Attacks
by: Liu, Yiyong, et al.
Published: (2023)

Secure Aggregation is Not Private Against Membership Inference Attacks
by: Ngo, Khac-Hoang, et al.
Published: (2024)

PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
by: Zou, Wei, et al.
Published: (2024)

How to Defend Against Large-scale Model Poisoning Attacks in Federated Learning: A Vertical Solution
by: Wang, Jinbo, et al.
Published: (2024)

Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models
by: Xu, Yuancheng, et al.
Published: (2024)

Poison Attacks and Adversarial Prompts Against an Informed University Virtual Assistant
by: Fernandez, Ivan A., et al.
Published: (2024)

Krait: A Backdoor Attack Against Graph Prompt Tuning
by: Song, Ying, et al.
Published: (2024)