Saved in:
| Main Authors: | Sun, Desen, Hon, Jason, Wang, Howe, Rajan, Saarth, Xu, Meng, Liu, Sihang |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.10600 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Attacks on Approximate Caches in Text-to-Image Diffusion Models
by: Sun, Desen, et al.
Published: (2025)
by: Sun, Desen, et al.
Published: (2025)
TrojanEdit: Multimodal Backdoor Attack Against Image Editing Model
by: Guo, Ji, et al.
Published: (2024)
by: Guo, Ji, et al.
Published: (2024)
EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models
by: Chen, Ruoxi, et al.
Published: (2023)
by: Chen, Ruoxi, et al.
Published: (2023)
PIDP-Attack: Combining Prompt Injection with Database Poisoning Attacks on Retrieval-Augmented Generation Systems
by: Wang, Haozhen, et al.
Published: (2026)
by: Wang, Haozhen, et al.
Published: (2026)
Embedding Poisoning: Bypassing Safety Alignment via Embedding Semantic Shift
by: Yuan, Shuai, et al.
Published: (2025)
by: Yuan, Shuai, et al.
Published: (2025)
Side-Channel Attacks on Open vSwitch
by: Kim, Daewoo, et al.
Published: (2026)
by: Kim, Daewoo, et al.
Published: (2026)
EditMark: Watermarking Large Language Models based on Model Editing
by: Li, Shuai, et al.
Published: (2025)
by: Li, Shuai, et al.
Published: (2025)
Pandora: Jailbreak GPTs by Retrieval Augmented Generation Poisoning
by: Deng, Gelei, et al.
Published: (2024)
by: Deng, Gelei, et al.
Published: (2024)
Poisoned-MRAG: Knowledge Poisoning Attacks to Multimodal Retrieval Augmented Generation
by: Liu, Yinuo, et al.
Published: (2025)
by: Liu, Yinuo, et al.
Published: (2025)
FedRecAttack: Model Poisoning Attack to Federated Recommendation
by: Rong, Dazhong, et al.
Published: (2022)
by: Rong, Dazhong, et al.
Published: (2022)
Enhancing Prompt Injection Attacks to LLMs via Poisoning Alignment
by: Shao, Zedian, et al.
Published: (2024)
by: Shao, Zedian, et al.
Published: (2024)
Data Poisoning for In-context Learning
by: He, Pengfei, et al.
Published: (2024)
by: He, Pengfei, et al.
Published: (2024)
MIRAGE: Misleading Retrieval-Augmented Generation via Black-box and Query-agnostic Poisoning Attacks
by: Chen, Tailun, et al.
Published: (2025)
by: Chen, Tailun, et al.
Published: (2025)
Deep-Research Agents Can Be Poisoned via User-Generated Content
by: Zhang, Tingwei, et al.
Published: (2026)
by: Zhang, Tingwei, et al.
Published: (2026)
Silent Branding Attack: Trigger-free Data Poisoning Attack on Text-to-Image Diffusion Models
by: Jang, Sangwon, et al.
Published: (2025)
by: Jang, Sangwon, et al.
Published: (2025)
DRIP: Defending Prompt Injection via Token-wise Representation Editing and Residual Instruction Fusion
by: Liu, Ruofan, et al.
Published: (2025)
by: Liu, Ruofan, et al.
Published: (2025)
Mitigating Data Poisoning Attacks to Local Differential Privacy
by: Li, Xiaolin, et al.
Published: (2025)
by: Li, Xiaolin, et al.
Published: (2025)
Sharpness-Aware Data Poisoning Attack
by: He, Pengfei, et al.
Published: (2023)
by: He, Pengfei, et al.
Published: (2023)
Concept-ROT: Poisoning Concepts in Large Language Models with Model Editing
by: Grimes, Keltin, et al.
Published: (2024)
by: Grimes, Keltin, et al.
Published: (2024)
TRUSTDESC: Preventing Tool Poisoning in LLM Applications via Trusted Description Generation
by: Ye, Hengkai, et al.
Published: (2026)
by: Ye, Hengkai, et al.
Published: (2026)
PINA: Prompt Injection Attack against Navigation Agents
by: Liu, Jiani, et al.
Published: (2026)
by: Liu, Jiani, et al.
Published: (2026)
VocBulwark: Towards Practical Generative Speech Watermarking via Additional-Parameter Injection
by: Liu, Weizhi, et al.
Published: (2026)
by: Liu, Weizhi, et al.
Published: (2026)
Model Context Protocol Threat Modeling and Analyzing Vulnerabilities to Prompt Injection with Tool Poisoning
by: Huang, Charoes, et al.
Published: (2026)
by: Huang, Charoes, et al.
Published: (2026)
System Prompt Poisoning: Persistent Attacks on Large Language Models Beyond User Injection
by: Li, Zongze, et al.
Published: (2025)
by: Li, Zongze, et al.
Published: (2025)
LocalAlign: Enabling Generalizable Prompt Injection Defense via Generation of Near-Target Adversarial Examples for Alignment Training
by: Gong, Yuyang, et al.
Published: (2026)
by: Gong, Yuyang, et al.
Published: (2026)
RefineRAG: Word-Level Poisoning Attacks via Retriever-Guided Text Refinement
by: Wang, Ziye, et al.
Published: (2026)
by: Wang, Ziye, et al.
Published: (2026)
A Learning-Based Attack Framework to Break SOTA Poisoning Defenses in Federated Learning
by: Yang, Yuxin, et al.
Published: (2024)
by: Yang, Yuxin, et al.
Published: (2024)
PoisonCatcher: Revealing and Identifying LDP Poisoning Attacks in IIoT
by: Shuai, Lisha, et al.
Published: (2024)
by: Shuai, Lisha, et al.
Published: (2024)
PREE: Towards Harmless and Adaptive Fingerprint Editing in Large Language Models via Knowledge Prefix Enhancement
by: Yue, Xubin, et al.
Published: (2025)
by: Yue, Xubin, et al.
Published: (2025)
ForgetMark: Stealthy Fingerprint Embedding via Targeted Unlearning in Language Models
by: Xu, Zhenhua, et al.
Published: (2026)
by: Xu, Zhenhua, et al.
Published: (2026)
The Art of Hide and Seek: Making Pickle-Based Model Supply Chain Poisoning Stealthy Again
by: Liu, Tong, et al.
Published: (2025)
by: Liu, Tong, et al.
Published: (2025)
Nightshade: Prompt-Specific Poisoning Attacks on Text-to-Image Generative Models
by: Shan, Shawn, et al.
Published: (2023)
by: Shan, Shawn, et al.
Published: (2023)
LDPRecover: Recovering Frequencies from Poisoning Attacks against Local Differential Privacy
by: Sun, Xinyue, et al.
Published: (2024)
by: Sun, Xinyue, et al.
Published: (2024)
On the Feasibility of Poisoning Text-to-Image AI Models via Adversarial Mislabeling
by: Wu, Stanley, et al.
Published: (2025)
by: Wu, Stanley, et al.
Published: (2025)
Disabling Self-Correction in Retrieval-Augmented Generation via Stealthy Retriever Poisoning
by: Dai, Yanbo, et al.
Published: (2025)
by: Dai, Yanbo, et al.
Published: (2025)
The Gradient Puppeteer: Adversarial Domination in Gradient Leakage Attacks through Model Poisoning
by: Xiang, Kunlan, et al.
Published: (2025)
by: Xiang, Kunlan, et al.
Published: (2025)
Invisible Injections: Exploiting Vision-Language Models Through Steganographic Prompt Embedding
by: Pathade, Chetan
Published: (2025)
by: Pathade, Chetan
Published: (2025)
EditLord: Learning Code Transformation Rules for Code Editing
by: Li, Weichen, et al.
Published: (2025)
by: Li, Weichen, et al.
Published: (2025)
Cordon-MAS: Defending RAG against Knowledge Poisoning via Information-Flow Control
by: Yu, Zhe, et al.
Published: (2026)
by: Yu, Zhe, et al.
Published: (2026)
SLICE: Semantic Latent Injection via Compartmentalized Embedding for Image Watermarking
by: Gao, Zheng, et al.
Published: (2026)
by: Gao, Zheng, et al.
Published: (2026)
Similar Items
-
Attacks on Approximate Caches in Text-to-Image Diffusion Models
by: Sun, Desen, et al.
Published: (2025) -
TrojanEdit: Multimodal Backdoor Attack Against Image Editing Model
by: Guo, Ji, et al.
Published: (2024) -
EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models
by: Chen, Ruoxi, et al.
Published: (2023) -
PIDP-Attack: Combining Prompt Injection with Database Poisoning Attacks on Retrieval-Augmented Generation Systems
by: Wang, Haozhen, et al.
Published: (2026) -
Embedding Poisoning: Bypassing Safety Alignment via Embedding Semantic Shift
by: Yuan, Shuai, et al.
Published: (2025)