Saved in:
| Main Authors: | Draganov, Andrew, Dur, Tolga H., Bhongade, Anandmayi, Phuong, Mary |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.04899 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LLMs Can Covertly Sandbag on Capability Evaluations Against Chain-of-Thought Monitoring
by: Li, Chloe, et al.
Published: (2025)
by: Li, Chloe, et al.
Published: (2025)
Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models
by: Xu, Yuancheng, et al.
Published: (2024)
by: Xu, Yuancheng, et al.
Published: (2024)
How Reliable Are AI Attackers Against a Fixed Vulnerable Target? A 400-Run Empirical Study of LLM Penetration Testing Consistency
by: Erdem, Galip Tolga
Published: (2026)
by: Erdem, Galip Tolga
Published: (2026)
Data Poisoning in Deep Learning: A Survey
by: Zhao, Pinlong, et al.
Published: (2025)
by: Zhao, Pinlong, et al.
Published: (2025)
Have You Poisoned My Data? Defending Neural Networks against Data Poisoning
by: De Gaspari, Fabio, et al.
Published: (2024)
by: De Gaspari, Fabio, et al.
Published: (2024)
Defending Against Beta Poisoning Attacks in Machine Learning Models
by: Gulciftci, Nilufer, et al.
Published: (2025)
by: Gulciftci, Nilufer, et al.
Published: (2025)
Be Kind, Rewrite: Benign Projections via Rewriting Defend Against LLM Data Poisoning Attacks
by: Halloran, John T., et al.
Published: (2026)
by: Halloran, John T., et al.
Published: (2026)
Turning Generative Models Degenerate: The Power of Data Poisoning Attacks
by: Jiang, Shuli, et al.
Published: (2024)
by: Jiang, Shuli, et al.
Published: (2024)
When and Where do Data Poisons Attack Textual Inversion?
by: Styborski, Jeremy, et al.
Published: (2025)
by: Styborski, Jeremy, et al.
Published: (2025)
CBPF: Filtering Poisoned Data Based on Composite Backdoor Attack
by: Xia, Hanfeng, et al.
Published: (2024)
by: Xia, Hanfeng, et al.
Published: (2024)
Through the Stealth Lens: Attention-Aware Defenses Against Poisoning in RAG
by: Choudhary, Sarthak, et al.
Published: (2025)
by: Choudhary, Sarthak, et al.
Published: (2025)
FIDELIS: Blockchain-Enabled Protection Against Poisoning Attacks in Federated Learning
by: Carney, Jane, et al.
Published: (2025)
by: Carney, Jane, et al.
Published: (2025)
PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning
by: Fu, Tingchen, et al.
Published: (2024)
by: Fu, Tingchen, et al.
Published: (2024)
Precision Guided Approach to Mitigate Data Poisoning Attacks in Federated Learning
by: Kumar, K Naveen, et al.
Published: (2024)
by: Kumar, K Naveen, et al.
Published: (2024)
Vulnerabilities in AI Code Generators: Exploring Targeted Data Poisoning Attacks
by: Cotroneo, Domenico, et al.
Published: (2023)
by: Cotroneo, Domenico, et al.
Published: (2023)
Scaling Trends for Data Poisoning in LLMs
by: Bowen, Dillon, et al.
Published: (2024)
by: Bowen, Dillon, et al.
Published: (2024)
Unraveling the Connections between Privacy and Certified Robustness in Federated Learning Against Poisoning Attacks
by: Xie, Chulin, et al.
Published: (2022)
by: Xie, Chulin, et al.
Published: (2022)
Addressing The Devastating Effects Of Single-Task Data Poisoning In Exemplar-Free Continual Learning
by: Pawlak, Stanisław, et al.
Published: (2025)
by: Pawlak, Stanisław, et al.
Published: (2025)
Data Poisoning Vulnerabilities Across Healthcare AI Architectures: A Security Threat Analysis
by: Abtahi, Farhad, et al.
Published: (2025)
by: Abtahi, Farhad, et al.
Published: (2025)
Robustness Analysis of Machine Learning Models for IoT Intrusion Detection Under Data Poisoning Attacks
by: Wulnye, Fortunatus Aabangbio, et al.
Published: (2026)
by: Wulnye, Fortunatus Aabangbio, et al.
Published: (2026)
Building Better Environments for Autonomous Cyber Defence
by: Hicks, Chris, et al.
Published: (2026)
by: Hicks, Chris, et al.
Published: (2026)
Poisoned Acoustics
by: Dahme, Harrison
Published: (2026)
by: Dahme, Harrison
Published: (2026)
SuperLocalMemory: Privacy-Preserving Multi-Agent Memory with Bayesian Trust Defense Against Memory Poisoning
by: Bhardwaj, Varun Pratap
Published: (2026)
by: Bhardwaj, Varun Pratap
Published: (2026)
Data Poisoning Attacks on Off-Policy Policy Evaluation Methods
by: Lobo, Elita, et al.
Published: (2024)
by: Lobo, Elita, et al.
Published: (2024)
The Stronger the Diffusion Model, the Easier the Backdoor: Data Poisoning to Induce Copyright Breaches Without Adjusting Finetuning Pipeline
by: Wang, Haonan, et al.
Published: (2024)
by: Wang, Haonan, et al.
Published: (2024)
PID: Prompt-Independent Data Protection Against Latent Diffusion Models
by: Li, Ang, et al.
Published: (2024)
by: Li, Ang, et al.
Published: (2024)
Data to Defense: The Role of Curation in Customizing LLMs Against Jailbreaking Attacks
by: Liu, Xiaoqun, et al.
Published: (2024)
by: Liu, Xiaoqun, et al.
Published: (2024)
Supply-Chain Poisoning Attacks Against LLM Coding Agent Skill Ecosystems
by: Qu, Yubin, et al.
Published: (2026)
by: Qu, Yubin, et al.
Published: (2026)
Defending Against Weight-Poisoning Backdoor Attacks for Parameter-Efficient Fine-Tuning
by: Zhao, Shuai, et al.
Published: (2024)
by: Zhao, Shuai, et al.
Published: (2024)
Poison Once, Exploit Forever: Environment-Injected Memory Poisoning Attacks on Web Agents
by: Zou, Wei, et al.
Published: (2026)
by: Zou, Wei, et al.
Published: (2026)
FedNIA: Noise-Induced Activation Analysis for Mitigating Data Poisoning in FL
by: Hallaji, Ehsan, et al.
Published: (2025)
by: Hallaji, Ehsan, et al.
Published: (2025)
Concealing Backdoor Model Updates in Federated Learning by Trigger-Optimized Data Poisoning
by: Zhang, Yujie, et al.
Published: (2024)
by: Zhang, Yujie, et al.
Published: (2024)
Reasoning-Style Poisoning of LLM Agents via Stealthy Style Transfer: Process-Level Attacks and Runtime Monitoring in RSV Space
by: Zhou, Xingfu, et al.
Published: (2025)
by: Zhou, Xingfu, et al.
Published: (2025)
Persistent Pre-Training Poisoning of LLMs
by: Zhang, Yiming, et al.
Published: (2024)
by: Zhang, Yiming, et al.
Published: (2024)
On The Dangers of Poisoned LLMs In Security Automation
by: Karlsen, Patrick, et al.
Published: (2025)
by: Karlsen, Patrick, et al.
Published: (2025)
Virus Infection Attack on LLMs: Your Poisoning Can Spread "VIA" Synthetic Data
by: Liang, Zi, et al.
Published: (2025)
by: Liang, Zi, et al.
Published: (2025)
SAFELOC: Overcoming Data Poisoning Attacks in Heterogeneous Federated Machine Learning for Indoor Localization
by: Singampalli, Akhil, et al.
Published: (2024)
by: Singampalli, Akhil, et al.
Published: (2024)
Machine Unlearning Fails to Remove Data Poisoning Attacks
by: Pawelczyk, Martin, et al.
Published: (2024)
by: Pawelczyk, Martin, et al.
Published: (2024)
CSC: Turning the Adversary's Poison against Itself
by: Shi, Yuchen, et al.
Published: (2026)
by: Shi, Yuchen, et al.
Published: (2026)
Detecting Data Poisoning in Code Generation LLMs via Black-Box, Vulnerability-Oriented Scanning
by: Yan, Shenao, et al.
Published: (2026)
by: Yan, Shenao, et al.
Published: (2026)
Similar Items
-
LLMs Can Covertly Sandbag on Capability Evaluations Against Chain-of-Thought Monitoring
by: Li, Chloe, et al.
Published: (2025) -
Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models
by: Xu, Yuancheng, et al.
Published: (2024) -
How Reliable Are AI Attackers Against a Fixed Vulnerable Target? A 400-Run Empirical Study of LLM Penetration Testing Consistency
by: Erdem, Galip Tolga
Published: (2026) -
Data Poisoning in Deep Learning: A Survey
by: Zhao, Pinlong, et al.
Published: (2025) -
Have You Poisoned My Data? Defending Neural Networks against Data Poisoning
by: De Gaspari, Fabio, et al.
Published: (2024)