Saved in:
| Main Authors: | Fogel, Ariel, Hofman, Omer, Cohen, Eilon, Vainshtein, Roman |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.04653 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
When Scanners Lie: Evaluator Instability in LLM Red-Teaming
by: Erez, Lidor, et al.
Published: (2026)
by: Erez, Lidor, et al.
Published: (2026)
Insights and Current Gaps in Open-Source LLM Vulnerability Scanners: A Comparative Analysis
by: Brokman, Jonathan, et al.
Published: (2024)
by: Brokman, Jonathan, et al.
Published: (2024)
Compromising Embodied Agents with Contextual Backdoor Attacks
by: Liu, Aishan, et al.
Published: (2024)
by: Liu, Aishan, et al.
Published: (2024)
Erased but Not Forgotten: How Backdoors Compromise Concept Erasure
by: Braun, Tobias, et al.
Published: (2025)
by: Braun, Tobias, et al.
Published: (2025)
DiffusionHijack: Supply-Chain PRNG Backdoor Attack on Diffusion Models and Quantum Random Number Defense
by: You, Ziyang, et al.
Published: (2026)
by: You, Ziyang, et al.
Published: (2026)
Model Supply Chain Poisoning: Backdooring Pre-trained Models via Embedding Indistinguishability
by: Wang, Hao, et al.
Published: (2024)
by: Wang, Hao, et al.
Published: (2024)
Adversarial Intent is a Latent Variable: Stateful Trust Inference for Securing Multimodal Agentic RAG
by: Singh, Inderjeet, et al.
Published: (2026)
by: Singh, Inderjeet, et al.
Published: (2026)
Malice in Agentland: Down the Rabbit Hole of Backdoors in the AI Supply Chain
by: Boisvert, Léo, et al.
Published: (2025)
by: Boisvert, Léo, et al.
Published: (2025)
BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models
by: Xiang, Zhen, et al.
Published: (2024)
by: Xiang, Zhen, et al.
Published: (2024)
Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification
by: Zhang, Boyang, et al.
Published: (2024)
by: Zhang, Boyang, et al.
Published: (2024)
ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates
by: Jiang, Fengqing, et al.
Published: (2024)
by: Jiang, Fengqing, et al.
Published: (2024)
STAR: Detecting Inference-time Backdoors in LLM Reasoning via State-Transition Amplification Ratio
by: Park, Seong-Gyu, et al.
Published: (2026)
by: Park, Seong-Gyu, et al.
Published: (2026)
MemoryGraft: Persistent Compromise of LLM Agents via Poisoned Experience Retrieval
by: Srivastava, Saksham Sahai, et al.
Published: (2025)
by: Srivastava, Saksham Sahai, et al.
Published: (2025)
DarkMind: Latent Chain-of-Thought Backdoor in Customized LLMs
by: Guo, Zhen, et al.
Published: (2025)
by: Guo, Zhen, et al.
Published: (2025)
Cross-LLM Generalization of Behavioral Backdoor Detection in AI Agent Supply Chains
by: Sanna, Arun Chowdary
Published: (2025)
by: Sanna, Arun Chowdary
Published: (2025)
Detecting Backdoor Attacks via Similarity in Semantic Communication Systems
by: Wei, Ziyang, et al.
Published: (2025)
by: Wei, Ziyang, et al.
Published: (2025)
Machine Learning Models Have a Supply Chain Problem
by: Meiklejohn, Sarah, et al.
Published: (2025)
by: Meiklejohn, Sarah, et al.
Published: (2025)
MRMMIA: Membership Inference Attacks on Memory in Chat Agents
by: Chen, Kai, et al.
Published: (2026)
by: Chen, Kai, et al.
Published: (2026)
Semi-Supervised Supply Chain Fraud Detection with Unsupervised Pre-Filtering
by: Moradi, Fatemeh, et al.
Published: (2025)
by: Moradi, Fatemeh, et al.
Published: (2025)
Detection of Compromised Functions in a Serverless Cloud Environment
by: Lavi, Danielle, et al.
Published: (2024)
by: Lavi, Danielle, et al.
Published: (2024)
Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models
by: Wen, Yuxin, et al.
Published: (2024)
by: Wen, Yuxin, et al.
Published: (2024)
Fake or Compromised? Making Sense of Malicious Clients in Federated Learning
by: Mozaffari, Hamid, et al.
Published: (2024)
by: Mozaffari, Hamid, et al.
Published: (2024)
BoBa: Boosting Backdoor Detection through Data Distribution Inference in Federated Learning
by: Jiang, Zhengyuan, et al.
Published: (2024)
by: Jiang, Zhengyuan, et al.
Published: (2024)
FHE-Agent: Automating CKKS Configuration for Practical Encrypted Inference via an LLM-Guided Agentic Framework
by: Xu, Nuo, et al.
Published: (2025)
by: Xu, Nuo, et al.
Published: (2025)
End-to-End Anti-Backdoor Learning on Images and Time Series
by: Jiang, Yujing, et al.
Published: (2024)
by: Jiang, Yujing, et al.
Published: (2024)
SoK: Reducing the Vulnerability of Fine-tuned Language Models to Membership Inference Attacks
by: Amit, Guy, et al.
Published: (2024)
by: Amit, Guy, et al.
Published: (2024)
Detecting Compromised IoT Devices Using Autoencoders with Sequential Hypothesis Testing
by: Mainuddin, Md, et al.
Published: (2024)
by: Mainuddin, Md, et al.
Published: (2024)
Authority Backdoor: A Certifiable Backdoor Mechanism for Authoring DNNs
by: Yang, Han, et al.
Published: (2025)
by: Yang, Han, et al.
Published: (2025)
BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning
by: Wu, Baoyuan, et al.
Published: (2024)
by: Wu, Baoyuan, et al.
Published: (2024)
BadTemplate: A Training-Free Backdoor Attack via Chat Template Against Large Language Models
by: Wang, Zihan, et al.
Published: (2026)
by: Wang, Zihan, et al.
Published: (2026)
DP-Dueling: Learning from Preference Feedback without Compromising User Privacy
by: Saha, Aadirupa, et al.
Published: (2024)
by: Saha, Aadirupa, et al.
Published: (2024)
Publishing Neural Networks in Drug Discovery Might Compromise Training Data Privacy
by: Krüger, Fabian P., et al.
Published: (2024)
by: Krüger, Fabian P., et al.
Published: (2024)
Backdoor Learning Curves: Explaining Backdoor Poisoning Beyond Influence Functions
by: Cinà, Antonio Emanuele, et al.
Published: (2021)
by: Cinà, Antonio Emanuele, et al.
Published: (2021)
Unelicitable Backdoors in Language Models via Cryptographic Transformer Circuits
by: Draguns, Andis, et al.
Published: (2024)
by: Draguns, Andis, et al.
Published: (2024)
Hardware-Triggered Backdoors
by: Möller, Jonas, et al.
Published: (2026)
by: Möller, Jonas, et al.
Published: (2026)
Whispers in the Machine: Confidentiality in Agentic Systems
by: Evertz, Jonathan, et al.
Published: (2024)
by: Evertz, Jonathan, et al.
Published: (2024)
Hiding Backdoors within Event Sequence Data via Poisoning Attacks
by: Ermilova, Alina, et al.
Published: (2023)
by: Ermilova, Alina, et al.
Published: (2023)
DeDe: Detecting Backdoor Samples for SSL Encoders via Decoders
by: Hou, Sizai, et al.
Published: (2024)
by: Hou, Sizai, et al.
Published: (2024)
A Survey of Learning-Based Intrusion Detection Systems for In-Vehicle Network
by: Althunayyan, Muzun, et al.
Published: (2025)
by: Althunayyan, Muzun, et al.
Published: (2025)
Fusing Pruned and Backdoored Models: Optimal Transport-based Data-free Backdoor Mitigation
by: Lin, Weilin, et al.
Published: (2024)
by: Lin, Weilin, et al.
Published: (2024)
Similar Items
-
When Scanners Lie: Evaluator Instability in LLM Red-Teaming
by: Erez, Lidor, et al.
Published: (2026) -
Insights and Current Gaps in Open-Source LLM Vulnerability Scanners: A Comparative Analysis
by: Brokman, Jonathan, et al.
Published: (2024) -
Compromising Embodied Agents with Contextual Backdoor Attacks
by: Liu, Aishan, et al.
Published: (2024) -
Erased but Not Forgotten: How Backdoors Compromise Concept Erasure
by: Braun, Tobias, et al.
Published: (2025) -
DiffusionHijack: Supply-Chain PRNG Backdoor Attack on Diffusion Models and Quantum Random Number Defense
by: You, Ziyang, et al.
Published: (2026)