:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Fogel, Ariel, Hofman, Omer, Cohen, Eilon, Vainshtein, Roman
Format:	Preprint
Published:	2026
Subjects:	Cryptography and Security Machine Learning
Online Access:	https://arxiv.org/abs/2602.04653
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

When Scanners Lie: Evaluator Instability in LLM Red-Teaming
by: Erez, Lidor, et al.
Published: (2026)

Insights and Current Gaps in Open-Source LLM Vulnerability Scanners: A Comparative Analysis
by: Brokman, Jonathan, et al.
Published: (2024)

Compromising Embodied Agents with Contextual Backdoor Attacks
by: Liu, Aishan, et al.
Published: (2024)

Erased but Not Forgotten: How Backdoors Compromise Concept Erasure
by: Braun, Tobias, et al.
Published: (2025)

DiffusionHijack: Supply-Chain PRNG Backdoor Attack on Diffusion Models and Quantum Random Number Defense
by: You, Ziyang, et al.
Published: (2026)

Model Supply Chain Poisoning: Backdooring Pre-trained Models via Embedding Indistinguishability
by: Wang, Hao, et al.
Published: (2024)

Adversarial Intent is a Latent Variable: Stateful Trust Inference for Securing Multimodal Agentic RAG
by: Singh, Inderjeet, et al.
Published: (2026)

Malice in Agentland: Down the Rabbit Hole of Backdoors in the AI Supply Chain
by: Boisvert, Léo, et al.
Published: (2025)

BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models
by: Xiang, Zhen, et al.
Published: (2024)

Breaking Agents: Compromising Autonomous LLM Agents Through Malfunction Amplification
by: Zhang, Boyang, et al.
Published: (2024)

ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates
by: Jiang, Fengqing, et al.
Published: (2024)

STAR: Detecting Inference-time Backdoors in LLM Reasoning via State-Transition Amplification Ratio
by: Park, Seong-Gyu, et al.
Published: (2026)

MemoryGraft: Persistent Compromise of LLM Agents via Poisoned Experience Retrieval
by: Srivastava, Saksham Sahai, et al.
Published: (2025)

DarkMind: Latent Chain-of-Thought Backdoor in Customized LLMs
by: Guo, Zhen, et al.
Published: (2025)

Cross-LLM Generalization of Behavioral Backdoor Detection in AI Agent Supply Chains
by: Sanna, Arun Chowdary
Published: (2025)

Detecting Backdoor Attacks via Similarity in Semantic Communication Systems
by: Wei, Ziyang, et al.
Published: (2025)

Machine Learning Models Have a Supply Chain Problem
by: Meiklejohn, Sarah, et al.
Published: (2025)

MRMMIA: Membership Inference Attacks on Memory in Chat Agents
by: Chen, Kai, et al.
Published: (2026)

Semi-Supervised Supply Chain Fraud Detection with Unsupervised Pre-Filtering
by: Moradi, Fatemeh, et al.
Published: (2025)

Detection of Compromised Functions in a Serverless Cloud Environment
by: Lavi, Danielle, et al.
Published: (2024)

Privacy Backdoors: Enhancing Membership Inference through Poisoning Pre-trained Models
by: Wen, Yuxin, et al.
Published: (2024)

Fake or Compromised? Making Sense of Malicious Clients in Federated Learning
by: Mozaffari, Hamid, et al.
Published: (2024)

BoBa: Boosting Backdoor Detection through Data Distribution Inference in Federated Learning
by: Jiang, Zhengyuan, et al.
Published: (2024)

FHE-Agent: Automating CKKS Configuration for Practical Encrypted Inference via an LLM-Guided Agentic Framework
by: Xu, Nuo, et al.
Published: (2025)

End-to-End Anti-Backdoor Learning on Images and Time Series
by: Jiang, Yujing, et al.
Published: (2024)

SoK: Reducing the Vulnerability of Fine-tuned Language Models to Membership Inference Attacks
by: Amit, Guy, et al.
Published: (2024)

Detecting Compromised IoT Devices Using Autoencoders with Sequential Hypothesis Testing
by: Mainuddin, Md, et al.
Published: (2024)

Authority Backdoor: A Certifiable Backdoor Mechanism for Authoring DNNs
by: Yang, Han, et al.
Published: (2025)

BackdoorBench: A Comprehensive Benchmark and Analysis of Backdoor Learning
by: Wu, Baoyuan, et al.
Published: (2024)

BadTemplate: A Training-Free Backdoor Attack via Chat Template Against Large Language Models
by: Wang, Zihan, et al.
Published: (2026)

DP-Dueling: Learning from Preference Feedback without Compromising User Privacy
by: Saha, Aadirupa, et al.
Published: (2024)

Publishing Neural Networks in Drug Discovery Might Compromise Training Data Privacy
by: Krüger, Fabian P., et al.
Published: (2024)

Backdoor Learning Curves: Explaining Backdoor Poisoning Beyond Influence Functions
by: Cinà, Antonio Emanuele, et al.
Published: (2021)

Unelicitable Backdoors in Language Models via Cryptographic Transformer Circuits
by: Draguns, Andis, et al.
Published: (2024)

Hardware-Triggered Backdoors
by: Möller, Jonas, et al.
Published: (2026)

Whispers in the Machine: Confidentiality in Agentic Systems
by: Evertz, Jonathan, et al.
Published: (2024)

Hiding Backdoors within Event Sequence Data via Poisoning Attacks
by: Ermilova, Alina, et al.
Published: (2023)

DeDe: Detecting Backdoor Samples for SSL Encoders via Decoders
by: Hou, Sizai, et al.
Published: (2024)

A Survey of Learning-Based Intrusion Detection Systems for In-Vehicle Network
by: Althunayyan, Muzun, et al.
Published: (2025)

Fusing Pruned and Backdoored Models: Optimal Transport-based Data-free Backdoor Mitigation
by: Lin, Weilin, et al.
Published: (2024)