Saved in:
| Main Authors: | Truong, Vu Tuan, Dang, Luan Ba, Le, Long Bao |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2408.03400 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Critical-CoT: A Robust Defense Framework against Reasoning-Level Backdoor Attacks in Large Language Models
by: Truong, Vu Tuan, et al.
Published: (2026)
by: Truong, Vu Tuan, et al.
Published: (2026)
Enhancing Security in Deep Reinforcement Learning: A Comprehensive Survey on Adversarial Attacks and Defenses
by: Yichao, Wu, et al.
Published: (2025)
by: Yichao, Wu, et al.
Published: (2025)
A Survey on Model Extraction Attacks and Defenses for Large Language Models
by: Zhao, Kaixiang, et al.
Published: (2025)
by: Zhao, Kaixiang, et al.
Published: (2025)
A Survey of Model Extraction Attacks and Defenses in Distributed Computing Environments
by: Zhao, Kaixiang, et al.
Published: (2025)
by: Zhao, Kaixiang, et al.
Published: (2025)
Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey
by: Huang, Tiansheng, et al.
Published: (2024)
by: Huang, Tiansheng, et al.
Published: (2024)
A Systematic Survey of Model Extraction Attacks and Defenses: State-of-the-Art and Perspectives
by: Zhao, Kaixiang, et al.
Published: (2025)
by: Zhao, Kaixiang, et al.
Published: (2025)
Jailbreak Attacks and Defenses Against Large Language Models: A Survey
by: Yi, Sibo, et al.
Published: (2024)
by: Yi, Sibo, et al.
Published: (2024)
Attacks and Defenses Against LLM Fingerprinting
by: Kurian, Kevin, et al.
Published: (2025)
by: Kurian, Kevin, et al.
Published: (2025)
A Causal Perspective for Enhancing Jailbreak Attack and Defense
by: Pan, Licheng, et al.
Published: (2026)
by: Pan, Licheng, et al.
Published: (2026)
Optimal Defenses Against Gradient Reconstruction Attacks
by: Chen, Yuxiao, et al.
Published: (2024)
by: Chen, Yuxiao, et al.
Published: (2024)
Stealthy Poisoning Attacks Bypass Defenses in Regression Settings
by: Carnerero-Cano, Javier, et al.
Published: (2026)
by: Carnerero-Cano, Javier, et al.
Published: (2026)
SoK: Benchmarking Poisoning Attacks and Defenses in Federated Learning
by: Zhang, Heyi, et al.
Published: (2025)
by: Zhang, Heyi, et al.
Published: (2025)
Backdoor Vectors: a Task Arithmetic View on Backdoor Attacks and Defenses
by: Pawlak, Stanisław, et al.
Published: (2025)
by: Pawlak, Stanisław, et al.
Published: (2025)
Semantic Chameleon: Corpus-Dependent Poisoning Attacks and Defenses in RAG Systems
by: Thornton, Scott
Published: (2026)
by: Thornton, Scott
Published: (2026)
LeakSealer: A Semisupervised Defense for LLMs Against Prompt Injection and Leakage Attacks
by: Panebianco, Francesco, et al.
Published: (2025)
by: Panebianco, Francesco, et al.
Published: (2025)
A Survey of Privacy Threats and Defense in Vertical Federated Learning: From Model Life Cycle Perspective
by: Yu, Lei, et al.
Published: (2024)
by: Yu, Lei, et al.
Published: (2024)
Defending the Edge: Representative-Attention Defense against Backdoor Attacks in Federated Learning
by: Obioma, Chibueze Peace, et al.
Published: (2025)
by: Obioma, Chibueze Peace, et al.
Published: (2025)
The Attack and Defense Landscape of Agentic AI: A Comprehensive Survey
by: Kim, Juhee, et al.
Published: (2026)
by: Kim, Juhee, et al.
Published: (2026)
MetaDefense: Defending Finetuning-based Jailbreak Attack Before and During Generation
by: Jiang, Weisen, et al.
Published: (2025)
by: Jiang, Weisen, et al.
Published: (2025)
Real-world Adversarial Defense against Patch Attacks based on Diffusion Model
by: Wei, Xingxing, et al.
Published: (2024)
by: Wei, Xingxing, et al.
Published: (2024)
Formalizing and Benchmarking Prompt Injection Attacks and Defenses
by: Liu, Yupei, et al.
Published: (2023)
by: Liu, Yupei, et al.
Published: (2023)
A Survey of Privacy-Preserving Model Explanations: Privacy Risks, Attacks, and Countermeasures
by: Nguyen, Thanh Tam, et al.
Published: (2024)
by: Nguyen, Thanh Tam, et al.
Published: (2024)
DIFFender: Diffusion-Based Adversarial Defense against Patch Attacks
by: Kang, Caixin, et al.
Published: (2023)
by: Kang, Caixin, et al.
Published: (2023)
Diffusion-Driven Synthetic Tabular Data Generation for Enhanced DoS/DDoS Attack Classification
by: B, Aravind, et al.
Published: (2026)
by: B, Aravind, et al.
Published: (2026)
AI Security in the Foundation Model Era: A Comprehensive Survey from a Unified Perspective
by: Wang, Zhenyi, et al.
Published: (2026)
by: Wang, Zhenyi, et al.
Published: (2026)
A Comprehensive Study of Supervised Machine Learning Models for Zero-Day Attack Detection: Analyzing Performance on Imbalanced Data
by: Lotfi, Zahra, et al.
Published: (2025)
by: Lotfi, Zahra, et al.
Published: (2025)
LaFA: Latent Feature Attacks on Non-negative Matrix Factorization
by: Vu, Minh, et al.
Published: (2024)
by: Vu, Minh, et al.
Published: (2024)
PureGen: Universal Data Purification for Train-Time Poison Defense via Generative Model Dynamics
by: Bhat, Sunay, et al.
Published: (2024)
by: Bhat, Sunay, et al.
Published: (2024)
Thought Purity: A Defense Framework For Chain-of-Thought Attack
by: Xue, Zihao, et al.
Published: (2025)
by: Xue, Zihao, et al.
Published: (2025)
UIFV: Data Reconstruction Attack in Vertical Federated Learning
by: Yang, Jirui, et al.
Published: (2024)
by: Yang, Jirui, et al.
Published: (2024)
RedVisor: Reasoning-Aware Prompt Injection Defense via Zero-Copy KV Cache Reuse
by: Liu, Mingrui, et al.
Published: (2026)
by: Liu, Mingrui, et al.
Published: (2026)
How Does a Deep Learning Model Architecture Impact Its Privacy? A Comprehensive Study of Privacy Attacks on CNNs and Transformers
by: Zhang, Guangsheng, et al.
Published: (2022)
by: Zhang, Guangsheng, et al.
Published: (2022)
Winning the MIDST Challenge: New Membership Inference Attacks on Diffusion Models for Tabular Data Synthesis
by: Wu, Xiaoyu, et al.
Published: (2025)
by: Wu, Xiaoyu, et al.
Published: (2025)
Disrupting Model Merging: A Parameter-Level Defense Without Sacrificing Accuracy
by: Junhao, Wei, et al.
Published: (2025)
by: Junhao, Wei, et al.
Published: (2025)
LoRID: Low-Rank Iterative Diffusion for Adversarial Purification
by: Zollicoffer, Geigh, et al.
Published: (2024)
by: Zollicoffer, Geigh, et al.
Published: (2024)
Large Language Models in Cybersecurity: Applications, Vulnerabilities, and Defense Techniques
by: Jaffal, Niveen O., et al.
Published: (2025)
by: Jaffal, Niveen O., et al.
Published: (2025)
Twin Auto-Encoder Model for Learning Separable Representation in Cyberattack Detection
by: Dinh, Phai Vu, et al.
Published: (2024)
by: Dinh, Phai Vu, et al.
Published: (2024)
LLM Security: Vulnerabilities, Attacks, Defenses, and Countermeasures
by: Aguilera-Martínez, Francisco, et al.
Published: (2025)
by: Aguilera-Martínez, Francisco, et al.
Published: (2025)
Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval
by: Chen, Taiye, et al.
Published: (2025)
by: Chen, Taiye, et al.
Published: (2025)
A Survey of Recent Backdoor Attacks and Defenses in Large Language Models
by: Zhao, Shuai, et al.
Published: (2024)
by: Zhao, Shuai, et al.
Published: (2024)
Similar Items
-
Critical-CoT: A Robust Defense Framework against Reasoning-Level Backdoor Attacks in Large Language Models
by: Truong, Vu Tuan, et al.
Published: (2026) -
Enhancing Security in Deep Reinforcement Learning: A Comprehensive Survey on Adversarial Attacks and Defenses
by: Yichao, Wu, et al.
Published: (2025) -
A Survey on Model Extraction Attacks and Defenses for Large Language Models
by: Zhao, Kaixiang, et al.
Published: (2025) -
A Survey of Model Extraction Attacks and Defenses in Distributed Computing Environments
by: Zhao, Kaixiang, et al.
Published: (2025) -
Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey
by: Huang, Tiansheng, et al.
Published: (2024)