Saved in:
| Main Author: | You, Doohee |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.18988 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Bidirectional Intention Inference Enhances LLMs' Defense Against Multi-Turn Jailbreak Attacks
by: Tong, Haibo, et al.
Published: (2025)
by: Tong, Haibo, et al.
Published: (2025)
Multimodal Prompt Injection Attacks: Risks and Defenses for Modern LLMs
by: Yeo, Andrew, et al.
Published: (2025)
by: Yeo, Andrew, et al.
Published: (2025)
CivicShield: A Cross-Domain Defense-in-Depth Framework for Securing Government-Facing AI Chatbots Against Multi-Turn Adversarial Attacks
by: Patil, KrishnaSaiReddy
Published: (2026)
by: Patil, KrishnaSaiReddy
Published: (2026)
ICON: Intent-Context Coupling for Efficient Multi-Turn Jailbreak Attack
by: Lin, Xingwei, et al.
Published: (2026)
by: Lin, Xingwei, et al.
Published: (2026)
One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue
by: Shen, Xinjie, et al.
Published: (2026)
by: Shen, Xinjie, et al.
Published: (2026)
Latent Adversarial Detection: Adaptive Probing of LLM Activations for Multi-Turn Attack Detection
by: Kulkarni, Prashant
Published: (2026)
by: Kulkarni, Prashant
Published: (2026)
MT-JailBench: A Modular Benchmark for Understanding Multi-Turn Jailbreak Attacks
by: Zhang, Xinkai, et al.
Published: (2026)
by: Zhang, Xinkai, et al.
Published: (2026)
Great, Now Write an Article About That: The Crescendo Multi-Turn LLM Jailbreak Attack
by: Russinovich, Mark, et al.
Published: (2024)
by: Russinovich, Mark, et al.
Published: (2024)
Secure Tug-of-War (SecTOW): Iterative Defense-Attack Training with Reinforcement Learning for Multimodal Model Security
by: Dai, Muzhi, et al.
Published: (2025)
by: Dai, Muzhi, et al.
Published: (2025)
HarmNet: A Framework for Adaptive Multi-Turn Jailbreak Attacks on Large Language Models
by: Narula, Sidhant, et al.
Published: (2025)
by: Narula, Sidhant, et al.
Published: (2025)
A Method for Enhancing the Safety of Large Model Generation Based on Multi-dimensional Attack and Defense
by: Zhai, Keke
Published: (2024)
by: Zhai, Keke
Published: (2024)
Defenses Against Prompt Attacks Learn Surface Heuristics
by: Li, Shawn, et al.
Published: (2026)
by: Li, Shawn, et al.
Published: (2026)
Threats, Attacks, and Defenses in Machine Unlearning: A Survey
by: Liu, Ziyao, et al.
Published: (2024)
by: Liu, Ziyao, et al.
Published: (2024)
Exploring Backdoor Attack and Defense for LLM-empowered Recommendations
by: Ning, Liangbo, et al.
Published: (2025)
by: Ning, Liangbo, et al.
Published: (2025)
Adversarial Machine Learning: Attacks, Defenses, and Open Challenges
by: Jha, Pranav K
Published: (2025)
by: Jha, Pranav K
Published: (2025)
Emerging Vulnerabilities in Frontier Models: Multi-Turn Jailbreak Attacks
by: Gibbs, Tom, et al.
Published: (2024)
by: Gibbs, Tom, et al.
Published: (2024)
Turning Generative Models Degenerate: The Power of Data Poisoning Attacks
by: Jiang, Shuli, et al.
Published: (2024)
by: Jiang, Shuli, et al.
Published: (2024)
Unseen Attack Detection in Software-Defined Networking Using a BERT-Based Large Language Model
by: Swileh, Mohammed N., et al.
Published: (2024)
by: Swileh, Mohammed N., et al.
Published: (2024)
The Attack and Defense Landscape of Agentic AI: A Comprehensive Survey
by: Kim, Juhee, et al.
Published: (2026)
by: Kim, Juhee, et al.
Published: (2026)
Evolving Security in LLMs: A Study of Jailbreak Attacks and Defenses
by: Shang, Zhengchun, et al.
Published: (2025)
by: Shang, Zhengchun, et al.
Published: (2025)
Recent Advances in Attack and Defense Approaches of Large Language Models
by: Cui, Jing, et al.
Published: (2024)
by: Cui, Jing, et al.
Published: (2024)
A Critical Evaluation of Defenses against Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2025)
by: Jia, Yuqi, et al.
Published: (2025)
Investigating Vulnerabilities and Defenses Against Audio-Visual Attacks: A Comprehensive Survey Emphasizing Multimodal Models
by: Wen, Jinming, et al.
Published: (2025)
by: Wen, Jinming, et al.
Published: (2025)
FedSecurity: Benchmarking Attacks and Defenses in Federated Learning and Federated LLMs
by: Han, Shanshan, et al.
Published: (2023)
by: Han, Shanshan, et al.
Published: (2023)
Data to Defense: The Role of Curation in Customizing LLMs Against Jailbreaking Attacks
by: Liu, Xiaoqun, et al.
Published: (2024)
by: Liu, Xiaoqun, et al.
Published: (2024)
Dual Defense: Enhancing Privacy and Mitigating Poisoning Attacks in Federated Learning
by: Xu, Runhua, et al.
Published: (2025)
by: Xu, Runhua, et al.
Published: (2025)
AMDS: Attack-Aware Multi-Stage Defense System for Network Intrusion Detection with Two-Stage Adaptive Weight Learning
by: Olukola, Oluseyi, et al.
Published: (2026)
by: Olukola, Oluseyi, et al.
Published: (2026)
Transient Turn Injection: Exposing Stateless Multi-Turn Vulnerabilities in Large Language Models
by: Rayhan, Naheed, et al.
Published: (2026)
by: Rayhan, Naheed, et al.
Published: (2026)
Reasoning-Augmented Conversation for Multi-Turn Jailbreak Attacks on Large Language Models
by: Ying, Zonghao, et al.
Published: (2025)
by: Ying, Zonghao, et al.
Published: (2025)
Securing Retrieval-Augmented Generation: A Taxonomy of Attacks, Defenses, and Future Directions
by: Xu, Yuming, et al.
Published: (2026)
by: Xu, Yuming, et al.
Published: (2026)
SHIELD: An Auto-Healing Agentic Defense Framework for LLM Resource Exhaustion Attacks
by: Sivaroopan, Nirhoshan, et al.
Published: (2026)
by: Sivaroopan, Nirhoshan, et al.
Published: (2026)
Attacks and Defenses Against LLM Fingerprinting
by: Kurian, Kevin, et al.
Published: (2025)
by: Kurian, Kevin, et al.
Published: (2025)
MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
by: Zhu, Kaijie, et al.
Published: (2025)
by: Zhu, Kaijie, et al.
Published: (2025)
A Comprehensive Study of Jailbreak Attack versus Defense for Large Language Models
by: Xu, Zihao, et al.
Published: (2024)
by: Xu, Zihao, et al.
Published: (2024)
TED-LaST: Towards Robust Backdoor Defense Against Adaptive Attacks
by: Mo, Xiaoxing, et al.
Published: (2025)
by: Mo, Xiaoxing, et al.
Published: (2025)
Intellectual Property in Graph-Based Machine Learning as a Service: Attacks and Defenses
by: Li, Lincan, et al.
Published: (2025)
by: Li, Lincan, et al.
Published: (2025)
Ensemble Privacy Defense for Knowledge-Intensive LLMs against Membership Inference Attacks
by: Fu, Haowei, et al.
Published: (2025)
by: Fu, Haowei, et al.
Published: (2025)
Data Duplication: A Novel Multi-Purpose Attack Paradigm in Machine Unlearning
by: Ye, Dayong, et al.
Published: (2025)
by: Ye, Dayong, et al.
Published: (2025)
The Echo Chamber Multi-Turn LLM Jailbreak
by: Alobaid, Ahmad, et al.
Published: (2026)
by: Alobaid, Ahmad, et al.
Published: (2026)
SOK: A Taxonomy of Attack Vectors and Defense Strategies for Agentic Supply Chain Runtime
by: Jiang, Xiaochong, et al.
Published: (2026)
by: Jiang, Xiaochong, et al.
Published: (2026)
Similar Items
-
Bidirectional Intention Inference Enhances LLMs' Defense Against Multi-Turn Jailbreak Attacks
by: Tong, Haibo, et al.
Published: (2025) -
Multimodal Prompt Injection Attacks: Risks and Defenses for Modern LLMs
by: Yeo, Andrew, et al.
Published: (2025) -
CivicShield: A Cross-Domain Defense-in-Depth Framework for Securing Government-Facing AI Chatbots Against Multi-Turn Adversarial Attacks
by: Patil, KrishnaSaiReddy
Published: (2026) -
ICON: Intent-Context Coupling for Efficient Multi-Turn Jailbreak Attack
by: Lin, Xingwei, et al.
Published: (2026) -
One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue
by: Shen, Xinjie, et al.
Published: (2026)