:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	You, Doohee
Format:	Preprint
Published:	2026
Subjects:	Cryptography and Security Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.18988
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Bidirectional Intention Inference Enhances LLMs' Defense Against Multi-Turn Jailbreak Attacks
by: Tong, Haibo, et al.
Published: (2025)

Multimodal Prompt Injection Attacks: Risks and Defenses for Modern LLMs
by: Yeo, Andrew, et al.
Published: (2025)

CivicShield: A Cross-Domain Defense-in-Depth Framework for Securing Government-Facing AI Chatbots Against Multi-Turn Adversarial Attacks
by: Patil, KrishnaSaiReddy
Published: (2026)

ICON: Intent-Context Coupling for Efficient Multi-Turn Jailbreak Attack
by: Lin, Xingwei, et al.
Published: (2026)

One Turn Too Late: Response-Aware Defense Against Hidden Malicious Intent in Multi-Turn Dialogue
by: Shen, Xinjie, et al.
Published: (2026)

Latent Adversarial Detection: Adaptive Probing of LLM Activations for Multi-Turn Attack Detection
by: Kulkarni, Prashant
Published: (2026)

MT-JailBench: A Modular Benchmark for Understanding Multi-Turn Jailbreak Attacks
by: Zhang, Xinkai, et al.
Published: (2026)

Great, Now Write an Article About That: The Crescendo Multi-Turn LLM Jailbreak Attack
by: Russinovich, Mark, et al.
Published: (2024)

Secure Tug-of-War (SecTOW): Iterative Defense-Attack Training with Reinforcement Learning for Multimodal Model Security
by: Dai, Muzhi, et al.
Published: (2025)

HarmNet: A Framework for Adaptive Multi-Turn Jailbreak Attacks on Large Language Models
by: Narula, Sidhant, et al.
Published: (2025)

A Method for Enhancing the Safety of Large Model Generation Based on Multi-dimensional Attack and Defense
by: Zhai, Keke
Published: (2024)

Defenses Against Prompt Attacks Learn Surface Heuristics
by: Li, Shawn, et al.
Published: (2026)

Threats, Attacks, and Defenses in Machine Unlearning: A Survey
by: Liu, Ziyao, et al.
Published: (2024)

Exploring Backdoor Attack and Defense for LLM-empowered Recommendations
by: Ning, Liangbo, et al.
Published: (2025)

Adversarial Machine Learning: Attacks, Defenses, and Open Challenges
by: Jha, Pranav K
Published: (2025)

Emerging Vulnerabilities in Frontier Models: Multi-Turn Jailbreak Attacks
by: Gibbs, Tom, et al.
Published: (2024)

Turning Generative Models Degenerate: The Power of Data Poisoning Attacks
by: Jiang, Shuli, et al.
Published: (2024)

Unseen Attack Detection in Software-Defined Networking Using a BERT-Based Large Language Model
by: Swileh, Mohammed N., et al.
Published: (2024)

The Attack and Defense Landscape of Agentic AI: A Comprehensive Survey
by: Kim, Juhee, et al.
Published: (2026)

Evolving Security in LLMs: A Study of Jailbreak Attacks and Defenses
by: Shang, Zhengchun, et al.
Published: (2025)

Recent Advances in Attack and Defense Approaches of Large Language Models
by: Cui, Jing, et al.
Published: (2024)

A Critical Evaluation of Defenses against Prompt Injection Attacks
by: Jia, Yuqi, et al.
Published: (2025)

Investigating Vulnerabilities and Defenses Against Audio-Visual Attacks: A Comprehensive Survey Emphasizing Multimodal Models
by: Wen, Jinming, et al.
Published: (2025)

FedSecurity: Benchmarking Attacks and Defenses in Federated Learning and Federated LLMs
by: Han, Shanshan, et al.
Published: (2023)

Data to Defense: The Role of Curation in Customizing LLMs Against Jailbreaking Attacks
by: Liu, Xiaoqun, et al.
Published: (2024)

Dual Defense: Enhancing Privacy and Mitigating Poisoning Attacks in Federated Learning
by: Xu, Runhua, et al.
Published: (2025)

AMDS: Attack-Aware Multi-Stage Defense System for Network Intrusion Detection with Two-Stage Adaptive Weight Learning
by: Olukola, Oluseyi, et al.
Published: (2026)

Transient Turn Injection: Exposing Stateless Multi-Turn Vulnerabilities in Large Language Models
by: Rayhan, Naheed, et al.
Published: (2026)

Reasoning-Augmented Conversation for Multi-Turn Jailbreak Attacks on Large Language Models
by: Ying, Zonghao, et al.
Published: (2025)

Securing Retrieval-Augmented Generation: A Taxonomy of Attacks, Defenses, and Future Directions
by: Xu, Yuming, et al.
Published: (2026)

SHIELD: An Auto-Healing Agentic Defense Framework for LLM Resource Exhaustion Attacks
by: Sivaroopan, Nirhoshan, et al.
Published: (2026)

Attacks and Defenses Against LLM Fingerprinting
by: Kurian, Kevin, et al.
Published: (2025)

MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents
by: Zhu, Kaijie, et al.
Published: (2025)

A Comprehensive Study of Jailbreak Attack versus Defense for Large Language Models
by: Xu, Zihao, et al.
Published: (2024)

TED-LaST: Towards Robust Backdoor Defense Against Adaptive Attacks
by: Mo, Xiaoxing, et al.
Published: (2025)

Intellectual Property in Graph-Based Machine Learning as a Service: Attacks and Defenses
by: Li, Lincan, et al.
Published: (2025)

Ensemble Privacy Defense for Knowledge-Intensive LLMs against Membership Inference Attacks
by: Fu, Haowei, et al.
Published: (2025)

Data Duplication: A Novel Multi-Purpose Attack Paradigm in Machine Unlearning
by: Ye, Dayong, et al.
Published: (2025)

The Echo Chamber Multi-Turn LLM Jailbreak
by: Alobaid, Ahmad, et al.
Published: (2026)

SOK: A Taxonomy of Attack Vectors and Defense Strategies for Agentic Supply Chain Runtime
by: Jiang, Xiaochong, et al.
Published: (2026)