:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Zhou, Yuxuan, Peng, Yuzhao, Bai, Yang, Gao, Kuofeng, Zhang, Yihao, Zhang, Yechao, Chen, Xun, Yu, Tao, Dai, Tao, Xia, Shu-Tao
Format:	Preprint
Published:	2025
Subjects:	Cryptography and Security
Online Access:	https://arxiv.org/abs/2511.08367
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

JPRO: Automated Multimodal Jailbreaking via Multi-Agent Collaboration Framework
by: Zhou, Yuxuan, et al.
Published: (2025)

Why Does Little Robustness Help? A Further Step Towards Understanding Adversarial Transferability
by: Zhang, Yechao, et al.
Published: (2023)

Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations
by: Liu, Haitong, et al.
Published: (2025)

Imperceptible Jailbreaking against Large Language Models
by: Gao, Kuofeng, et al.
Published: (2025)

Video Watermarking: Safeguarding Your Video from (Unauthorized) Annotations by Video-based LLMs
by: Li, Jinmin, et al.
Published: (2024)

Improving Deepfake Detection with Reinforcement Learning-Based Adaptive Data Augmentation
by: Zhou, Yuxuan, et al.
Published: (2025)

Towards Dataset Copyright Evasion Attack against Personalized Text-to-Image Diffusion Models
by: Gao, Kuofeng, et al.
Published: (2025)

Denial-of-Service Poisoning Attacks against Large Language Models
by: Gao, Kuofeng, et al.
Published: (2024)

Coward: Collision-based OOD Watermarking for Practical Proactive Federated Backdoor Detection
by: Li, Wenjie, et al.
Published: (2025)

Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images
by: Gao, Kuofeng, et al.
Published: (2024)

Jailbreaking LLMs & VLMs: Mechanisms, Evaluation, and Unified Defense
by: Chen, Zejian, et al.
Published: (2026)

WATER-GS: Toward Copyright Protection for 3D Gaussian Splatting via Universal Watermarking
by: Tan, Yuqi, et al.
Published: (2024)

Give Them an Inch and They Will Take a Mile:Understanding and Measuring Caller Identity Confusion in MCP-Based AI Systems
by: Huang, Yuhang, et al.
Published: (2026)

Take a Step Further: Understanding Page Spray in Linux Kernel Exploitation
by: Guo, Ziyi, et al.
Published: (2024)

Boosting Jailbreak Attack with Momentum
by: Zhang, Yihao, et al.
Published: (2024)

Alleviating the Fear of Losing Alignment in LLM Fine-tuning
by: Yang, Kang, et al.
Published: (2025)

Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers
by: Yang, Sheng, et al.
Published: (2024)

Sliced Rényi Pufferfish Privacy: Directional Additive Noise Mechanism and Private Learning with Gradient Clipping
by: Zhang, Tao, et al.
Published: (2025)

Residual-PAC Privacy: Automatic Privacy Control Beyond the Gaussian Barrier
by: Zhang, Tao, et al.
Published: (2025)

BackdoorIndicator: Leveraging OOD Data for Proactive Backdoor Detection in Federated Learning
by: Li, Songze, et al.
Published: (2024)

Secure Transfer Learning: Training Clean Models Against Backdoor in (Both) Pre-trained Encoders and Downstream Datasets
by: Zhang, Yechao, et al.
Published: (2025)

SINCon: Mitigate LLM-Generated Malicious Message Injection Attack for Rumor Detection
by: Zhang, Mingqing, et al.
Published: (2025)

Rapid Optimization for Jailbreaking LLMs via Subconscious Exploitation and Echopraxia
by: Shen, Guangyu, et al.
Published: (2024)

Jailbreak Vision Language Models via Bi-Modal Adversarial Prompt
by: Ying, Zonghao, et al.
Published: (2024)

Unveiling the Safety of GPT-4o: An Empirical Study using Jailbreak Attacks
by: Ying, Zonghao, et al.
Published: (2024)

Towards Sample-specific Backdoor Attack with Clean Labels via Attribute Trigger
by: Zhu, Mingyan, et al.
Published: (2023)

CodeChameleon: Personalized Encryption Framework for Jailbreaking Large Language Models
by: Lv, Huijie, et al.
Published: (2024)

STARE: Step-wise Temporal Alignment and Red-teaming Engine for Multi-modal Toxicity Attack
by: Mao, Xutao, et al.
Published: (2026)

JailbreakLens: Interpreting Jailbreak Mechanism in the Lens of Representation and Circuit
by: He, Zeqing, et al.
Published: (2024)

Security and Privacy on Generative Data in AIGC: A Survey
by: Wang, Tao, et al.
Published: (2023)

T2VShield: Model-Agnostic Jailbreak Defense for Text-to-Video Models
by: Liang, Siyuan, et al.
Published: (2025)

"MCP Does Not Stand for Misuse Cryptography Protocol": Uncovering Cryptographic Misuse in Model Context Protocol at Scale
by: Yan, Biwei, et al.
Published: (2025)

Towards Understanding the Safety Boundaries of DeepSeek Models: Evaluation and Findings
by: Ying, Zonghao, et al.
Published: (2025)

Odysseus: Jailbreaking Commercial Multimodal LLM-integrated Systems via Dual Steganography
by: Li, Songze, et al.
Published: (2025)

Understanding and Enhancing the Transferability of Jailbreaking Attacks
by: Lin, Runqi, et al.
Published: (2025)

HTS-Attack: Heuristic Token Search for Jailbreaking Text-to-Image Models
by: Gao, Sensen, et al.
Published: (2024)

SmartIntentNN: Towards Smart Contract Intent Detection
by: Huang, Youwei, et al.
Published: (2022)

Blackbox Dataset Inference for LLM
by: Zhou, Ruikai, et al.
Published: (2025)

Reasoning-Augmented Conversation for Multi-Turn Jailbreak Attacks on Large Language Models
by: Ying, Zonghao, et al.
Published: (2025)

Low Rank Comes with Low Security: Gradient Assembly Poisoning Attacks against Distributed LoRA-based LLM Systems
by: Dong, Yueyan, et al.
Published: (2026)