| Main Authors: | Zhou, Yuxuan, Peng, Yuzhao, Bai, Yang, Gao, Kuofeng, Zhang, Yihao, Zhang, Yechao, Chen, Xun, Yu, Tao, Dai, Tao, Xia, Shu-Tao |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.08367 |
Similar Items
JPRO: Automated Multimodal Jailbreaking via Multi-Agent Collaboration Framework
by: Zhou, Yuxuan, et al.
Published: (2025)
Why Does Little Robustness Help? A Further Step Towards Understanding Adversarial Transferability
by: Zhang, Yechao, et al.
Published: (2023)
Protecting Your Video Content: Disrupting Automated Video-based LLM Annotations
by: Liu, Haitong, et al.
Published: (2025)
Imperceptible Jailbreaking against Large Language Models
by: Gao, Kuofeng, et al.
Published: (2025)
Video Watermarking: Safeguarding Your Video from (Unauthorized) Annotations by Video-based LLMs
by: Li, Jinmin, et al.
Published: (2024)
Improving Deepfake Detection with Reinforcement Learning-Based Adaptive Data Augmentation
by: Zhou, Yuxuan, et al.
Published: (2025)
Towards Dataset Copyright Evasion Attack against Personalized Text-to-Image Diffusion Models
by: Gao, Kuofeng, et al.
Published: (2025)
Denial-of-Service Poisoning Attacks against Large Language Models
by: Gao, Kuofeng, et al.
Published: (2024)
Coward: Collision-based OOD Watermarking for Practical Proactive Federated Backdoor Detection
by: Li, Wenjie, et al.
Published: (2025)
Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images
by: Gao, Kuofeng, et al.
Published: (2024)
Jailbreaking LLMs & VLMs: Mechanisms, Evaluation, and Unified Defense
by: Chen, Zejian, et al.
Published: (2026)
WATER-GS: Toward Copyright Protection for 3D Gaussian Splatting via Universal Watermarking
by: Tan, Yuqi, et al.
Published: (2024)
Give Them an Inch and They Will Take a Mile: Understanding and Measuring Caller Identity Confusion in MCP-Based AI Systems
by: Huang, Yuhang, et al.
Published: (2026)
Take a Step Further: Understanding Page Spray in Linux Kernel Exploitation
by: Guo, Ziyi, et al.
Published: (2024)
Boosting Jailbreak Attack with Momentum
by: Zhang, Yihao, et al.
Published: (2024)
Alleviating the Fear of Losing Alignment in LLM Fine-tuning
by: Yang, Kang, et al.
Published: (2025)
Not All Prompts Are Secure: A Switchable Backdoor Attack Against Pre-trained Vision Transformers
by: Yang, Sheng, et al.
Published: (2024)
Sliced Rényi Pufferfish Privacy: Directional Additive Noise Mechanism and Private Learning with Gradient Clipping
by: Zhang, Tao, et al.
Published: (2025)
Residual-PAC Privacy: Automatic Privacy Control Beyond the Gaussian Barrier
by: Zhang, Tao, et al.
Published: (2025)
BackdoorIndicator: Leveraging OOD Data for Proactive Backdoor Detection in Federated Learning
by: Li, Songze, et al.
Published: (2024)
Secure Transfer Learning: Training Clean Models Against Backdoor in (Both) Pre-trained Encoders and Downstream Datasets
by: Zhang, Yechao, et al.
Published: (2025)
SINCon: Mitigate LLM-Generated Malicious Message Injection Attack for Rumor Detection
by: Zhang, Mingqing, et al.
Published: (2025)
Rapid Optimization for Jailbreaking LLMs via Subconscious Exploitation and Echopraxia
by: Shen, Guangyu, et al.
Published: (2024)
Jailbreak Vision Language Models via Bi-Modal Adversarial Prompt
by: Ying, Zonghao, et al.
Published: (2024)
Unveiling the Safety of GPT-4o: An Empirical Study using Jailbreak Attacks
by: Ying, Zonghao, et al.
Published: (2024)
Towards Sample-specific Backdoor Attack with Clean Labels via Attribute Trigger
by: Zhu, Mingyan, et al.
Published: (2023)
CodeChameleon: Personalized Encryption Framework for Jailbreaking Large Language Models
by: Lv, Huijie, et al.
Published: (2024)
STARE: Step-wise Temporal Alignment and Red-teaming Engine for Multi-modal Toxicity Attack
by: Mao, Xutao, et al.
Published: (2026)
JailbreakLens: Interpreting Jailbreak Mechanism in the Lens of Representation and Circuit
by: He, Zeqing, et al.
Published: (2024)
Security and Privacy on Generative Data in AIGC: A Survey
by: Wang, Tao, et al.
Published: (2023)
T2VShield: Model-Agnostic Jailbreak Defense for Text-to-Video Models
by: Liang, Siyuan, et al.
Published: (2025)
"MCP Does Not Stand for Misuse Cryptography Protocol": Uncovering Cryptographic Misuse in Model Context Protocol at Scale
by: Yan, Biwei, et al.
Published: (2025)
Towards Understanding the Safety Boundaries of DeepSeek Models: Evaluation and Findings
by: Ying, Zonghao, et al.
Published: (2025)
Odysseus: Jailbreaking Commercial Multimodal LLM-integrated Systems via Dual Steganography
by: Li, Songze, et al.
Published: (2025)
Understanding and Enhancing the Transferability of Jailbreaking Attacks
by: Lin, Runqi, et al.
Published: (2025)
HTS-Attack: Heuristic Token Search for Jailbreaking Text-to-Image Models
by: Gao, Sensen, et al.
Published: (2024)
SmartIntentNN: Towards Smart Contract Intent Detection
by: Huang, Youwei, et al.
Published: (2022)
Blackbox Dataset Inference for LLM
by: Zhou, Ruikai, et al.
Published: (2025)
Reasoning-Augmented Conversation for Multi-Turn Jailbreak Attacks on Large Language Models
by: Ying, Zonghao, et al.
Published: (2025)
Low Rank Comes with Low Security: Gradient Assembly Poisoning Attacks against Distributed LoRA-based LLM Systems
by: Dong, Yueyan, et al.
Published: (2026)