:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Liu, Yongxiang, Peng, Bowen, Liu, Li, Li, Xiang
Format:	Preprint
Published:	2024
Subjects:	Cryptography and Security Artificial Intelligence
Online Access:	https://arxiv.org/abs/2410.13891
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

VTarbel: Targeted Label Attack with Minimal Knowledge on Detector-enhanced Vertical Federated Learning
by: Tan, Juntao, et al.
Published: (2025)

Jailbreaking GPT-4V via Self-Adversarial Attacks with System Prompts
by: Wu, Yuanwei, et al.
Published: (2023)

Targeted Bit-Flip Attacks on LLM-Based Agents
by: Wang, Jialai, et al.
Published: (2026)

Everywhere Attack: Attacking Locally and Globally to Boost Targeted Transferability
by: Zeng, Hui, et al.
Published: (2025)

TED-LaST: Towards Robust Backdoor Defense Against Adaptive Attacks
by: Mo, Xiaoxing, et al.
Published: (2025)

Involuntary Jailbreak: On Self-Prompting Attacks
by: Guo, Yangyang, et al.
Published: (2025)

Dynamic Target Attack
by: Xiu, Kedong, et al.
Published: (2025)

LoopLLM: Transferable Energy-Latency Attacks in LLMs via Repetitive Generation
by: Li, Xingyu, et al.
Published: (2025)

ShadowMerge: A Novel Poisoning Attack on Graph-Based Agent Memory via Relation-Channel Conflicts
by: Luo, Yang, et al.
Published: (2026)

AEGIS: White-Box Attack Path Generation using LLMs and Training Effectiveness Evaluation for Large-Scale Cyber Defence Exercises
by: Tung, Ivan K., et al.
Published: (2026)

SFIBA: Spatial-based Full-target Invisible Backdoor Attacks
by: Yin, Yangxu, et al.
Published: (2025)

Investigating Deep Watermark Security: An Adversarial Transferability Perspective
by: Qi, Biqing, et al.
Published: (2024)

New Wide-Net-Casting Jailbreak Attacks Risk Large Models
by: Xiang, Qiuchi, et al.
Published: (2026)

AutoDAN-Reasoning: Enhancing Strategies Exploration based Jailbreak Attacks with Test-Time Scaling
by: Liu, Xiaogeng, et al.
Published: (2025)

FFCBA: Feature-based Full-target Clean-label Backdoor Attacks
by: Yin, Yangxu, et al.
Published: (2025)

A Cross-Language Investigation into Jailbreak Attacks in Large Language Models
by: Li, Jie, et al.
Published: (2024)

Breaking PEFT Limitations: Leveraging Weak-to-Strong Knowledge Transfer for Backdoor Attacks in LLMs
by: Zhao, Shuai, et al.
Published: (2024)

Investigating White-Box Attacks for On-Device Models
by: Zhou, Mingyi, et al.
Published: (2024)

Is Monitoring Enough? Strategic Agent Selection For Stealthy Attack in Multi-Agent Discussions
by: Xiang, Qiuchi, et al.
Published: (2026)

Towards Effective, Stealthy, and Persistent Backdoor Attacks Targeting Graph Foundation Models
by: Luo, Jiayi, et al.
Published: (2025)

A Model Stealing Attack Against Multi-Exit Networks
by: Pan, Li, et al.
Published: (2023)

Not What You Asked For: Typographic Attacks in Household Robot Manipulation
by: Iranmanesh, Ali, et al.
Published: (2026)

Explainer-guided Targeted Adversarial Attacks against Binary Code Similarity Detection Models
by: Chen, Mingjie, et al.
Published: (2025)

LaSM: Layer-wise Scaling Mechanism for Defending Pop-up Attack on GUI Agents
by: Yan, Zihe, et al.
Published: (2025)

DualSentinel: A Lightweight Framework for Detecting Targeted Attacks in Black-box LLM via Dual Entropy Lull Pattern
by: Pang, Xiaoyi, et al.
Published: (2026)

A Comprehensive Study of Jailbreak Attack versus Defense for Large Language Models
by: Xu, Zihao, et al.
Published: (2024)

Generating Is Believing: Membership Inference Attacks against Retrieval-Augmented Generation
by: Li, Yuying, et al.
Published: (2024)

Delayed Backdoor Attacks: Exploring the Temporal Dimension as a New Attack Surface in Pre-Trained Models
by: Ding, Zikang, et al.
Published: (2026)

Medusa: Cross-Modal Transferable Adversarial Attacks on Multimodal Medical Retrieval-Augmented Generation
by: Shang, Yingjia, et al.
Published: (2025)

The Attack and Defense Landscape of Agentic AI: A Comprehensive Survey
by: Kim, Juhee, et al.
Published: (2026)

BadThink: Triggered Overthinking Attacks on Chain-of-Thought Reasoning in Large Language Models
by: Liu, Shuaitong, et al.
Published: (2025)

Sandcastles in the Storm: Revisiting the (Im)possibility of Strong Watermarking
by: Harel-Canada, Fabrice Y, et al.
Published: (2025)

GhostCite: A Large-Scale Analysis of Citation Validity in the Age of Large Language Models
by: Xu, Zuyao, et al.
Published: (2026)

Enhancing TinyML Security: Study of Adversarial Attack Transferability
by: Shah, Parin, et al.
Published: (2024)

Injection, Attack and Erasure: Revocable Backdoor Attacks via Machine Unlearning
by: Song, Baogang, et al.
Published: (2025)

MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents
by: Zhang, Dongsen, et al.
Published: (2025)

Red-teaming the Multimodal Reasoning: Jailbreaking Vision-Language Models via Cross-modal Entanglement Attacks
by: Yan, Yu, et al.
Published: (2026)

To Protect the LLM Agent Against the Prompt Injection Attack with Polymorphic Prompt
by: Wang, Zhilong, et al.
Published: (2025)

No Attack Required: Semantic Fuzzing for Specification Violations in Agent Skills
by: Li, Ying, et al.
Published: (2026)

EdgeShield: A Universal and Efficient Edge Computing Framework for Robust AI
by: Zhong, Duo, et al.
Published: (2024)