| Main Authors: | Liu, Yongxiang; Peng, Bowen; Liu, Li; Li, Xiang |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2410.13891 |
Similar Items
VTarbel: Targeted Label Attack with Minimal Knowledge on Detector-enhanced Vertical Federated Learning
by: Tan, Juntao, et al.
Published: (2025)
Jailbreaking GPT-4V via Self-Adversarial Attacks with System Prompts
by: Wu, Yuanwei, et al.
Published: (2023)
Targeted Bit-Flip Attacks on LLM-Based Agents
by: Wang, Jialai, et al.
Published: (2026)
Everywhere Attack: Attacking Locally and Globally to Boost Targeted Transferability
by: Zeng, Hui, et al.
Published: (2025)
TED-LaST: Towards Robust Backdoor Defense Against Adaptive Attacks
by: Mo, Xiaoxing, et al.
Published: (2025)
Involuntary Jailbreak: On Self-Prompting Attacks
by: Guo, Yangyang, et al.
Published: (2025)
Dynamic Target Attack
by: Xiu, Kedong, et al.
Published: (2025)
LoopLLM: Transferable Energy-Latency Attacks in LLMs via Repetitive Generation
by: Li, Xingyu, et al.
Published: (2025)
ShadowMerge: A Novel Poisoning Attack on Graph-Based Agent Memory via Relation-Channel Conflicts
by: Luo, Yang, et al.
Published: (2026)
AEGIS: White-Box Attack Path Generation using LLMs and Training Effectiveness Evaluation for Large-Scale Cyber Defence Exercises
by: Tung, Ivan K., et al.
Published: (2026)
SFIBA: Spatial-based Full-target Invisible Backdoor Attacks
by: Yin, Yangxu, et al.
Published: (2025)
Investigating Deep Watermark Security: An Adversarial Transferability Perspective
by: Qi, Biqing, et al.
Published: (2024)
New Wide-Net-Casting Jailbreak Attacks Risk Large Models
by: Xiang, Qiuchi, et al.
Published: (2026)
AutoDAN-Reasoning: Enhancing Strategies Exploration based Jailbreak Attacks with Test-Time Scaling
by: Liu, Xiaogeng, et al.
Published: (2025)
FFCBA: Feature-based Full-target Clean-label Backdoor Attacks
by: Yin, Yangxu, et al.
Published: (2025)
A Cross-Language Investigation into Jailbreak Attacks in Large Language Models
by: Li, Jie, et al.
Published: (2024)
Breaking PEFT Limitations: Leveraging Weak-to-Strong Knowledge Transfer for Backdoor Attacks in LLMs
by: Zhao, Shuai, et al.
Published: (2024)
Investigating White-Box Attacks for On-Device Models
by: Zhou, Mingyi, et al.
Published: (2024)
Is Monitoring Enough? Strategic Agent Selection For Stealthy Attack in Multi-Agent Discussions
by: Xiang, Qiuchi, et al.
Published: (2026)
Towards Effective, Stealthy, and Persistent Backdoor Attacks Targeting Graph Foundation Models
by: Luo, Jiayi, et al.
Published: (2025)
A Model Stealing Attack Against Multi-Exit Networks
by: Pan, Li, et al.
Published: (2023)
Not What You Asked For: Typographic Attacks in Household Robot Manipulation
by: Iranmanesh, Ali, et al.
Published: (2026)
Explainer-guided Targeted Adversarial Attacks against Binary Code Similarity Detection Models
by: Chen, Mingjie, et al.
Published: (2025)
LaSM: Layer-wise Scaling Mechanism for Defending Pop-up Attack on GUI Agents
by: Yan, Zihe, et al.
Published: (2025)
DualSentinel: A Lightweight Framework for Detecting Targeted Attacks in Black-box LLM via Dual Entropy Lull Pattern
by: Pang, Xiaoyi, et al.
Published: (2026)
A Comprehensive Study of Jailbreak Attack versus Defense for Large Language Models
by: Xu, Zihao, et al.
Published: (2024)
Generating Is Believing: Membership Inference Attacks against Retrieval-Augmented Generation
by: Li, Yuying, et al.
Published: (2024)
Delayed Backdoor Attacks: Exploring the Temporal Dimension as a New Attack Surface in Pre-Trained Models
by: Ding, Zikang, et al.
Published: (2026)
Medusa: Cross-Modal Transferable Adversarial Attacks on Multimodal Medical Retrieval-Augmented Generation
by: Shang, Yingjia, et al.
Published: (2025)
The Attack and Defense Landscape of Agentic AI: A Comprehensive Survey
by: Kim, Juhee, et al.
Published: (2026)
BadThink: Triggered Overthinking Attacks on Chain-of-Thought Reasoning in Large Language Models
by: Liu, Shuaitong, et al.
Published: (2025)
Sandcastles in the Storm: Revisiting the (Im)possibility of Strong Watermarking
by: Harel-Canada, Fabrice Y, et al.
Published: (2025)
GhostCite: A Large-Scale Analysis of Citation Validity in the Age of Large Language Models
by: Xu, Zuyao, et al.
Published: (2026)
Enhancing TinyML Security: Study of Adversarial Attack Transferability
by: Shah, Parin, et al.
Published: (2024)
Injection, Attack and Erasure: Revocable Backdoor Attacks via Machine Unlearning
by: Song, Baogang, et al.
Published: (2025)
MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents
by: Zhang, Dongsen, et al.
Published: (2025)
Red-teaming the Multimodal Reasoning: Jailbreaking Vision-Language Models via Cross-modal Entanglement Attacks
by: Yan, Yu, et al.
Published: (2026)
To Protect the LLM Agent Against the Prompt Injection Attack with Polymorphic Prompt
by: Wang, Zhilong, et al.
Published: (2025)
No Attack Required: Semantic Fuzzing for Specification Violations in Agent Skills
by: Li, Ying, et al.
Published: (2026)
EdgeShield: A Universal and Efficient Edge Computing Framework for Robust AI
by: Zhong, Duo, et al.
Published: (2024)