:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Li, Bryan, Bagchi, Sounak, Wang, Zizhan
Format:	Preprint
Published:	2024
Subjects:	Cryptography and Security Artificial Intelligence
Online Access:	https://arxiv.org/abs/2412.06181
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Quantum Properties Trojans (QuPTs) for Attacking Quantum Neural Networks
by: Bhowmik, Sounak, et al.
Published: (2025)

When Efficiency Backfires: Cascading LLMs Trigger Cascade Failure under Adversarial Attack
by: Sun, Zehan, et al.
Published: (2026)

Living Off the LLM: How LLMs Will Change Adversary Tactics
by: Oesch, Sean, et al.
Published: (2025)

CAVGAN: Unifying Jailbreak and Defense of LLMs via Generative Adversarial Attacks on their Internal Representations
by: Li, Xiaohu, et al.
Published: (2025)

From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection
by: Wang, Haowei, et al.
Published: (2024)

WARD: Adversarially Robust Defense of Web Agents Against Prompt Injections
by: Cao, Tri, et al.
Published: (2026)

Towards Assuring EU AI Act Compliance and Adversarial Robustness of LLMs
by: Momcilovic, Tomas Bueno, et al.
Published: (2024)

SEASONED: Semantic-Enhanced Self-Counterfactual Explainable Detection of Adversarial Exploiter Contracts
by: Ai, Xng, et al.
Published: (2025)

Quantum-Enhanced Adversarial Robustness in Artificial Intelligence
by: Sen, Jaydip
Published: (2026)

Reflect-Guard: Enhancing LLM Safeguards against Adversarial Prompts via Logical Self-Reflection
by: Lin, Lixing, et al.
Published: (2026)

Enhancing Jailbreak Attacks on LLMs via Persona Prompts
by: Zhang, Zheng, et al.
Published: (2025)

Scam Shield: Multi-Model Voting and Fine-Tuned LLMs Against Adversarial Attacks
by: Chang, Chen-Wei, et al.
Published: (2025)

AdapTools: Adaptive Tool-based Indirect Prompt Injection Attacks on Agentic LLMs
by: Wang, Che, et al.
Published: (2026)

Enhancing TinyML Security: Study of Adversarial Attack Transferability
by: Shah, Parin, et al.
Published: (2024)

FlipAttack: Jailbreak LLMs via Flipping
by: Liu, Yue, et al.
Published: (2024)

Enhancing Security and Privacy in Federated Learning using Low-Dimensional Update Representation and Proximity-Based Defense
by: Li, Wenjie, et al.
Published: (2024)

Joint Universal Adversarial Perturbations with Interpretations
by: Ning, Liang-bo, et al.
Published: (2024)

RECUR: Resource Exhaustion Attack via Recursive-Entropy Guided Counterfactual Utilization and Reflection
by: Wang, Ziwei, et al.
Published: (2026)

EXPLICATE: Enhancing Phishing Detection through Explainable AI and LLM-Powered Interpretability
by: Lim, Bryan, et al.
Published: (2025)

Injecting Falsehoods: Adversarial Man-in-the-Middle Attacks Undermining Factual Recall in LLMs
by: Fastowski, Alina, et al.
Published: (2025)

Adversarial Tuning: Defending Against Jailbreak Attacks for LLMs
by: Liu, Fan, et al.
Published: (2024)

DrLLM: Prompt-Enhanced Distributed Denial-of-Service Resistance Method with Large Language Models
by: Yin, Zhenyu, et al.
Published: (2024)

Recursive language models for jailbreak detection: a procedural defense for tool-augmented agents
by: Shavit, Doron
Published: (2026)

Enhancing Network Intrusion Detection Systems: A Multi-Layer Ensemble Approach to Mitigate Adversarial Attacks
by: Soltani, Nasim, et al.
Published: (2026)

Developing Assurance Cases for Adversarial Robustness and Regulatory Compliance in LLMs
by: Momcilovic, Tomas Bueno, et al.
Published: (2024)

NCCR: to Evaluate the Robustness of Neural Networks and Adversarial Examples
by: Pu, Shi, et al.
Published: (2025)

An explainable Recursive Feature Elimination to detect Advanced Persistent Threats using Random Forest classifier
by: Mutalib, Noor Hazlina Abdul, et al.
Published: (2025)

Bitstream Collisions in Neural Image Compression via Adversarial Perturbations
by: Madden, Jordan, et al.
Published: (2025)

Enhancing Source Code Security with LLMs: Demystifying The Challenges and Generating Reliable Repairs
by: Islam, Nafis Tanveer, et al.
Published: (2024)

Co-Evolutionary Multi-Modal Alignment via Structured Adversarial Evolution
by: Shi, Guoxin, et al.
Published: (2026)

Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs
by: Panfilov, Alexander, et al.
Published: (2026)

DiffAttack: Evasion Attacks Against Diffusion-Based Adversarial Purification
by: Kang, Mintong, et al.
Published: (2023)

Bidirectional Intention Inference Enhances LLMs' Defense Against Multi-Turn Jailbreak Attacks
by: Tong, Haibo, et al.
Published: (2025)

Differentiation-Based Extraction of Proprietary Data from Fine-Tuned LLMs
by: Li, Zongjie, et al.
Published: (2025)

Don't Click That: Teaching Web Agents to Resist Deceptive Interfaces
by: Zhang, Yilin, et al.
Published: (2026)

An Attack Method for Medical Insurance Claim Fraud Detection based on Generative Adversarial Network
by: Pang, Yining, et al.
Published: (2025)

Security-aware Semantic-driven ISAC via Paired Adversarial Residual Networks
by: Liu, Yu, et al.
Published: (2025)

MEASER: Malware embedding attacks on open-source LLMs
by: Tan, Ming, et al.
Published: (2025)

Cloud-based XAI Services for Assessing Open Repository Models Under Adversarial Attacks
by: Wang, Zerui, et al.
Published: (2024)

Enhancing Security in Deep Reinforcement Learning: A Comprehensive Survey on Adversarial Attacks and Defenses
by: Yichao, Wu, et al.
Published: (2025)