| Main Authors: | Li, Bryan; Bagchi, Sounak; Wang, Zizhan |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.06181 |
Similar Items
Quantum Properties Trojans (QuPTs) for Attacking Quantum Neural Networks
by: Bhowmik, Sounak, et al.
Published: (2025)
When Efficiency Backfires: Cascading LLMs Trigger Cascade Failure under Adversarial Attack
by: Sun, Zehan, et al.
Published: (2026)
Living Off the LLM: How LLMs Will Change Adversary Tactics
by: Oesch, Sean, et al.
Published: (2025)
CAVGAN: Unifying Jailbreak and Defense of LLMs via Generative Adversarial Attacks on their Internal Representations
by: Li, Xiaohu, et al.
Published: (2025)
From Allies to Adversaries: Manipulating LLM Tool-Calling through Adversarial Injection
by: Wang, Haowei, et al.
Published: (2024)
WARD: Adversarially Robust Defense of Web Agents Against Prompt Injections
by: Cao, Tri, et al.
Published: (2026)
Towards Assuring EU AI Act Compliance and Adversarial Robustness of LLMs
by: Momcilovic, Tomas Bueno, et al.
Published: (2024)
SEASONED: Semantic-Enhanced Self-Counterfactual Explainable Detection of Adversarial Exploiter Contracts
by: Ai, Xng, et al.
Published: (2025)
Quantum-Enhanced Adversarial Robustness in Artificial Intelligence
by: Sen, Jaydip
Published: (2026)
Reflect-Guard: Enhancing LLM Safeguards against Adversarial Prompts via Logical Self-Reflection
by: Lin, Lixing, et al.
Published: (2026)
Enhancing Jailbreak Attacks on LLMs via Persona Prompts
by: Zhang, Zheng, et al.
Published: (2025)
Scam Shield: Multi-Model Voting and Fine-Tuned LLMs Against Adversarial Attacks
by: Chang, Chen-Wei, et al.
Published: (2025)
AdapTools: Adaptive Tool-based Indirect Prompt Injection Attacks on Agentic LLMs
by: Wang, Che, et al.
Published: (2026)
Enhancing TinyML Security: Study of Adversarial Attack Transferability
by: Shah, Parin, et al.
Published: (2024)
FlipAttack: Jailbreak LLMs via Flipping
by: Liu, Yue, et al.
Published: (2024)
Enhancing Security and Privacy in Federated Learning using Low-Dimensional Update Representation and Proximity-Based Defense
by: Li, Wenjie, et al.
Published: (2024)
Joint Universal Adversarial Perturbations with Interpretations
by: Ning, Liang-bo, et al.
Published: (2024)
RECUR: Resource Exhaustion Attack via Recursive-Entropy Guided Counterfactual Utilization and Reflection
by: Wang, Ziwei, et al.
Published: (2026)
EXPLICATE: Enhancing Phishing Detection through Explainable AI and LLM-Powered Interpretability
by: Lim, Bryan, et al.
Published: (2025)
Injecting Falsehoods: Adversarial Man-in-the-Middle Attacks Undermining Factual Recall in LLMs
by: Fastowski, Alina, et al.
Published: (2025)
Adversarial Tuning: Defending Against Jailbreak Attacks for LLMs
by: Liu, Fan, et al.
Published: (2024)
DrLLM: Prompt-Enhanced Distributed Denial-of-Service Resistance Method with Large Language Models
by: Yin, Zhenyu, et al.
Published: (2024)
Recursive language models for jailbreak detection: a procedural defense for tool-augmented agents
by: Shavit, Doron
Published: (2026)
Enhancing Network Intrusion Detection Systems: A Multi-Layer Ensemble Approach to Mitigate Adversarial Attacks
by: Soltani, Nasim, et al.
Published: (2026)
Developing Assurance Cases for Adversarial Robustness and Regulatory Compliance in LLMs
by: Momcilovic, Tomas Bueno, et al.
Published: (2024)
NCCR: to Evaluate the Robustness of Neural Networks and Adversarial Examples
by: Pu, Shi, et al.
Published: (2025)
An explainable Recursive Feature Elimination to detect Advanced Persistent Threats using Random Forest classifier
by: Mutalib, Noor Hazlina Abdul, et al.
Published: (2025)
Bitstream Collisions in Neural Image Compression via Adversarial Perturbations
by: Madden, Jordan, et al.
Published: (2025)
Enhancing Source Code Security with LLMs: Demystifying The Challenges and Generating Reliable Repairs
by: Islam, Nafis Tanveer, et al.
Published: (2024)
Co-Evolutionary Multi-Modal Alignment via Structured Adversarial Evolution
by: Shi, Guoxin, et al.
Published: (2026)
Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs
by: Panfilov, Alexander, et al.
Published: (2026)
DiffAttack: Evasion Attacks Against Diffusion-Based Adversarial Purification
by: Kang, Mintong, et al.
Published: (2023)
Bidirectional Intention Inference Enhances LLMs' Defense Against Multi-Turn Jailbreak Attacks
by: Tong, Haibo, et al.
Published: (2025)
Differentiation-Based Extraction of Proprietary Data from Fine-Tuned LLMs
by: Li, Zongjie, et al.
Published: (2025)
Don't Click That: Teaching Web Agents to Resist Deceptive Interfaces
by: Zhang, Yilin, et al.
Published: (2026)
An Attack Method for Medical Insurance Claim Fraud Detection based on Generative Adversarial Network
by: Pang, Yining, et al.
Published: (2025)
Security-aware Semantic-driven ISAC via Paired Adversarial Residual Networks
by: Liu, Yu, et al.
Published: (2025)
MEASER: Malware embedding attacks on open-source LLMs
by: Tan, Ming, et al.
Published: (2025)
Cloud-based XAI Services for Assessing Open Repository Models Under Adversarial Attacks
by: Wang, Zerui, et al.
Published: (2024)
Enhancing Security in Deep Reinforcement Learning: A Comprehensive Survey on Adversarial Attacks and Defenses
by: Yichao, Wu, et al.
Published: (2025)