Saved in:
| Main Authors: | Heibel, John, Lowd, Daniel |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.11072 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Train to Defend: First Defense Against Cryptanalytic Neural Network Parameter Extraction Attacks
by: Kurian, Ashley, et al.
Published: (2025)
by: Kurian, Ashley, et al.
Published: (2025)
Evaluating the efficacy of LLM Safety Solutions : The Palit Benchmark Dataset
by: Palit, Sayon, et al.
Published: (2025)
by: Palit, Sayon, et al.
Published: (2025)
Biometrics Employing Neural Network
by: Bhuiyan, Sajjad
Published: (2024)
by: Bhuiyan, Sajjad
Published: (2024)
Guardians of the Web: The Evolution and Future of Website Information Security
by: Islam, Md Saiful, et al.
Published: (2025)
by: Islam, Md Saiful, et al.
Published: (2025)
Mitigating the Impact of Malware Evolution on API Sequence-based Windows Malware Detector
by: Wei, Xingyuan, et al.
Published: (2024)
by: Wei, Xingyuan, et al.
Published: (2024)
Eliminating Backdoors in Neural Code Models for Secure Code Understanding
by: Sun, Weisong, et al.
Published: (2024)
by: Sun, Weisong, et al.
Published: (2024)
Toward Intelligent and Secure Cloud: Large Language Model Empowered Proactive Defense
by: Zhou, Yuyang, et al.
Published: (2024)
by: Zhou, Yuyang, et al.
Published: (2024)
PromptSAM+: Malware Detection based on Prompt Segment Anything Model
by: Wei, Xingyuan, et al.
Published: (2024)
by: Wei, Xingyuan, et al.
Published: (2024)
Seeing Is Not Always Believing: Invisible Collision Attack and Defence on Pre-Trained Models
by: Deng, Minghang, et al.
Published: (2023)
by: Deng, Minghang, et al.
Published: (2023)
Impact of Phonetics on Speaker Identity in Adversarial Voice Attack
by: Dar, Daniyal Kabir, et al.
Published: (2025)
by: Dar, Daniyal Kabir, et al.
Published: (2025)
A Self-Improving Architecture for Dynamic Safety in Large Language Models
by: Slater, Tyler
Published: (2025)
by: Slater, Tyler
Published: (2025)
Temporal Attack Pattern Detection in Multi-Agent AI Workflows: An Open Framework for Training Trace-Based Security Models
by: Del Rosario, Ron F.
Published: (2025)
by: Del Rosario, Ron F.
Published: (2025)
Adversarial Attacks on Large Language Models Using Regularized Relaxation
by: Chacko, Samuel Jacob, et al.
Published: (2024)
by: Chacko, Samuel Jacob, et al.
Published: (2024)
PatchBlock: A Lightweight Defense Against Adversarial Patches for Embedded EdgeAI Devices
by: Chattopadhyay, Nandish, et al.
Published: (2026)
by: Chattopadhyay, Nandish, et al.
Published: (2026)
Mitigating Trojanized Prompt Chains in Educational LLM Use Cases: Experimental Findings and Detection Tool Design
by: Charles, Richard M., et al.
Published: (2025)
by: Charles, Richard M., et al.
Published: (2025)
Real AI Agents with Fake Memories: Fatal Context Manipulation Attacks on Web3 Agents
by: Patlan, Atharv Singh, et al.
Published: (2025)
by: Patlan, Atharv Singh, et al.
Published: (2025)
BadVLA: Towards Backdoor Attacks on Vision-Language-Action Models via Objective-Decoupled Optimization
by: Zhou, Xueyang, et al.
Published: (2025)
by: Zhou, Xueyang, et al.
Published: (2025)
Hacking, The Lazy Way: LLM Augmented Pentesting
by: Goyal, Dhruva, et al.
Published: (2024)
by: Goyal, Dhruva, et al.
Published: (2024)
MalPurifier: Enhancing Android Malware Detection with Adversarial Purification against Evasion Attacks
by: Zhou, Yuyang, et al.
Published: (2023)
by: Zhou, Yuyang, et al.
Published: (2023)
Differential Robustness in Transformer Language Models: Empirical Evaluation Under Adversarial Text Attacks
by: Gidatkar, Taniya, et al.
Published: (2025)
by: Gidatkar, Taniya, et al.
Published: (2025)
Orion: Fuzzing Workflow Automation
by: Bazalii, Max, et al.
Published: (2025)
by: Bazalii, Max, et al.
Published: (2025)
Efficient LLM Safety Evaluation through Multi-Agent Debate
by: Lin, Dachuan, et al.
Published: (2025)
by: Lin, Dachuan, et al.
Published: (2025)
An Intelligent Native Network Slicing Security Architecture Empowered by Federated Learning
by: Moreira, Rodrigo, et al.
Published: (2024)
by: Moreira, Rodrigo, et al.
Published: (2024)
Jailbreaking Attacks vs. Content Safety Filters: How Far Are We in the LLM Safety Arms Race?
by: Xin, Yuan, et al.
Published: (2025)
by: Xin, Yuan, et al.
Published: (2025)
Bypassing LLM Guardrails: An Empirical Analysis of Evasion Attacks against Prompt Injection and Jailbreak Detection Systems
by: Hackett, William, et al.
Published: (2025)
by: Hackett, William, et al.
Published: (2025)
The Quantum State Continuity Problem and Temporal Enforcement Against Fork Attacks
by: Ünsal, Samet
Published: (2025)
by: Ünsal, Samet
Published: (2025)
VFLAIR-LLM: A Comprehensive Framework and Benchmark for Split Learning of LLMs
by: Gu, Zixuan, et al.
Published: (2025)
by: Gu, Zixuan, et al.
Published: (2025)
Adversarial Feeds Steer LLM Agent Decisions Against Their Defaults
by: Usman, Rana Muhammad
Published: (2026)
by: Usman, Rana Muhammad
Published: (2026)
Privacy-preserving Universal Adversarial Defense for Black-box Models
by: Li, Qiao, et al.
Published: (2024)
by: Li, Qiao, et al.
Published: (2024)
Blind Spots in the Guard: How Domain-Camouflaged Injection Attacks Evade Detection in Multi-Agent LLM Systems
by: Pai, Aaditya
Published: (2026)
by: Pai, Aaditya
Published: (2026)
Is My Data in Your Retrieval Database? Membership Inference Attacks Against Retrieval Augmented Generation
by: Anderson, Maya, et al.
Published: (2024)
by: Anderson, Maya, et al.
Published: (2024)
Exploiting Latent Space Discontinuities for Building Universal LLM Jailbreaks and Data Extraction Attacks
by: Paim, Kayua Oleques, et al.
Published: (2025)
by: Paim, Kayua Oleques, et al.
Published: (2025)
Safeguarding Efficacy in Large Language Models: Evaluating Resistance to Human-Written and Algorithmic Adversarial Prompts
by: Downey-Webb, Tiarnaigh, et al.
Published: (2025)
by: Downey-Webb, Tiarnaigh, et al.
Published: (2025)
CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities
by: Zhu, Yuxuan, et al.
Published: (2025)
by: Zhu, Yuxuan, et al.
Published: (2025)
Power-Softmax: Towards Secure LLM Inference over Encrypted Data
by: Zimerman, Itamar, et al.
Published: (2024)
by: Zimerman, Itamar, et al.
Published: (2024)
Combating Phone Scams with LLM-based Detection: Where Do We Stand?
by: Shen, Zitong, et al.
Published: (2024)
by: Shen, Zitong, et al.
Published: (2024)
Enabling Transparent Cyber Threat Intelligence Combining Large Language Models and Domain Ontologies
by: Cotti, Luca, et al.
Published: (2025)
by: Cotti, Luca, et al.
Published: (2025)
Large Language Models are Autonomous Cyber Defenders
by: Castro, Sebastián R., et al.
Published: (2025)
by: Castro, Sebastián R., et al.
Published: (2025)
NatGVD: Natural Adversarial Example Attack towards Graph-based Vulnerability Detection
by: Rath, Avilash, et al.
Published: (2025)
by: Rath, Avilash, et al.
Published: (2025)
TrojanTime: Backdoor Attacks on Time Series Classification
by: Dong, Chang, et al.
Published: (2025)
by: Dong, Chang, et al.
Published: (2025)
Similar Items
-
Train to Defend: First Defense Against Cryptanalytic Neural Network Parameter Extraction Attacks
by: Kurian, Ashley, et al.
Published: (2025) -
Evaluating the efficacy of LLM Safety Solutions : The Palit Benchmark Dataset
by: Palit, Sayon, et al.
Published: (2025) -
Biometrics Employing Neural Network
by: Bhuiyan, Sajjad
Published: (2024) -
Guardians of the Web: The Evolution and Future of Website Information Security
by: Islam, Md Saiful, et al.
Published: (2025) -
Mitigating the Impact of Malware Evolution on API Sequence-based Windows Malware Detector
by: Wei, Xingyuan, et al.
Published: (2024)