Saved in:
| Main Authors: | Moulton, Richard Helder, O'Brien, Austin, Hastings, John D. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.15081 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Predicting Known Vulnerabilities from Attack Descriptions Using Sentence Transformers
by: Othman, Refat
Published: (2026)
by: Othman, Refat
Published: (2026)
Autonomous Penetration Testing: Solving Capture-the-Flag Challenges with LLMs
by: Bakker, Isabelle, et al.
Published: (2025)
by: Bakker, Isabelle, et al.
Published: (2025)
SBASH: a Framework for Designing and Evaluating RAG vs. Prompt-Tuned LLM Honeypots
by: Adebimpe, Adetayo, et al.
Published: (2025)
by: Adebimpe, Adetayo, et al.
Published: (2025)
Code as a Weapon: A Consensus-Labeled Prompt Bank for Measuring Coding-Model Compliance with Malicious-Code Requests
by: Young, Richard J., et al.
Published: (2026)
by: Young, Richard J., et al.
Published: (2026)
Towards Agentic Investigation of Security Alerts
by: Eilertsen, Even, et al.
Published: (2026)
by: Eilertsen, Even, et al.
Published: (2026)
AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models
by: Dawson, Ads, et al.
Published: (2025)
by: Dawson, Ads, et al.
Published: (2025)
The Automation Advantage in AI Red Teaming
by: Mulla, Rob, et al.
Published: (2025)
by: Mulla, Rob, et al.
Published: (2025)
Detecting Prompt Injection Attacks Against Application Using Classifiers
by: Shaheer, Safwan, et al.
Published: (2025)
by: Shaheer, Safwan, et al.
Published: (2025)
Beyond the Benchmark: Innovative Defenses Against Prompt Injection Attacks
by: Shaheer, Safwan, et al.
Published: (2025)
by: Shaheer, Safwan, et al.
Published: (2025)
Hidden in Memory: Sleeper Memory Poisoning in LLM Agents
by: Pulipaka, Sidharth, et al.
Published: (2026)
by: Pulipaka, Sidharth, et al.
Published: (2026)
Safeguarding Virtual Healthcare: A Novel Attacker-Centric Model for Data Security and Privacy
by: Herath, Suvineetha, et al.
Published: (2024)
by: Herath, Suvineetha, et al.
Published: (2024)
Binary-30K: A Heterogeneous Dataset for Deep Learning in Binary Analysis and Malware Detection
by: Bommarito II, Michael J.
Published: (2025)
by: Bommarito II, Michael J.
Published: (2025)
AgentSentry: Mitigating Indirect Prompt Injection in LLM Agents via Temporal Causal Diagnostics and Context Purification
by: Zhang, Tian, et al.
Published: (2026)
by: Zhang, Tian, et al.
Published: (2026)
Privately Fine-Tuned LLMs Preserve Temporal Dynamics in Tabular Data
by: Rosenblatt, Lucas, et al.
Published: (2026)
by: Rosenblatt, Lucas, et al.
Published: (2026)
DP-2Stage: Adapting Language Models as Differentially Private Tabular Data Generators
by: Afonja, Tejumade, et al.
Published: (2024)
by: Afonja, Tejumade, et al.
Published: (2024)
Security Considerations for Multi-agent Systems
by: Nguyen, Tam, et al.
Published: (2026)
by: Nguyen, Tam, et al.
Published: (2026)
Refusal Evaluation in Coding LLMs and Code Agents: A Systematic Review of Thirteen Malicious-Code Prompt Corpora (2023-2025)
by: Young, Richard J., et al.
Published: (2026)
by: Young, Richard J., et al.
Published: (2026)
AegisShield: Democratizing Cyber Threat Modeling with Generative AI
by: Grofsky, Matthew
Published: (2025)
by: Grofsky, Matthew
Published: (2025)
Evaluating the Reliability of Digital Forensic Evidence Discovered by Large Language Model: A Case Study
by: Khatiwala, Jeel Piyushkumar, et al.
Published: (2026)
by: Khatiwala, Jeel Piyushkumar, et al.
Published: (2026)
Multilingual AI-Driven Password Strength Estimation with Similarity-Based Detection
by: Palaniappan, Nikitha M., et al.
Published: (2026)
by: Palaniappan, Nikitha M., et al.
Published: (2026)
Can Safety Fine-Tuning Be More Principled? Lessons Learned from Cybersecurity
by: Williams-King, David, et al.
Published: (2025)
by: Williams-King, David, et al.
Published: (2025)
Governance Architecture for Autonomous Agent Systems: Threats, Framework, and Engineering Practice
by: Ge, Yuxu
Published: (2026)
by: Ge, Yuxu
Published: (2026)
MEMSAD: Gradient-Coupled Anomaly Detection for Memory Poisoning in Retrieval-Augmented Agents
by: Gowda, Ishrith
Published: (2026)
by: Gowda, Ishrith
Published: (2026)
AI Bill of Materials and Beyond: Systematizing Security Assurance through the AI Risk Scanning (AIRS) Framework
by: Nathanson, Samuel, et al.
Published: (2025)
by: Nathanson, Samuel, et al.
Published: (2025)
Continuous Discovery of Vulnerabilities in LLM Serving Systems with Fuzzing
by: Zhao, Yunze, et al.
Published: (2026)
by: Zhao, Yunze, et al.
Published: (2026)
Secure and Scalable Blockchain Voting: A Comparative Framework and the Role of Large Language Models
by: Kiashemshaki, Kiana, et al.
Published: (2025)
by: Kiashemshaki, Kiana, et al.
Published: (2025)
From Attack Descriptions to Vulnerabilities: A Sentence Transformer-Based Approach
by: Othman, Refat, et al.
Published: (2025)
by: Othman, Refat, et al.
Published: (2025)
A Secure, Manifest-Based Framework for Delegated Privilege Promotion
by: Chowdhury, Rajarshi, et al.
Published: (2026)
by: Chowdhury, Rajarshi, et al.
Published: (2026)
Attacking interpretable NLP systems
by: Abdukhamidov, Eldor, et al.
Published: (2025)
by: Abdukhamidov, Eldor, et al.
Published: (2025)
Toward Secure and Compliant AI: Organizational Standards and Protocols for NLP Model Lifecycle Management
by: Arora, Sunil, et al.
Published: (2025)
by: Arora, Sunil, et al.
Published: (2025)
Securing Agentic AI Systems -- A Multilayer Security Framework
by: Arora, Sunil, et al.
Published: (2025)
by: Arora, Sunil, et al.
Published: (2025)
Confronting the Reproducibility Crisis: A Case Study of Challenges in Cybersecurity AI
by: Moulton, Richard H., et al.
Published: (2024)
by: Moulton, Richard H., et al.
Published: (2024)
CEKER: A Generalizable LLM Framework for Literature Analysis with a Case Study in Unikernel Security
by: Wollman, Alex, et al.
Published: (2024)
by: Wollman, Alex, et al.
Published: (2024)
Can AI Keep a Secret? Contextual Integrity Verification: A Provable Security Architecture for LLMs
by: Gupta, Aayush
Published: (2025)
by: Gupta, Aayush
Published: (2025)
Hardening the OSv Unikernel with Efficient Address Randomization: Design and Performance Evaluation
by: Wollman, Alex, et al.
Published: (2026)
by: Wollman, Alex, et al.
Published: (2026)
ROI: A method for identifying organizations receiving personal data
by: Rodriguez, David, et al.
Published: (2022)
by: Rodriguez, David, et al.
Published: (2022)
Impact of Data Snooping on Deep Learning Models for Locating Vulnerabilities in Lifted Code
by: McCully, Gary A., et al.
Published: (2024)
by: McCully, Gary A., et al.
Published: (2024)
IDFace: Face Template Protection for Efficient and Secure Identification
by: Kim, Sunpill, et al.
Published: (2025)
by: Kim, Sunpill, et al.
Published: (2025)
LLM Honeypot: Leveraging Large Language Models as Advanced Interactive Honeypot Systems
by: Otal, Hakan T., et al.
Published: (2024)
by: Otal, Hakan T., et al.
Published: (2024)
Comparing Unidirectional, Bidirectional, and Word2vec Models for Discovering Vulnerabilities in Compiled Lifted Code
by: McCully, Gary A., et al.
Published: (2024)
by: McCully, Gary A., et al.
Published: (2024)
Similar Items
-
Predicting Known Vulnerabilities from Attack Descriptions Using Sentence Transformers
by: Othman, Refat
Published: (2026) -
Autonomous Penetration Testing: Solving Capture-the-Flag Challenges with LLMs
by: Bakker, Isabelle, et al.
Published: (2025) -
SBASH: a Framework for Designing and Evaluating RAG vs. Prompt-Tuned LLM Honeypots
by: Adebimpe, Adetayo, et al.
Published: (2025) -
Code as a Weapon: A Consensus-Labeled Prompt Bank for Measuring Coding-Model Compliance with Malicious-Code Requests
by: Young, Richard J., et al.
Published: (2026) -
Towards Agentic Investigation of Security Alerts
by: Eilertsen, Even, et al.
Published: (2026)