| Main Authors: | Zhang, Bingxue, Gao, Yang, Zhu, Feida, Shen, Yanyan, Shi, Yang |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2603.14860 |
Similar Items
SALLIE: Safeguarding Against Latent Language & Image Exploits
by: Azov, Guy, et al.
Published: (2026)
Understanding Adversarial Transferability in Vision-Language Models for Autonomous Driving: A Cross-Architecture Analysis
by: Fernandez, David, et al.
Published: (2026)
RADEP: A Resilient Adaptive Defense Framework Against Model Extraction Attacks
by: Chakraborty, Amit, et al.
Published: (2025)
Cyber Defense Benchmark: Agentic Threat Hunting Evaluation for LLMs in SecOps
by: Chona, Alankrit, et al.
Published: (2026)
Multi-Agent Honeypot-Based Request-Response Context Dataset for Improved SQL Injection Detection Performance
by: Yu, Hao, et al.
Published: (2026)
Governance Architecture for Autonomous Agent Systems: Threats, Framework, and Engineering Practice
by: Ge, Yuxu
Published: (2026)
AegisShield: Democratizing Cyber Threat Modeling with Generative AI
by: Grofsky, Matthew
Published: (2025)
Illuminating the Black Box: Real-Time Monitoring of Backdoor Unlearning in CNNs via Explainable AI
by: Hoang, Tien Dat
Published: (2025)
Cybersecurity of Teleoperated Quadruped Robots: A Systematic Survey of Vulnerabilities, Threats, and Open Defense Gaps
by: Sabouri, Mohammad
Published: (2026)
MASH: Evading Black-Box AI-Generated Text Detectors via Style Humanization
by: Gu, Yongtong, et al.
Published: (2026)
Breaking the Illusion of Security via Interpretation: Interpretable Vision Transformer Systems under Attack
by: Abdukhamidov, Eldor, et al.
Published: (2025)
Security Attack and Defense Strategies for Autonomous Agent Frameworks: A Layered Review with OpenClaw as a Case Study
by: Xu, Luyao, et al.
Published: (2026)
Post-quantum Federated Learning: Secure And Scalable Threat Intelligence For Collaborative Cyber Defense
by: Nayak, Prabhudarshi, et al.
Published: (2026)
Scalable APT Malware Classification via Parallel Feature Extraction and GPU-Accelerated Learning
by: Subedar, Noah, et al.
Published: (2025)
Is My Data in Your Retrieval Database? Membership Inference Attacks Against Retrieval Augmented Generation
by: Anderson, Maya, et al.
Published: (2024)
Sensitivity Uncertainty Alignment in Large Language Models
by: Hiremath, Prakul Sunil, et al.
Published: (2026)
Countermind: A Multi-Layered Security Architecture for Large Language Models
by: Schwarz, Dominik
Published: (2025)
Adaptive Defense Orchestration for RAG: A Sentinel-Strategist Architecture against Multi-Vector Attacks
by: Pallerla, Pranav, et al.
Published: (2026)
Benchmarking Large Language Models for IoC Recovery under Adversarial Code Obfuscation and Encryption
by: Morales, Jaime, et al.
Published: (2026)
LLM Scalability Risk for Agentic-AI and Model Supply Chain Security
by: Ahi, Kiarash, et al.
Published: (2026)
SCAFDS: Edge-Feature Graph Attention for Interbank Fraud Detection with Attribution-Grounded SAR Generation
by: Uddin, Mohammad Nasir
Published: (2026)
Dr. Jekyll and Mr. Hyde: Two Faces of LLMs
by: Collu, Matteo Gioele, et al.
Published: (2023)
A Relevance Model for Threat-Centric Ranking of Cybersecurity Vulnerabilities
by: McCoy, Corren, et al.
Published: (2024)
Towards Low-Latency and Adaptive Ransomware Detection Using Contrastive Learning
by: Pan, Zhixin, et al.
Published: (2025)
Density-aware Sample-specific Attack
by: Wang, Qiyuan, et al.
Published: (2026)
ADMIn: Attacks on Dataset, Model and Input. A Threat Model for AI Based Software
by: Kumar, Vimal, et al.
Published: (2024)
Measuring Harmfulness of Computer-Using Agents
by: Tian, Aaron Xuxiang, et al.
Published: (2025)
From nuclear safety to LLM security: Applying non-probabilistic risk management strategies to build safe and secure LLM-powered systems
by: Gutfraind, Alexander, et al.
Published: (2025)
A Validated Prompt Bank for Malicious Code Generation: Separating Executable Weapons from Security Knowledge in 1,554 Consensus-Labeled Prompts
by: Young, Richard J., et al.
Published: (2026)
A High-Recall Cost-Sensitive Machine Learning Framework for Real-Time Online Banking Transaction Fraud Detection
by: R., Karthikeyan V., et al.
Published: (2026)
Cross-LLM Generalization of Behavioral Backdoor Detection in AI Agent Supply Chains
by: Sanna, Arun Chowdary
Published: (2025)
Phishing Detection System: An Ensemble Approach Using Character-Level CNN and Feature Engineering
by: Dubey, Rudra, et al.
Published: (2025)
An Agentic Multi-Agent Architecture for Cybersecurity Risk Management
by: Gupta, Ravish, et al.
Published: (2026)
Benchmarking Autonomous Agents against Temporal, Spatial, and Semantic Evasions
by: Ma, Jianan, et al.
Published: (2026)
AIRTBench: Measuring Autonomous AI Red Teaming Capabilities in Language Models
by: Dawson, Ads, et al.
Published: (2025)
The Automation Advantage in AI Red Teaming
by: Mulla, Rob, et al.
Published: (2025)
Static Attribution of Android Residential Proxy Malware Using Graph Kernels
by: Clark, Peter, et al.
Published: (2026)
Adversarial Feeds Steer LLM Agent Decisions Against Their Defaults
by: Usman, Rana Muhammad
Published: (2026)
$δ$-STEAL: LLM Stealing Attack with Local Differential Privacy
by: Dang, Kieu, et al.
Published: (2025)
Activation Differences Reveal Backdoors: A Comparison of SAE Architectures
by: Kumar, Sachin
Published: (2026)