| Main Authors: | Buonocore, Tommaso Mario, Parimbelli, Enea |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.13581 |
Similar Items
$δ$-STEAL: LLM Stealing Attack with Local Differential Privacy
by: Dang, Kieu, et al.
Published: (2025)
Breaking to Build: A Threat Model of Prompt-Based Attacks for Securing LLMs
by: Hill, Brennen, et al.
Published: (2025)
BreakFun: Jailbreaking LLMs via Schema Exploitation
by: Oskooei, Amirkia Rafiei, et al.
Published: (2025)
"Abuse Risks are Often Inherent to Product Features": Exploring AI Vendors' Bug Bounty and Responsible Disclosure Policies
by: Piao, Yangheran, et al.
Published: (2025)
Semantically Guided Adversarial Testing of Vision Models Using Language Models
by: Filus, Katarzyna, et al.
Published: (2025)
Revisiting Third-Party Library Detection: A Ground Truth Dataset and Its Implications Across Security Tasks
by: Gu, Jintao, et al.
Published: (2025)
Explainable Attention-Based LSTM Framework for Early Detection of AI-Assisted Ransomware via File System Behavioral Analysis
by: Nayak, Prabhudarshi, et al.
Published: (2026)
SCARA: A Semantics-Constrained Autonomous Remediation Agent for Opaque Industrial Software Vulnerabilities
by: Ning, Bowei, et al.
Published: (2026)
Systematic Capability Benchmarking of Frontier Large Language Models for Offensive Cyber Tasks
by: Merves, Tyler H., et al.
Published: (2026)
A Method for Quantifying Human Risk and a Blueprint for LLM Integration
by: Canale, Giuseppe
Published: (2025)
Sola-Visibility-ISPM: Benchmarking Agentic AI for Identity Security Posture Management Visibility
by: Engelberg, Gal, et al.
Published: (2026)
Tool Receipts, Not Zero-Knowledge Proofs: Practical Hallucination Detection for AI Agents
by: Basu, Abhinaba
Published: (2026)
Exploiting Web Search Tools of AI Agents for Data Exfiltration
by: Rall, Dennis, et al.
Published: (2025)
PACT: Reducing Alert Fatigue in Low-Prevalence SOC Streams with Triggered Active Learning
by: Ndichu, Samuel, et al.
Published: (2026)
On the Vulnerability of Applying Retrieval-Augmented Generation within Knowledge-Intensive Application Domains
by: Xian, Xun, et al.
Published: (2024)
Watermarking for AI Content Detection: A Review on Text, Visual, and Audio Modalities
by: Cao, Lele
Published: (2025)
Safety, Security, and Cognitive Risks in World Models
by: Parmar, Manoj
Published: (2026)
DRIFT: Drift-Resilient Invariant-Feature Transformer for DGA Detection
by: Lee, Chaeyoung, et al.
Published: (2026)
Retrieval Augmented Classification for Confidential Documents
by: Chang, Yeseul E., et al.
Published: (2026)
Operationalizing Cybersecurity Governance for Mitigation Planning with Attack-Path Modeling and Reinforcement Learning
by: Huff, Philip, et al.
Published: (2026)
Can AI Keep a Secret? Contextual Integrity Verification: A Provable Security Architecture for LLMs
by: Gupta, Aayush
Published: (2025)
LegalGuardian: A Privacy-Preserving Framework for Secure Integration of Large Language Models in Legal Practice
by: Demir, M. Mikail, et al.
Published: (2025)
A Dual-Path Generative Framework for Zero-Day Fraud Detection in Banking Systems
by: Ismail, Nasim Abdirahman, et al.
Published: (2026)
Cross-Domain Malware Detection via Probability-Level Fusion of Lightweight Gradient Boosting Models
by: Mohamed, Omar Khalid Ali
Published: (2025)
Hardening x402: PII-Safe Agentic Payments via Pre-Execution Metadata Filtering
by: Stantchev, Vladimir
Published: (2026)
Expanding the Attack Scenarios of SAE J1939: A Comprehensive Analysis of Established and Novel Vulnerabilities in Transport Protocol
by: Lee, Hwejae, et al.
Published: (2024)
ConvXformer: Differentially Private Hybrid ConvNeXt-Transformer for Inertial Navigation
by: Tariq, Omer, et al.
Published: (2025)
Exploratory Analysis of Cyberattack Patterns on E-Commerce Platforms Using Statistical Methods
by: Adeniya, Fatimo Adenike
Published: (2025)
An Organization-Scoped LLM Agent Runtime Architecture for Regulated Cybersecurity Operations
by: Fatouros, George, et al.
Published: (2026)
Improving LLM Agents with Reinforcement Learning on Cryptographic CTF Challenges
by: Muzsai, Lajos, et al.
Published: (2025)
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing
by: Muzsai, Lajos, et al.
Published: (2024)
MASH: Evading Black-Box AI-Generated Text Detectors via Style Humanization
by: Gu, Yongtong, et al.
Published: (2026)
Benchmarking Large Language Models for IoC Recovery under Adversarial Code Obfuscation and Encryption
by: Morales, Jaime, et al.
Published: (2026)
Standardized Threat Taxonomy for AI Security, Governance, and Regulatory Compliance
by: Huwyler, Hernan
Published: (2025)
Identity Deepfake Threats to Biometric Authentication Systems: Public and Expert Perspectives
by: He, Shijing, et al.
Published: (2025)
The Age of Sensorial Zero Trust: Why We Can No Longer Trust Our Senses
by: Xavier, Fabio Correa
Published: (2025)
Securing the Dark Matter: A Semantic-Enhanced Neuro-Symbolic Framework for Supply Chain Analysis of Opaque Industrial Software
by: Ning, Bowei, et al.
Published: (2026)
h4rm3l: A language for Composable Jailbreak Attack Synthesis
by: Doumbouya, Moussa Koulako Bala, et al.
Published: (2024)
Quantum Machine Learning for Cyber-Physical Anomaly Detection in Unmanned Aerial Vehicles: A Leakage-Free Evaluation with Proxy-Audited Feature Sets
by: Paredes, Carlos A. Durán, et al.
Published: (2026)
When the Agent Is the Adversary: Architectural Requirements for Agentic AI Containment After the April 2026 Frontier Model Escape
by: Mitchell, Richard Joseph
Published: (2026)