Saved in:
| Main Authors: | Goyal, Dhruva, Subramanian, Sitaraman, Peela, Aditya, Shetty, Nisha P. |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.09493 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing
by: Muzsai, Lajos, et al.
Published: (2024)
by: Muzsai, Lajos, et al.
Published: (2024)
Mitigating Trojanized Prompt Chains in Educational LLM Use Cases: Experimental Findings and Detection Tool Design
by: Charles, Richard M., et al.
Published: (2025)
by: Charles, Richard M., et al.
Published: (2025)
Specification and Evaluation of Multi-Agent LLM Systems -- Prototype and Cybersecurity Applications
by: Härer, Felix
Published: (2025)
by: Härer, Felix
Published: (2025)
CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities
by: Zhu, Yuxuan, et al.
Published: (2025)
by: Zhu, Yuxuan, et al.
Published: (2025)
STRIATUM-CTF: A Protocol-Driven Agentic Framework for General-Purpose CTF Solving
by: Hugglestone, James, et al.
Published: (2026)
by: Hugglestone, James, et al.
Published: (2026)
MalPurifier: Enhancing Android Malware Detection with Adversarial Purification against Evasion Attacks
by: Zhou, Yuyang, et al.
Published: (2023)
by: Zhou, Yuyang, et al.
Published: (2023)
Enhancing Multi-Criteria Decision Analysis with AI: Integrating Analytic Hierarchy Process and GPT-4 for Automated Decision Support
by: Svoboda, Igor, et al.
Published: (2024)
by: Svoboda, Igor, et al.
Published: (2024)
Improving LLM Agents with Reinforcement Learning on Cryptographic CTF Challenges
by: Muzsai, Lajos, et al.
Published: (2025)
by: Muzsai, Lajos, et al.
Published: (2025)
SMSI: System Model Security Inference: Automated Threat Modeling for Cyber-Physical Systems
by: Radaideh, RoÝah, et al.
Published: (2026)
by: Radaideh, RoÝah, et al.
Published: (2026)
SCAFDS: Edge-Feature Graph Attention for Interbank Fraud Detection with Attribution-Grounded SAR Generation
by: Uddin, Mohammad Nasir
Published: (2026)
by: Uddin, Mohammad Nasir
Published: (2026)
Evaluating the robustness of adversarial defenses in malware detection systems
by: Jafari, Mostafa, et al.
Published: (2025)
by: Jafari, Mostafa, et al.
Published: (2025)
Type-Checked Compliance: Deterministic Guardrails for Agentic Financial Systems Using Lean 4 Theorem Proving
by: Rashie, Devakh, et al.
Published: (2026)
by: Rashie, Devakh, et al.
Published: (2026)
Preliminary study on artificial intelligence methods for cybersecurity threat detection in computer networks based on raw data packets
by: Ogonowski, Aleksander, et al.
Published: (2024)
by: Ogonowski, Aleksander, et al.
Published: (2024)
AutoPentester: An LLM Agent-based Framework for Automated Pentesting
by: Ginige, Yasod, et al.
Published: (2025)
by: Ginige, Yasod, et al.
Published: (2025)
Tool Receipts, Not Zero-Knowledge Proofs: Practical Hallucination Detection for AI Agents
by: Basu, Abhinaba
Published: (2026)
by: Basu, Abhinaba
Published: (2026)
X-NegoBox: An Explainable Privacy-Budget Negotiation Framework for Secure Peer-to-Peer Energy Data Exchange
by: Sengupta, Poushali, et al.
Published: (2026)
by: Sengupta, Poushali, et al.
Published: (2026)
Scam Detection for Ethereum Smart Contracts: Leveraging Graph Representation Learning for Secure Blockchain
by: Jin, Yihong, et al.
Published: (2024)
by: Jin, Yihong, et al.
Published: (2024)
MaPPing Your Model: Assessing the Impact of Adversarial Attacks on LLM-based Programming Assistants
by: Heibel, John, et al.
Published: (2024)
by: Heibel, John, et al.
Published: (2024)
Adversarial Attacks and Defenses in Fault Detection and Diagnosis: A Comprehensive Benchmark on the Tennessee Eastman Process
by: Pozdnyakov, Vitaliy, et al.
Published: (2024)
by: Pozdnyakov, Vitaliy, et al.
Published: (2024)
MedBeads: An Agent-Native, Immutable Data Substrate for Trustworthy Medical AI
by: Nakajima, Takahito
Published: (2026)
by: Nakajima, Takahito
Published: (2026)
Large Language Models are Easily Confused: A Quantitative Metric, Security Implications and Typological Analysis
by: Chen, Yiyi, et al.
Published: (2024)
by: Chen, Yiyi, et al.
Published: (2024)
Efficient LLM Safety Evaluation through Multi-Agent Debate
by: Lin, Dachuan, et al.
Published: (2025)
by: Lin, Dachuan, et al.
Published: (2025)
Large Language Models for Agentic NetOps and AIOps: Architectures, Evaluation, and Safety
by: Bilal, Muhammad, et al.
Published: (2026)
by: Bilal, Muhammad, et al.
Published: (2026)
VFLAIR-LLM: A Comprehensive Framework and Benchmark for Split Learning of LLMs
by: Gu, Zixuan, et al.
Published: (2025)
by: Gu, Zixuan, et al.
Published: (2025)
AutoPentest: Enhancing Vulnerability Management With Autonomous LLM Agents
by: Henke, Julius
Published: (2025)
by: Henke, Julius
Published: (2025)
Who Governs the Machine? A Machine Identity Governance Taxonomy (MIGT) for AI Systems Operating Across Enterprise and Geopolitical Boundaries
by: Kurtz, Andrew, et al.
Published: (2026)
by: Kurtz, Andrew, et al.
Published: (2026)
SafetyDrift: Predicting When AI Agents Cross the Line Before They Actually Do
by: Dhodapkar, Aditya, et al.
Published: (2026)
by: Dhodapkar, Aditya, et al.
Published: (2026)
ARACNE: An LLM-Based Autonomous Shell Pentesting Agent
by: Nieponice, Tomas, et al.
Published: (2025)
by: Nieponice, Tomas, et al.
Published: (2025)
Tatemae: Detecting Alignment Faking via Tool Selection in LLMs
by: Leonesi, Matteo, et al.
Published: (2026)
by: Leonesi, Matteo, et al.
Published: (2026)
Developing a Strong CPS Defender: An Evolutionary Approach
by: Hu, Qingyuan, et al.
Published: (2025)
by: Hu, Qingyuan, et al.
Published: (2025)
A Robust Federated Learning Approach for Combating Attacks Against IoT Systems Under non-IID Challenges
by: Gad, Eyad, et al.
Published: (2025)
by: Gad, Eyad, et al.
Published: (2025)
Collaborative Approaches to Enhancing Smart Vehicle Cybersecurity by AI-Driven Threat Detection
by: Ali, Syed Atif, et al.
Published: (2024)
by: Ali, Syed Atif, et al.
Published: (2024)
IntentMiner: Intent Inversion Attack via Tool Call Analysis in the Model Context Protocol
by: Yao, Yunhao, et al.
Published: (2025)
by: Yao, Yunhao, et al.
Published: (2025)
SecureV2X: An Efficient and Privacy-Preserving System for Vehicle-to-Everything (V2X) Applications
by: Lee, Joshua, et al.
Published: (2025)
by: Lee, Joshua, et al.
Published: (2025)
From nuclear safety to LLM security: Applying non-probabilistic risk management strategies to build safe and secure LLM-powered systems
by: Gutfraind, Alexander, et al.
Published: (2025)
by: Gutfraind, Alexander, et al.
Published: (2025)
h4rm3l: A language for Composable Jailbreak Attack Synthesis
by: Doumbouya, Moussa Koulako Bala, et al.
Published: (2024)
by: Doumbouya, Moussa Koulako Bala, et al.
Published: (2024)
Inverting Cryptographic Hash Functions via Cube-and-Conquer
by: Zaikin, Oleg
Published: (2022)
by: Zaikin, Oleg
Published: (2022)
JavelinGuard: Low-Cost Transformer Architectures for LLM Security
by: Datta, Yash, et al.
Published: (2025)
by: Datta, Yash, et al.
Published: (2025)
CANAL -- Cyber Activity News Alerting Language Model: Empirical Approach vs. Expensive LLM
by: Patel, Urjitkumar, et al.
Published: (2024)
by: Patel, Urjitkumar, et al.
Published: (2024)
PentestMCP: A Toolkit for Agentic Penetration Testing
by: Ezetta, Zachary, et al.
Published: (2025)
by: Ezetta, Zachary, et al.
Published: (2025)
Similar Items
-
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing
by: Muzsai, Lajos, et al.
Published: (2024) -
Mitigating Trojanized Prompt Chains in Educational LLM Use Cases: Experimental Findings and Detection Tool Design
by: Charles, Richard M., et al.
Published: (2025) -
Specification and Evaluation of Multi-Agent LLM Systems -- Prototype and Cybersecurity Applications
by: Härer, Felix
Published: (2025) -
CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities
by: Zhu, Yuxuan, et al.
Published: (2025) -
STRIATUM-CTF: A Protocol-Driven Agentic Framework for General-Purpose CTF Solving
by: Hugglestone, James, et al.
Published: (2026)