Saved in:
| Main Authors: | Happe, Andreas, Cito, Jürgen |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.04227 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Can LLMs Hack Enterprise Networks? -- Replicated Computational Results (RCR) Report
by: Happe, Andreas, et al.
Published: (2026)
by: Happe, Andreas, et al.
Published: (2026)
On the Surprising Efficacy of LLMs for Penetration-Testing
by: Happe, Andreas, et al.
Published: (2025)
by: Happe, Andreas, et al.
Published: (2025)
Ethics Statements in Autonomous Penetration-Testing Agent Research
by: Happe, Andreas, et al.
Published: (2025)
by: Happe, Andreas, et al.
Published: (2025)
Cochise: A Reference Harness for Autonomous Penetration Testing
by: Happe, Andreas, et al.
Published: (2026)
by: Happe, Andreas, et al.
Published: (2026)
Got Root? A Linux Priv-Esc Benchmark
by: Happe, Andreas, et al.
Published: (2024)
by: Happe, Andreas, et al.
Published: (2024)
LLMs as Hackers: Autonomous Linux Privilege Escalation Attacks
by: Happe, Andreas, et al.
Published: (2023)
by: Happe, Andreas, et al.
Published: (2023)
Benchmarking Practices in LLM-driven Offensive Security: Testbeds, Metrics, and Experiment Design
by: Happe, Andreas, et al.
Published: (2025)
by: Happe, Andreas, et al.
Published: (2025)
Enhancing Linux Privilege Escalation Attack Capabilities of Local LLM Agents
by: Probst, Benjamin, et al.
Published: (2026)
by: Probst, Benjamin, et al.
Published: (2026)
Post-Training Local LLM Agents for Linux Privilege Escalation with Verifiable Rewards
by: Normann, Philipp, et al.
Published: (2026)
by: Normann, Philipp, et al.
Published: (2026)
Language Models Can Autonomously Hack and Self-Replicate
by: Air, Alena, et al.
Published: (2026)
by: Air, Alena, et al.
Published: (2026)
STRisk: A Socio-Technical Approach to Assess Hacking Breaches Risk
by: Hammouchi, Hicham, et al.
Published: (2024)
by: Hammouchi, Hicham, et al.
Published: (2024)
After the Breach: Incident Response within Enterprises
by: Rao, Sumanth
Published: (2024)
by: Rao, Sumanth
Published: (2024)
Enterprise Security Incident Analysis and Countermeasures Based on the T-Mobile Data Breach
by: Cui, Zhuohan, et al.
Published: (2025)
by: Cui, Zhuohan, et al.
Published: (2025)
BreachSeek: A Multi-Agent Automated Penetration Tester
by: Alshehri, Ibrahim, et al.
Published: (2024)
by: Alshehri, Ibrahim, et al.
Published: (2024)
HackSynth: LLM Agent and Evaluation Framework for Autonomous Penetration Testing
by: Muzsai, Lajos, et al.
Published: (2024)
by: Muzsai, Lajos, et al.
Published: (2024)
Penetration Testing of 5G Core Network Web Technologies
by: Giambartolomei, Filippo, et al.
Published: (2024)
by: Giambartolomei, Filippo, et al.
Published: (2024)
HADES: Detecting Active Directory Attacks via Whole Network Provenance Analytics
by: Liu, Qi, et al.
Published: (2024)
by: Liu, Qi, et al.
Published: (2024)
Incorporation of Verifier Functionality in the Software for Operations and Network Attack Results Review and the Autonomous Penetration Testing System
by: Milbrath, Jordan, et al.
Published: (2024)
by: Milbrath, Jordan, et al.
Published: (2024)
Vulnerability Mitigation System (VMS): LLM Agent and Evaluation Framework for Autonomous Penetration Testing
by: Abdulzada, Farzana
Published: (2025)
by: Abdulzada, Farzana
Published: (2025)
LLM Agents can Autonomously Hack Websites
by: Fang, Richard, et al.
Published: (2024)
by: Fang, Richard, et al.
Published: (2024)
Red-MIRROR: Agentic LLM-based Autonomous Penetration Testing with Reflective Verification and Knowledge-augmented Interaction
by: Khang, Tran Vy, et al.
Published: (2026)
by: Khang, Tran Vy, et al.
Published: (2026)
The IoT Breaches your Household Again
by: Bonaventura, Davide, et al.
Published: (2024)
by: Bonaventura, Davide, et al.
Published: (2024)
Characterizing the Networks Sending Enterprise Phishing Emails
by: Luo, Elisa, et al.
Published: (2024)
by: Luo, Elisa, et al.
Published: (2024)
Insider Threats Mitigation: Role of Penetration Testing
by: Chauhan, Krutarth
Published: (2024)
by: Chauhan, Krutarth
Published: (2024)
Zero Trust Score-based Network-level Access Control in Enterprise Networks
by: Bradatsch, Leonard, et al.
Published: (2024)
by: Bradatsch, Leonard, et al.
Published: (2024)
Incentivizing Collaboration for Detection of Credential Database Breaches
by: Nanda, Mridu, et al.
Published: (2025)
by: Nanda, Mridu, et al.
Published: (2025)
Penetration Testing for System Security: Methods and Practical Approaches
by: Zhang, Wei, et al.
Published: (2025)
by: Zhang, Wei, et al.
Published: (2025)
Automated Penetration Testing with LLM Agents and Classical Planning
by: Wang, Lingzhi, et al.
Published: (2025)
by: Wang, Lingzhi, et al.
Published: (2025)
A Comprehensive Evaluation and Practice of System Penetration Testing
by: Zhang, Chunyi, et al.
Published: (2025)
by: Zhang, Chunyi, et al.
Published: (2025)
Advanced Penetration Testing for Enhancing 5G Security
by: Smith-Haynes, Shari-Ann
Published: (2024)
by: Smith-Haynes, Shari-Ann
Published: (2024)
PenTest++: Elevating Ethical Hacking with AI and Automation
by: Al-Sinani, Haitham S., et al.
Published: (2025)
by: Al-Sinani, Haitham S., et al.
Published: (2025)
A Consensus-Bayesian Framework for Detecting Malicious Activity in Enterprise Directory Access Graphs
by: Uppuluri, Pratyush, et al.
Published: (2026)
by: Uppuluri, Pratyush, et al.
Published: (2026)
Critical Infrastructure Security: Penetration Testing and Exploit Development Perspectives
by: Orleans-Bosomtwe, Papa Kobina
Published: (2024)
by: Orleans-Bosomtwe, Papa Kobina
Published: (2024)
PTHelper: An open source tool to support the Penetration Testing process
by: de Gracia, Jacobo Casado, et al.
Published: (2024)
by: de Gracia, Jacobo Casado, et al.
Published: (2024)
PentestAgent: Incorporating LLM Agents to Automated Penetration Testing
by: Shen, Xiangmin, et al.
Published: (2024)
by: Shen, Xiangmin, et al.
Published: (2024)
PrivacyXray: Detecting Privacy Breaches in LLMs through Semantic Consistency and Probability Certainty
by: He, Jinwen, et al.
Published: (2025)
by: He, Jinwen, et al.
Published: (2025)
Automated Penetration Testing: Formalization and Realization
by: Skandylas, Charilaos, et al.
Published: (2024)
by: Skandylas, Charilaos, et al.
Published: (2024)
xOffense: An Autonomous Multi-Agent Framework for Penetration Testing with Domain-Adapted Large Language Models
by: Luong, Phung Duc, et al.
Published: (2025)
by: Luong, Phung Duc, et al.
Published: (2025)
Mind the Gap: Towards Generalizable Autonomous Penetration Testing via Domain Randomization and Meta-Reinforcement Learning
by: Zhou, Shicheng, et al.
Published: (2024)
by: Zhou, Shicheng, et al.
Published: (2024)
Evaluation of Reinforcement Learning for Autonomous Penetration Testing using A3C, Q-learning and DQN
by: Becker, Norman, et al.
Published: (2024)
by: Becker, Norman, et al.
Published: (2024)
Similar Items
-
Can LLMs Hack Enterprise Networks? -- Replicated Computational Results (RCR) Report
by: Happe, Andreas, et al.
Published: (2026) -
On the Surprising Efficacy of LLMs for Penetration-Testing
by: Happe, Andreas, et al.
Published: (2025) -
Ethics Statements in Autonomous Penetration-Testing Agent Research
by: Happe, Andreas, et al.
Published: (2025) -
Cochise: A Reference Harness for Autonomous Penetration Testing
by: Happe, Andreas, et al.
Published: (2026) -
Got Root? A Linux Priv-Esc Benchmark
by: Happe, Andreas, et al.
Published: (2024)