Saved in:
Bibliographic Details
Main Authors: Jafari, Mahdi, Sharma, Rahul, Naim, Sami, Gerking, Christopher, Reussner, Ralf
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.23698
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914420296253440
author Jafari, Mahdi
Sharma, Rahul
Naim, Sami
Gerking, Christopher
Reussner, Ralf
author_facet Jafari, Mahdi
Sharma, Rahul
Naim, Sami
Gerking, Christopher
Reussner, Ralf
contents Software architecture models capture early design decisions that strongly influence system quality attributes, including security. However, architecture-level security assessment and feedback are often absent in practice, allowing security weaknesses to propagate into later phases of the software development lifecycle and, in some cases, to remain undiscovered, ultimately leading to vulnerable systems. In this paper, we bridge this gap by proposing the generation of Abstract Penetration Test Cases (APTCs) from software architecture models as an input to support architecture-level security assessment. We first introduce a metamodel that defines the APTC concept, and then investigate the use of large language models with different prompting strategies to generate meaningful APTCs from architecture models. To design the APTC metamodel, we analyze relevant standards and state of the art using two criteria: (i) derivability from software architecture, and (ii) usability for both architecture security assessment and subsequent penetration testing. Building on this metamodel, we then proceed to generate APTCs from software architecture models. Our evaluation shows promising results, achieving up to 93\% usefulness and 86\% correctness, indicating that the generated APTCs can substantially support both architects (by highlighting security-critical design decisions) and penetration testers (by providing actionable testing guidance).
format Preprint
id arxiv_https___arxiv_org_abs_2603_23698
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Towards Leveraging LLMs to Generate Abstract Penetration Test Cases from Software Architecture
Jafari, Mahdi
Sharma, Rahul
Naim, Sami
Gerking, Christopher
Reussner, Ralf
Software Engineering
Software architecture models capture early design decisions that strongly influence system quality attributes, including security. However, architecture-level security assessment and feedback are often absent in practice, allowing security weaknesses to propagate into later phases of the software development lifecycle and, in some cases, to remain undiscovered, ultimately leading to vulnerable systems. In this paper, we bridge this gap by proposing the generation of Abstract Penetration Test Cases (APTCs) from software architecture models as an input to support architecture-level security assessment. We first introduce a metamodel that defines the APTC concept, and then investigate the use of large language models with different prompting strategies to generate meaningful APTCs from architecture models. To design the APTC metamodel, we analyze relevant standards and state of the art using two criteria: (i) derivability from software architecture, and (ii) usability for both architecture security assessment and subsequent penetration testing. Building on this metamodel, we then proceed to generate APTCs from software architecture models. Our evaluation shows promising results, achieving up to 93\% usefulness and 86\% correctness, indicating that the generated APTCs can substantially support both architects (by highlighting security-critical design decisions) and penetration testers (by providing actionable testing guidance).
title Towards Leveraging LLMs to Generate Abstract Penetration Test Cases from Software Architecture
topic Software Engineering
url https://arxiv.org/abs/2603.23698