Saved in:
| Main Authors: | Stocker, Mirko, Wahler, Michael |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.26609 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Toward Architecture-Aware Evaluation Metrics for LLM Agents
by: Souza, Débora, et al.
Published: (2026)
by: Souza, Débora, et al.
Published: (2026)
EvoGraph: Hybrid Directed Graph Evolution toward Software 3.0
by: Costa, Igor, et al.
Published: (2025)
by: Costa, Igor, et al.
Published: (2025)
A History Equivalence Algorithm for Dynamic Process Migration
by: Bakshi, Gargi, et al.
Published: (2024)
by: Bakshi, Gargi, et al.
Published: (2024)
Early-Stage Requirements Transformation Approaches: A Systematic Review
by: Letsholo, Keletso J.
Published: (2024)
by: Letsholo, Keletso J.
Published: (2024)
Leveraging Large Language Models for Use Case Model Generation from Software Requirements
by: Eisenreich, Tobias, et al.
Published: (2025)
by: Eisenreich, Tobias, et al.
Published: (2025)
Towards Observation Lakehouses: Living, Interactive Archives of Software Behavior
by: Kessel, Marcus
Published: (2025)
by: Kessel, Marcus
Published: (2025)
AgentModernize: Preserving Business Logic in Legacy Modernization with Multi-Agent LLMs and Behavioral Specification Graphs
by: Ahmed, Sheikh Nazib, et al.
Published: (2026)
by: Ahmed, Sheikh Nazib, et al.
Published: (2026)
OODEval: Evaluating Large Language Models on Object-Oriented Design
by: Xiao, Bingxu, et al.
Published: (2026)
by: Xiao, Bingxu, et al.
Published: (2026)
Morescient GAI for Software Engineering (Extended Version)
by: Kessel, Marcus, et al.
Published: (2024)
by: Kessel, Marcus, et al.
Published: (2024)
Test-driven Software Experimentation with LASSO: an LLM Prompt Benchmarking Example
by: Kessel, Marcus
Published: (2024)
by: Kessel, Marcus
Published: (2024)
SPViz: A DSL-Driven Approach for Software Project Visualization Tooling
by: Rentz, Niklas, et al.
Published: (2024)
by: Rentz, Niklas, et al.
Published: (2024)
Towards Single-System Illusion in Software-Defined Vehicles -- Automated, AI-Powered Workflow
by: Lebioda, Krzysztof, et al.
Published: (2024)
by: Lebioda, Krzysztof, et al.
Published: (2024)
Automating Domain-Driven Design: Experience with a Prompting Framework
by: Eisenreich, Tobias, et al.
Published: (2026)
by: Eisenreich, Tobias, et al.
Published: (2026)
RelRepair: Enhancing Automated Program Repair by Retrieving Relevant Code
by: Liu, Shunyu, et al.
Published: (2025)
by: Liu, Shunyu, et al.
Published: (2025)
Contrastive Learning-Enhanced Large Language Models for Monolith-to-Microservice Decomposition
by: Sellami, Khaled, et al.
Published: (2025)
by: Sellami, Khaled, et al.
Published: (2025)
Reconsidering Requirements Engineering: Human-AI Collaboration in AI-Native Software Development
by: Abbasi, Mateen Ahmed, et al.
Published: (2025)
by: Abbasi, Mateen Ahmed, et al.
Published: (2025)
Learning Software Bug Reports: A Systematic Literature Review
by: Long, Guoming, et al.
Published: (2025)
by: Long, Guoming, et al.
Published: (2025)
The Upper Bound of Information Diffusion in Code Review
by: Dorner, Michael, et al.
Published: (2023)
by: Dorner, Michael, et al.
Published: (2023)
Technical Debt Management: The Road Ahead for Successful Software Delivery
by: Avgeriou, Paris, et al.
Published: (2024)
by: Avgeriou, Paris, et al.
Published: (2024)
Exploring LLMs for User Story Extraction from Mockups
by: Firmenich, Diego, et al.
Published: (2026)
by: Firmenich, Diego, et al.
Published: (2026)
SmellBench: Evaluating LLM Agents on Architectural Code Smell Repair
by: Dinu, Ion George, et al.
Published: (2026)
by: Dinu, Ion George, et al.
Published: (2026)
LLMs as Idiomatic Decompilers: Recovering High-Level Code from x86-64 Assembly for Dart
by: Abualazm, Raafat, et al.
Published: (2026)
by: Abualazm, Raafat, et al.
Published: (2026)
Breaking Changes in Software Ecosystems: A Systematic Literature Review
by: Chen, Juntao, et al.
Published: (2026)
by: Chen, Juntao, et al.
Published: (2026)
LLM-Assisted Translation of Legacy FORTRAN Codes to C++: A Cross-Platform Study
by: Ranasinghe, Nishath Rajiv, et al.
Published: (2025)
by: Ranasinghe, Nishath Rajiv, et al.
Published: (2025)
N-Version Assessment and Enhancement of Generative AI
by: Kessel, Marcus, et al.
Published: (2024)
by: Kessel, Marcus, et al.
Published: (2024)
Synergy of Large Language Model and Model Driven Engineering for Automated Development of Centralized Vehicular Systems
by: Petrovic, Nenad, et al.
Published: (2024)
by: Petrovic, Nenad, et al.
Published: (2024)
Site Reliability Engineering (SRE) and Observations on SRE Process to Make Tasks Easier
by: Puli, Balaram
Published: (2025)
by: Puli, Balaram
Published: (2025)
Comprehensive Evaluation of Large Language Models on Software Engineering Tasks: A Multi-Task Benchmark
by: Gunawan, Go Frendi, et al.
Published: (2026)
by: Gunawan, Go Frendi, et al.
Published: (2026)
Characterising Contributions that Coincide with Vulnerability Mitigation in NPM Libraries
by: Rojpaisarnkit, Ruksit, et al.
Published: (2024)
by: Rojpaisarnkit, Ruksit, et al.
Published: (2024)
Interoperability From Kieker to OpenTelemetry: Demonstrated as Export to ExplorViz
by: Reichelt, David Georg, et al.
Published: (2024)
by: Reichelt, David Georg, et al.
Published: (2024)
Towards Identifying Code Proficiency through the Analysis of Python Textbooks
by: Rojpaisarnkit, Ruksit, et al.
Published: (2024)
by: Rojpaisarnkit, Ruksit, et al.
Published: (2024)
Developer-LLM Conversations: An Empirical Study of Interactions and Generated Code Quality
by: Zhong, Suzhen, et al.
Published: (2025)
by: Zhong, Suzhen, et al.
Published: (2025)
Beyond Greenfield: The D3 Framework for AI-Driven Productivity in Brownfield Engineering
by: Sharma, Krishna Kumaar
Published: (2025)
by: Sharma, Krishna Kumaar
Published: (2025)
Unified Modeling Language Code Generation from Diagram Images Using Multimodal Large Language Models
by: Bates, Averi, et al.
Published: (2025)
by: Bates, Averi, et al.
Published: (2025)
Compiled AI: Deterministic Code Generation for LLM-Based Workflow Automation
by: Trooskens, Geert, et al.
Published: (2026)
by: Trooskens, Geert, et al.
Published: (2026)
CIFE: Code Instruction-Following Evaluation
by: Gunnu, Sravani, et al.
Published: (2025)
by: Gunnu, Sravani, et al.
Published: (2025)
Addressing Data Leakage in HumanEval Using Combinatorial Test Design
by: Bradbury, Jeremy S., et al.
Published: (2024)
by: Bradbury, Jeremy S., et al.
Published: (2024)
Migrating Esope to Fortran 2008 using model transformations
by: Sow, Younoussa, et al.
Published: (2026)
by: Sow, Younoussa, et al.
Published: (2026)
LLM-based vs. Search-based Merge Conflict Resolution: An Empirical Study of Competing Paradigms
by: Junior, Heleno de Souza Campos, et al.
Published: (2026)
by: Junior, Heleno de Souza Campos, et al.
Published: (2026)
Early-Stage Prediction of Review Effort in AI-Generated Pull Requests
by: Minh, Dao Sy Duy, et al.
Published: (2026)
by: Minh, Dao Sy Duy, et al.
Published: (2026)
Similar Items
-
Toward Architecture-Aware Evaluation Metrics for LLM Agents
by: Souza, Débora, et al.
Published: (2026) -
EvoGraph: Hybrid Directed Graph Evolution toward Software 3.0
by: Costa, Igor, et al.
Published: (2025) -
A History Equivalence Algorithm for Dynamic Process Migration
by: Bakshi, Gargi, et al.
Published: (2024) -
Early-Stage Requirements Transformation Approaches: A Systematic Review
by: Letsholo, Keletso J.
Published: (2024) -
Leveraging Large Language Models for Use Case Model Generation from Software Requirements
by: Eisenreich, Tobias, et al.
Published: (2025)