:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Untila, Octavian
Format:	Preprint
Published:	2026
Subjects:	Software Engineering Artificial Intelligence Multiagent Systems D.2.4; I.2.6
Online Access:	https://arxiv.org/abs/2603.21149
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

AIRA: AI-Induced Risk Audit: A Structured Inspection Framework for AI-Generated Code
by: Parris, William M.
Published: (2026)

Provable Fairness Repair for Deep Neural Networks
by: Ma, Jianan, et al.
Published: (2026)

MFH: A Multi-faceted Heuristic Algorithm Selection Approach for Software Verification
by: Su, Jie, et al.
Published: (2025)

When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges
by: Darshan, Parth, et al.
Published: (2026)

DEFault++: Automated Fault Detection, Categorization, and Diagnosis for Transformer Architectures
by: Jahan, Sigma, et al.
Published: (2026)

A Practical Approach to Formal Methods: An Eclipse Integrated Development Environment (IDE) for Security Protocols
by: Garcia, Rémi, et al.
Published: (2024)

RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing
by: Guo, Jinyao, et al.
Published: (2025)

TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback
by: Jana, Prithwish, et al.
Published: (2026)

BONSAI: A Mixed-Initiative Workspace for Human-AI Co-Development of Visual Analytics Applications
by: Spinner, Thilo, et al.
Published: (2026)

Understanding and Detecting Flaky Builds in GitHub Actions
by: Ge, Wenhao, et al.
Published: (2026)

Nidus: Externalized Reasoning for AI-Assisted Engineering
by: Gorinevski, Danil
Published: (2026)

AI-assisted JSON Schema Creation and Mapping
by: Neubauer, Felix, et al.
Published: (2025)

Automated Bug Triaging using Instruction-Tuned Large Language Models
by: Kiashemshaki, Kiana, et al.
Published: (2025)

A Self-Improving Architecture for Dynamic Safety in Large Language Models
by: Slater, Tyler
Published: (2025)

LLMCup: Ranking-Enhanced Comment Updating with LLMs
by: Ge, Hua, et al.
Published: (2025)

Knowledge Equivalence in Digital Twins of Intelligent Systems
by: Zhang, Nan, et al.
Published: (2022)

From Domain Understanding to Design Readiness: a playbook for GenAI-supported learning in Software Engineering
by: Wlodarski, Rafal
Published: (2026)

Can Graph-Based Microservice Performance Detection Be Used for Microservice Intrusion Detection?
by: Ma, Yunjian
Published: (2026)

AI-Assisted Engineering Should Track the Epistemic Status and Temporal Validity of Architectural Decisions
by: Gilda, Sankalp, et al.
Published: (2026)

Secure coding for web applications: Frameworks, challenges, and the role of LLMs
by: Kiashemshaki, Kiana, et al.
Published: (2025)

PARNESS: A Paper Harness for End-to-End Automated Scientific Research with Dynamic Workflows, Full-Text Indexing, and Cross-Run Knowledge Accumulation
by: Wang, Yuchen, et al.
Published: (2026)

MOCHA: Multi-Objective Chebyshev Annealing for Agent Skill Optimization
by: Tanjim, Md Mehrab, et al.
Published: (2026)

LLMDFA: Analyzing Dataflow in Code with Large Language Models
by: Wang, Chengpeng, et al.
Published: (2024)

On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations
by: Hundal, Rajdeep Singh, et al.
Published: (2025)

Towards Explainable Test Case Prioritisation with Learning-to-Rank Models
by: Ramírez, Aurora, et al.
Published: (2024)

Towards Continuous Assurance with Formal Verification and Assurance Cases
by: Abeywickrama, Dhaminda B., et al.
Published: (2025)

Provable Repair of Deep Neural Network Defects by Preimage Synthesis and Property Refinement
by: Ma, Jianan, et al.
Published: (2025)

Multi-Agent Code Verification via Information Theory
by: Rajan, Shreshth
Published: (2025)

Feedback-Normalized Developer Memory for Reinforcement-Learning Coding Agents: A Safety-Gated MCP Architecture
by: Iscan, Mehmet
Published: (2026)

Software Defined Vehicle Code Generation: A Few-Shot Prompting Approach
by: Nguyen, Quang-Dung, et al.
Published: (2025)

The Single-File Test: A Longitudinal Public-Interface Evaluation of First-Output LLM Web Generation with Social Reach Tracking
by: Palacios, Diego Cabezas
Published: (2026)

AuditRepairBench: A Paired-Execution Trace Corpus for Evaluator-Channel Ranking Instability in Agent Repair
by: Hu, Yuelin, et al.
Published: (2026)

CodeTracer: Towards Traceable Agent States
by: Li, Han, et al.
Published: (2026)

SDVDiag: Using Context-Aware Causality Mining for the Diagnosis of Connected Vehicle Functions
by: Weiß, Matthias, et al.
Published: (2026)

Comparing Unidirectional, Bidirectional, and Word2vec Models for Discovering Vulnerabilities in Compiled Lifted Code
by: McCully, Gary A., et al.
Published: (2024)

Bi-Directional Transformers vs. word2vec: Discovering Vulnerabilities in Lifted Compiled Code
by: McCully, Gary A., et al.
Published: (2024)

LLM Agents for Generating Microservice-based Applications: how complex is your specification?
by: Yellin, Daniel M.
Published: (2025)

GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
by: Agrawal, Lakshya A, et al.
Published: (2025)

Uncovering Bugs in Formal Explainers: A Case Study with PyXAI
by: Huang, Xuanxiang, et al.
Published: (2025)

A measurement substrate for agentic Kubernetes operations: Methodology and a case study in retrieval-compounding falsification
by: Odmark, Joshua, et al.
Published: (2026)