:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Parris, William M.
Format:	Preprint
Published:	2026
Subjects:	Software Engineering Artificial Intelligence D.2.5; I.2.6; D.2.4
Online Access:	https://arxiv.org/abs/2604.17587
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

DEFault++: Automated Fault Detection, Categorization, and Diagnosis for Transformer Architectures
by: Jahan, Sigma, et al.
Published: (2026)

Validating Solidity Code Defects using Symbolic and Concrete Execution powered by Large Language Models
by: Susan, Ştefan-Claudiu, et al.
Published: (2025)

RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing
by: Guo, Jinyao, et al.
Published: (2025)

CodeTracer: Towards Traceable Agent States
by: Li, Han, et al.
Published: (2026)

AuditRepairBench: A Paired-Execution Trace Corpus for Evaluator-Channel Ranking Instability in Agent Repair
by: Hu, Yuelin, et al.
Published: (2026)

The Specification as Quality Gate: Three Hypotheses on AI-Assisted Code Review
by: Zietsman, Christo
Published: (2026)

LLMLOOP: Improving LLM-Generated Code and Tests through Automated Iterative Feedback Loops
by: Ravi, Ravin, et al.
Published: (2026)

Towards a Probabilistic Framework for Analyzing and Improving LLM-Enabled Software
by: Baldonado, Juan Manuel, et al.
Published: (2025)

Emergent Formal Verification: How an Autonomous AI Ecosystem Independently Discovered SMT-Based Safety Across Six Domains
by: Untila, Octavian
Published: (2026)

Iterative Audit Convergence in LLM-Managed Multi-Agent Systems: A Case Study in Prompt Engineering Quality Assurance
by: Calboreanu, Elias
Published: (2026)

On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations
by: Hundal, Rajdeep Singh, et al.
Published: (2025)

Towards Explainable Test Case Prioritisation with Learning-to-Rank Models
by: Ramírez, Aurora, et al.
Published: (2024)

Monitoring Agentic Systems Before They're Reliable
by: Boston, Marisa Ferrara, et al.
Published: (2026)

Automated structural testing of LLM-based agents: methods, framework, and case studies
by: Kohl, Jens, et al.
Published: (2026)

L2MAC: Large Language Model Automatic Computer for Extensive Code Generation
by: Holt, Samuel, et al.
Published: (2023)

TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback
by: Jana, Prithwish, et al.
Published: (2026)

Understanding and Detecting Flaky Builds in GitHub Actions
by: Ge, Wenhao, et al.
Published: (2026)

LLMDFA: Analyzing Dataflow in Code with Large Language Models
by: Wang, Chengpeng, et al.
Published: (2024)

MFH: A Multi-faceted Heuristic Algorithm Selection Approach for Software Verification
by: Su, Jie, et al.
Published: (2025)

A measurement substrate for agentic Kubernetes operations: Methodology and a case study in retrieval-compounding falsification
by: Odmark, Joshua, et al.
Published: (2026)

AI Bill of Materials and Beyond: Systematizing Security Assurance through the AI Risk Scanning (AIRS) Framework
by: Nathanson, Samuel, et al.
Published: (2025)

Generative AI and the Transformation of Software Development Practices
by: Acharya, Vivek
Published: (2025)

Context Engineering for Multi-Agent LLM Code Assistants Using Elicit, NotebookLM, ChatGPT, and Claude Code
by: Haseeb, Muhammad
Published: (2025)

Constitutional Spec-Driven Development: Enforcing Security by Construction in AI-Assisted Code Generation
by: Marri, Srinivas Rao
Published: (2026)

Dual-Process Scaffold Reasoning for Enhancing LLM Code Debugging
by: Hsieh, Po-Chung, et al.
Published: (2025)

Provable Fairness Repair for Deep Neural Networks
by: Ma, Jianan, et al.
Published: (2026)

AgentEval: DAG-Structured Step-Level Evaluation for Agentic Workflows with Error Propagation Tracking
by: Guo, Dongxin, et al.
Published: (2026)

Orion: Fuzzing Workflow Automation
by: Bazalii, Max, et al.
Published: (2025)

Adaptive and AI-Augmented Security Testing: A Systematic Survey of Program Analysis, Feedback-Driven Testing, and Hybrid Learning-Based Approaches
by: Wienczkowski, Michael
Published: (2026)

Comparing Human and LLM Generated Code: The Jury is Still Out!
by: Licorish, Sherlock A., et al.
Published: (2025)

VulScribeR: Exploring RAG-based Vulnerability Augmentation with LLMs
by: Daneshvar, Seyed Shayan, et al.
Published: (2024)

SDVDiag: Using Context-Aware Causality Mining for the Diagnosis of Connected Vehicle Functions
by: Weiß, Matthias, et al.
Published: (2026)

CovRL: Fuzzing JavaScript Engines with Coverage-Guided Reinforcement Learning for LLM-based Mutation
by: Eom, Jueon, et al.
Published: (2024)

Scattered Forest Search: Smarter Code Space Exploration with LLMs
by: Light, Jonathan, et al.
Published: (2024)

Structural Quality Gaps in Practitioner AI Governance Prompts: An Empirical Study Using a Five-Principle Evaluation Framework
by: Zietsman, Christo
Published: (2026)

Automated Vulnerability Detection Using Deep Learning Technique
by: Yang, Guan-Yan, et al.
Published: (2024)

ClawHub Security Signals: When VirusTotal, Static Analysis, and SkillSpector Disagree
by: Koc, Vincent, et al.
Published: (2026)

N-Version Assessment and Enhancement of Generative AI
by: Kessel, Marcus, et al.
Published: (2024)

ATLAS: A Layered Constraint-Guided Framework for Structured Artifact Generation in LLM-Assisted MDE
by: Ma, Tong, et al.
Published: (2025)

SLEAN: Simple Lightweight Ensemble Analysis Network for Multi-Provider LLM Coordination: Design, Implementation, and Vibe Coding Bug Investigation Case Study
by: Vargas, Matheus J. T.
Published: (2025)