| Main Author: | Parris, William M. |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2604.17587 |
Similar Items
DEFault++: Automated Fault Detection, Categorization, and Diagnosis for Transformer Architectures
by: Jahan, Sigma, et al.
Published: (2026)
Validating Solidity Code Defects using Symbolic and Concrete Execution powered by Large Language Models
by: Susan, Ştefan-Claudiu, et al.
Published: (2025)
RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing
by: Guo, Jinyao, et al.
Published: (2025)
CodeTracer: Towards Traceable Agent States
by: Li, Han, et al.
Published: (2026)
AuditRepairBench: A Paired-Execution Trace Corpus for Evaluator-Channel Ranking Instability in Agent Repair
by: Hu, Yuelin, et al.
Published: (2026)
The Specification as Quality Gate: Three Hypotheses on AI-Assisted Code Review
by: Zietsman, Christo
Published: (2026)
LLMLOOP: Improving LLM-Generated Code and Tests through Automated Iterative Feedback Loops
by: Ravi, Ravin, et al.
Published: (2026)
Towards a Probabilistic Framework for Analyzing and Improving LLM-Enabled Software
by: Baldonado, Juan Manuel, et al.
Published: (2025)
Emergent Formal Verification: How an Autonomous AI Ecosystem Independently Discovered SMT-Based Safety Across Six Domains
by: Untila, Octavian
Published: (2026)
Iterative Audit Convergence in LLM-Managed Multi-Agent Systems: A Case Study in Prompt Engineering Quality Assurance
by: Calboreanu, Elias
Published: (2026)
On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations
by: Hundal, Rajdeep Singh, et al.
Published: (2025)
Towards Explainable Test Case Prioritisation with Learning-to-Rank Models
by: Ramírez, Aurora, et al.
Published: (2024)
Monitoring Agentic Systems Before They're Reliable
by: Boston, Marisa Ferrara, et al.
Published: (2026)
Automated structural testing of LLM-based agents: methods, framework, and case studies
by: Kohl, Jens, et al.
Published: (2026)
L2MAC: Large Language Model Automatic Computer for Extensive Code Generation
by: Holt, Samuel, et al.
Published: (2023)
TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback
by: Jana, Prithwish, et al.
Published: (2026)
Understanding and Detecting Flaky Builds in GitHub Actions
by: Ge, Wenhao, et al.
Published: (2026)
LLMDFA: Analyzing Dataflow in Code with Large Language Models
by: Wang, Chengpeng, et al.
Published: (2024)
MFH: A Multi-faceted Heuristic Algorithm Selection Approach for Software Verification
by: Su, Jie, et al.
Published: (2025)
A measurement substrate for agentic Kubernetes operations: Methodology and a case study in retrieval-compounding falsification
by: Odmark, Joshua, et al.
Published: (2026)
AI Bill of Materials and Beyond: Systematizing Security Assurance through the AI Risk Scanning (AIRS) Framework
by: Nathanson, Samuel, et al.
Published: (2025)
Generative AI and the Transformation of Software Development Practices
by: Acharya, Vivek
Published: (2025)
Context Engineering for Multi-Agent LLM Code Assistants Using Elicit, NotebookLM, ChatGPT, and Claude Code
by: Haseeb, Muhammad
Published: (2025)
Constitutional Spec-Driven Development: Enforcing Security by Construction in AI-Assisted Code Generation
by: Marri, Srinivas Rao
Published: (2026)
Dual-Process Scaffold Reasoning for Enhancing LLM Code Debugging
by: Hsieh, Po-Chung, et al.
Published: (2025)
Provable Fairness Repair for Deep Neural Networks
by: Ma, Jianan, et al.
Published: (2026)
AgentEval: DAG-Structured Step-Level Evaluation for Agentic Workflows with Error Propagation Tracking
by: Guo, Dongxin, et al.
Published: (2026)
Orion: Fuzzing Workflow Automation
by: Bazalii, Max, et al.
Published: (2025)
Adaptive and AI-Augmented Security Testing: A Systematic Survey of Program Analysis, Feedback-Driven Testing, and Hybrid Learning-Based Approaches
by: Wienczkowski, Michael
Published: (2026)
Comparing Human and LLM Generated Code: The Jury is Still Out!
by: Licorish, Sherlock A., et al.
Published: (2025)
VulScribeR: Exploring RAG-based Vulnerability Augmentation with LLMs
by: Daneshvar, Seyed Shayan, et al.
Published: (2024)
SDVDiag: Using Context-Aware Causality Mining for the Diagnosis of Connected Vehicle Functions
by: Weiß, Matthias, et al.
Published: (2026)
CovRL: Fuzzing JavaScript Engines with Coverage-Guided Reinforcement Learning for LLM-based Mutation
by: Eom, Jueon, et al.
Published: (2024)
Scattered Forest Search: Smarter Code Space Exploration with LLMs
by: Light, Jonathan, et al.
Published: (2024)
Structural Quality Gaps in Practitioner AI Governance Prompts: An Empirical Study Using a Five-Principle Evaluation Framework
by: Zietsman, Christo
Published: (2026)
Automated Vulnerability Detection Using Deep Learning Technique
by: Yang, Guan-Yan, et al.
Published: (2024)
ClawHub Security Signals: When VirusTotal, Static Analysis, and SkillSpector Disagree
by: Koc, Vincent, et al.
Published: (2026)
N-Version Assessment and Enhancement of Generative AI
by: Kessel, Marcus, et al.
Published: (2024)
ATLAS: A Layered Constraint-Guided Framework for Structured Artifact Generation in LLM-Assisted MDE
by: Ma, Tong, et al.
Published: (2025)
SLEAN: Simple Lightweight Ensemble Analysis Network for Multi-Provider LLM Coordination: Design, Implementation, and Vibe Coding Bug Investigation Case Study
by: Vargas, Matheus J. T.
Published: (2025)