Saved in:
| Main Author: | Untila, Octavian |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.21149 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
AIRA: AI-Induced Risk Audit: A Structured Inspection Framework for AI-Generated Code
by: Parris, William M.
Published: (2026)
by: Parris, William M.
Published: (2026)
Provable Fairness Repair for Deep Neural Networks
by: Ma, Jianan, et al.
Published: (2026)
by: Ma, Jianan, et al.
Published: (2026)
MFH: A Multi-faceted Heuristic Algorithm Selection Approach for Software Verification
by: Su, Jie, et al.
Published: (2025)
by: Su, Jie, et al.
Published: (2025)
When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges
by: Darshan, Parth, et al.
Published: (2026)
by: Darshan, Parth, et al.
Published: (2026)
DEFault++: Automated Fault Detection, Categorization, and Diagnosis for Transformer Architectures
by: Jahan, Sigma, et al.
Published: (2026)
by: Jahan, Sigma, et al.
Published: (2026)
A Practical Approach to Formal Methods: An Eclipse Integrated Development Environment (IDE) for Security Protocols
by: Garcia, Rémi, et al.
Published: (2024)
by: Garcia, Rémi, et al.
Published: (2024)
RepoAudit: An Autonomous LLM-Agent for Repository-Level Code Auditing
by: Guo, Jinyao, et al.
Published: (2025)
by: Guo, Jinyao, et al.
Published: (2025)
TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback
by: Jana, Prithwish, et al.
Published: (2026)
by: Jana, Prithwish, et al.
Published: (2026)
BONSAI: A Mixed-Initiative Workspace for Human-AI Co-Development of Visual Analytics Applications
by: Spinner, Thilo, et al.
Published: (2026)
by: Spinner, Thilo, et al.
Published: (2026)
Understanding and Detecting Flaky Builds in GitHub Actions
by: Ge, Wenhao, et al.
Published: (2026)
by: Ge, Wenhao, et al.
Published: (2026)
Nidus: Externalized Reasoning for AI-Assisted Engineering
by: Gorinevski, Danil
Published: (2026)
by: Gorinevski, Danil
Published: (2026)
AI-assisted JSON Schema Creation and Mapping
by: Neubauer, Felix, et al.
Published: (2025)
by: Neubauer, Felix, et al.
Published: (2025)
Automated Bug Triaging using Instruction-Tuned Large Language Models
by: Kiashemshaki, Kiana, et al.
Published: (2025)
by: Kiashemshaki, Kiana, et al.
Published: (2025)
A Self-Improving Architecture for Dynamic Safety in Large Language Models
by: Slater, Tyler
Published: (2025)
by: Slater, Tyler
Published: (2025)
LLMCup: Ranking-Enhanced Comment Updating with LLMs
by: Ge, Hua, et al.
Published: (2025)
by: Ge, Hua, et al.
Published: (2025)
Knowledge Equivalence in Digital Twins of Intelligent Systems
by: Zhang, Nan, et al.
Published: (2022)
by: Zhang, Nan, et al.
Published: (2022)
From Domain Understanding to Design Readiness: a playbook for GenAI-supported learning in Software Engineering
by: Wlodarski, Rafal
Published: (2026)
by: Wlodarski, Rafal
Published: (2026)
Can Graph-Based Microservice Performance Detection Be Used for Microservice Intrusion Detection?
by: Ma, Yunjian
Published: (2026)
by: Ma, Yunjian
Published: (2026)
AI-Assisted Engineering Should Track the Epistemic Status and Temporal Validity of Architectural Decisions
by: Gilda, Sankalp, et al.
Published: (2026)
by: Gilda, Sankalp, et al.
Published: (2026)
Secure coding for web applications: Frameworks, challenges, and the role of LLMs
by: Kiashemshaki, Kiana, et al.
Published: (2025)
by: Kiashemshaki, Kiana, et al.
Published: (2025)
PARNESS: A Paper Harness for End-to-End Automated Scientific Research with Dynamic Workflows, Full-Text Indexing, and Cross-Run Knowledge Accumulation
by: Wang, Yuchen, et al.
Published: (2026)
by: Wang, Yuchen, et al.
Published: (2026)
MOCHA: Multi-Objective Chebyshev Annealing for Agent Skill Optimization
by: Tanjim, Md Mehrab, et al.
Published: (2026)
by: Tanjim, Md Mehrab, et al.
Published: (2026)
LLMDFA: Analyzing Dataflow in Code with Large Language Models
by: Wang, Chengpeng, et al.
Published: (2024)
by: Wang, Chengpeng, et al.
Published: (2024)
On the Mistaken Assumption of Interchangeable Deep Reinforcement Learning Implementations
by: Hundal, Rajdeep Singh, et al.
Published: (2025)
by: Hundal, Rajdeep Singh, et al.
Published: (2025)
Towards Explainable Test Case Prioritisation with Learning-to-Rank Models
by: Ramírez, Aurora, et al.
Published: (2024)
by: Ramírez, Aurora, et al.
Published: (2024)
Towards Continuous Assurance with Formal Verification and Assurance Cases
by: Abeywickrama, Dhaminda B., et al.
Published: (2025)
by: Abeywickrama, Dhaminda B., et al.
Published: (2025)
Provable Repair of Deep Neural Network Defects by Preimage Synthesis and Property Refinement
by: Ma, Jianan, et al.
Published: (2025)
by: Ma, Jianan, et al.
Published: (2025)
Multi-Agent Code Verification via Information Theory
by: Rajan, Shreshth
Published: (2025)
by: Rajan, Shreshth
Published: (2025)
Feedback-Normalized Developer Memory for Reinforcement-Learning Coding Agents: A Safety-Gated MCP Architecture
by: Iscan, Mehmet
Published: (2026)
by: Iscan, Mehmet
Published: (2026)
Software Defined Vehicle Code Generation: A Few-Shot Prompting Approach
by: Nguyen, Quang-Dung, et al.
Published: (2025)
by: Nguyen, Quang-Dung, et al.
Published: (2025)
The Single-File Test: A Longitudinal Public-Interface Evaluation of First-Output LLM Web Generation with Social Reach Tracking
by: Palacios, Diego Cabezas
Published: (2026)
by: Palacios, Diego Cabezas
Published: (2026)
AuditRepairBench: A Paired-Execution Trace Corpus for Evaluator-Channel Ranking Instability in Agent Repair
by: Hu, Yuelin, et al.
Published: (2026)
by: Hu, Yuelin, et al.
Published: (2026)
CodeTracer: Towards Traceable Agent States
by: Li, Han, et al.
Published: (2026)
by: Li, Han, et al.
Published: (2026)
SDVDiag: Using Context-Aware Causality Mining for the Diagnosis of Connected Vehicle Functions
by: Weiß, Matthias, et al.
Published: (2026)
by: Weiß, Matthias, et al.
Published: (2026)
Comparing Unidirectional, Bidirectional, and Word2vec Models for Discovering Vulnerabilities in Compiled Lifted Code
by: McCully, Gary A., et al.
Published: (2024)
by: McCully, Gary A., et al.
Published: (2024)
Bi-Directional Transformers vs. word2vec: Discovering Vulnerabilities in Lifted Compiled Code
by: McCully, Gary A., et al.
Published: (2024)
by: McCully, Gary A., et al.
Published: (2024)
LLM Agents for Generating Microservice-based Applications: how complex is your specification?
by: Yellin, Daniel M.
Published: (2025)
by: Yellin, Daniel M.
Published: (2025)
GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning
by: Agrawal, Lakshya A, et al.
Published: (2025)
by: Agrawal, Lakshya A, et al.
Published: (2025)
Uncovering Bugs in Formal Explainers: A Case Study with PyXAI
by: Huang, Xuanxiang, et al.
Published: (2025)
by: Huang, Xuanxiang, et al.
Published: (2025)
A measurement substrate for agentic Kubernetes operations: Methodology and a case study in retrieval-compounding falsification
by: Odmark, Joshua, et al.
Published: (2026)
by: Odmark, Joshua, et al.
Published: (2026)
Similar Items
-
AIRA: AI-Induced Risk Audit: A Structured Inspection Framework for AI-Generated Code
by: Parris, William M.
Published: (2026) -
Provable Fairness Repair for Deep Neural Networks
by: Ma, Jianan, et al.
Published: (2026) -
MFH: A Multi-faceted Heuristic Algorithm Selection Approach for Software Verification
by: Su, Jie, et al.
Published: (2025) -
When Gradients Collide: Failure Modes of Multi-Objective Prompt Optimization for LLM Judges
by: Darshan, Parth, et al.
Published: (2026) -
DEFault++: Automated Fault Detection, Categorization, and Diagnosis for Transformer Architectures
by: Jahan, Sigma, et al.
Published: (2026)