Saved in:
| Main Authors: | Zheng, Mingwei, Xie, Danning, Shi, Qingkai, Wang, Chengpeng, Zhang, Xiangyu |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.18050 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Large Language Models for Validating Network Protocol Parsers
by: Zheng, Mingwei, et al.
Published: (2025)
by: Zheng, Mingwei, et al.
Published: (2025)
CoRe: Benchmarking LLMs Code Reasoning Capabilities through Static Analysis Tasks
by: Xie, Danning, et al.
Published: (2025)
by: Xie, Danning, et al.
Published: (2025)
RFCAudit: An LLM Agent for Functional Bug Detection in Network Protocols
by: Zheng, Mingwei, et al.
Published: (2025)
by: Zheng, Mingwei, et al.
Published: (2025)
Raw Pointer Rewriting with LLMs for Translating C to Safer Rust
by: Gao, Yifei, et al.
Published: (2025)
by: Gao, Yifei, et al.
Published: (2025)
On Interpreting the Effectiveness of Unsupervised Software Traceability with Information Theory
by: Palacio, David N., et al.
Published: (2024)
by: Palacio, David N., et al.
Published: (2024)
NESA: Relational Neuro-Symbolic Static Program Analysis
by: Wang, Chengpeng, et al.
Published: (2024)
by: Wang, Chengpeng, et al.
Published: (2024)
TAI3: Testing Agent Integrity in Interpreting User Intent
by: Feng, Shiwei, et al.
Published: (2025)
by: Feng, Shiwei, et al.
Published: (2025)
Evaluating the Use of LLMs for Documentation to Code Traceability
by: Alor, Ebube, et al.
Published: (2025)
by: Alor, Ebube, et al.
Published: (2025)
REPOFUSE: Repository-Level Code Completion with Fused Dual Context
by: Liang, Ming, et al.
Published: (2024)
by: Liang, Ming, et al.
Published: (2024)
LogParser-LLM: Advancing Efficient Log Parsing with Large Language Models
by: Zhong, Aoxiao, et al.
Published: (2024)
by: Zhong, Aoxiao, et al.
Published: (2024)
Extracting Protocol Format as State Machine via Controlled Static Loop Analysis
by: Shi, Qingkai, et al.
Published: (2023)
by: Shi, Qingkai, et al.
Published: (2023)
Traceability and Accountability in Role-Specialized Multi-Agent LLM Pipelines
by: Barrak, Amine
Published: (2025)
by: Barrak, Amine
Published: (2025)
Understanding Automated Program Repair Agents Through the Lens of Traceability: An Empirical Study
by: Ceka, Ira, et al.
Published: (2025)
by: Ceka, Ira, et al.
Published: (2025)
Natural Language-Programming Language Software Traceability Link Recovery Needs More than Textual Similarity
by: Zou, Zhiyuan, et al.
Published: (2025)
by: Zou, Zhiyuan, et al.
Published: (2025)
SpecMap: Hierarchical LLM Agent for Datasheet-to-Code Traceability Link Recovery in Systems Engineering
by: Nipane, Vedant, et al.
Published: (2026)
by: Nipane, Vedant, et al.
Published: (2026)
RustEvo^2: An Evolving Benchmark for API Evolution in LLM-based Rust Code Generation
by: Liang, Linxi, et al.
Published: (2025)
by: Liang, Linxi, et al.
Published: (2025)
FlakyGuard: Automatically Fixing Flaky Tests at Industry Scale
by: Li, Chengpeng, et al.
Published: (2025)
by: Li, Chengpeng, et al.
Published: (2025)
Bridging Generation and Training: A Systematic Review of Quality Issues in LLMs for Code
by: He, Kaifeng, et al.
Published: (2026)
by: He, Kaifeng, et al.
Published: (2026)
Beyond Functional Correctness: Investigating Coding Style Inconsistencies in Large Language Models
by: Wang, Yanlin, et al.
Published: (2024)
by: Wang, Yanlin, et al.
Published: (2024)
Generating High-Quality Datasets for Code Editing via Open-Source Language Models
by: Zhang, Zekai, et al.
Published: (2025)
by: Zhang, Zekai, et al.
Published: (2025)
IncreRTL: Traceability-Guided Incremental RTL Generation under Requirement Evolution
by: Chen, Luanrong, et al.
Published: (2026)
by: Chen, Luanrong, et al.
Published: (2026)
Cross-level Requirement Traceability: A Novel Approach Integrating Bag-of-Words and Word Embedding for Enhanced Similarity Functionality
by: Mohammad, Baher, et al.
Published: (2024)
by: Mohammad, Baher, et al.
Published: (2024)
WebCoderBench: Benchmarking Web Application Generation with Comprehensive and Interpretable Evaluation Metrics
by: Liu, Chenxu, et al.
Published: (2026)
by: Liu, Chenxu, et al.
Published: (2026)
Rethinking Testing for LLM Applications: Characteristics, Challenges, and a Lightweight Interaction Protocol
by: Ma, Wei, et al.
Published: (2025)
by: Ma, Wei, et al.
Published: (2025)
An Ontology-based Approach Towards Traceable Behavior Specifications in Automated Driving
by: Salem, Nayel Fabian, et al.
Published: (2024)
by: Salem, Nayel Fabian, et al.
Published: (2024)
iPanda: An LLM-based Agent for Automated Conformance Testing of Communication Protocols
by: Sun, Xikai, et al.
Published: (2025)
by: Sun, Xikai, et al.
Published: (2025)
When the Specification Emerges: Benchmarking Faithfulness Loss in Long-Horizon Coding Agents
by: Yan, Lu, et al.
Published: (2026)
by: Yan, Lu, et al.
Published: (2026)
Enhancing Interpretability in Software Change Management with Chain-of-Thought Reasoning
by: Sun, Yongqian, et al.
Published: (2025)
by: Sun, Yongqian, et al.
Published: (2025)
Vul-RAG: Enhancing LLM-based Vulnerability Detection via Knowledge-level RAG
by: Du, Xueying, et al.
Published: (2024)
by: Du, Xueying, et al.
Published: (2024)
ShortCoder: Knowledge-Augmented Syntax Optimization for Token-Efficient Code Generation
by: Liu, Sicong, et al.
Published: (2026)
by: Liu, Sicong, et al.
Published: (2026)
Validating LLM-Generated Programs with Metamorphic Prompt Testing
by: Wang, Xiaoyin, et al.
Published: (2024)
by: Wang, Xiaoyin, et al.
Published: (2024)
Neuro-Symbolic Generation and Validation of Memory-Aware Formal Function Specifications
by: Zhang, Liao, et al.
Published: (2026)
by: Zhang, Liao, et al.
Published: (2026)
FailureMem: A Failure-Aware Multimodal Framework for Autonomous Software Repair
by: Ma, Ruize, et al.
Published: (2026)
by: Ma, Ruize, et al.
Published: (2026)
Agora: Toward Autonomous Bug Detection in Production-Level Consensus Protocols with LLM Agents
by: Liu, Xiang, et al.
Published: (2026)
by: Liu, Xiang, et al.
Published: (2026)
Nova: Generative Language Models for Assembly Code with Hierarchical Attention and Contrastive Learning
by: Jiang, Nan, et al.
Published: (2023)
by: Jiang, Nan, et al.
Published: (2023)
Verification and Validation of Autonomous Systems
by: Shetiya, Sneha Sudhir, et al.
Published: (2024)
by: Shetiya, Sneha Sudhir, et al.
Published: (2024)
QuanTest: Entanglement-Guided Testing of Quantum Neural Network Systems
by: Shi, Jinjing, et al.
Published: (2024)
by: Shi, Jinjing, et al.
Published: (2024)
S3LLM: Large-Scale Scientific Software Understanding with LLMs using Source, Metadata, and Document
by: Shaik, Kareem, et al.
Published: (2024)
by: Shaik, Kareem, et al.
Published: (2024)
Learning From Developers: Towards Reliable Patch Validation at Scale for Linux
by: Lin, Chih-En, et al.
Published: (2026)
by: Lin, Chih-En, et al.
Published: (2026)
EvoC2Rust: A Skeleton-guided Framework for Project-Level C-to-Rust Translation
by: Wang, Chaofan, et al.
Published: (2025)
by: Wang, Chaofan, et al.
Published: (2025)
Similar Items
-
Large Language Models for Validating Network Protocol Parsers
by: Zheng, Mingwei, et al.
Published: (2025) -
CoRe: Benchmarking LLMs Code Reasoning Capabilities through Static Analysis Tasks
by: Xie, Danning, et al.
Published: (2025) -
RFCAudit: An LLM Agent for Functional Bug Detection in Network Protocols
by: Zheng, Mingwei, et al.
Published: (2025) -
Raw Pointer Rewriting with LLMs for Translating C to Safer Rust
by: Gao, Yifei, et al.
Published: (2025) -
On Interpreting the Effectiveness of Unsupervised Software Traceability with Information Theory
by: Palacio, David N., et al.
Published: (2024)