:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yu, Bihui, Xu, Xinglong, Jiang, Junjie, Cheng, Jiabei, Jia, Caijun, Li, Siyuan, He, Conghui, Wei, Jingxuan, Tan, Cheng
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence Software Engineering
Online Access:	https://arxiv.org/abs/2605.10341
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from Raw Corpora
by: Pan, Chenkai, et al.
Published: (2026)

The Impact of Documentation on Test Engagement in Pull Requests in OSS
by: Amore, Teal, et al.
Published: (2026)

GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models
by: Wei, Jingxuan, et al.
Published: (2025)

Automated Modernization of Machine Learning Engineering Notebooks for Reproducibility
by: Jin, Bihui, et al.
Published: (2026)

One Documentation Does Not Fit All: Case Study of TensorFlow Documentation
by: Thirimanne, Sharuka Promodya, et al.
Published: (2025)

Energy-Efficient Software Development: A Multi-dimensional Empirical Analysis of Stack Overflow
by: Jin, Bihui, et al.
Published: (2024)

Towards a Human-in-the-Loop Framework for Reliable Patch Evaluation Using an LLM-as-a-Judge
by: Shi, Sherry, et al.
Published: (2025)

Seeing is Believing: Vision-driven Non-crash Functional Bug Detection for Mobile Apps
by: Liu, Zhe, et al.
Published: (2024)

One Size Does Not Fit All: Investigating Efficacy of Perplexity in Detecting LLM-Generated Code
by: Xu, Jinwei, et al.
Published: (2024)

PAGER: Bridging the Semantic-Execution Gap in Point-Precise Geometric GUI Control
by: Wei, Jingxuan, et al.
Published: (2026)

Symbol Preference Aware Generative Models for Recovering Variable Names from Stripped Binary
by: Xu, Xiangzhe, et al.
Published: (2023)

Bridging the Programming Language Gap: Constructing a Multilingual Shared Semantic Space through AST Unification and Graph Matching
by: Chen, Junhao, et al.
Published: (2026)

Suggesting Code Edits in Interactive Machine Learning Notebooks Using Large Language Models
by: Jin, Bihui, et al.
Published: (2025)

Impact of Extensions on Browser Performance: An Empirical Study on Google Chrome
by: Jin, Bihui, et al.
Published: (2024)

CodeCSE: A Simple Multilingual Model for Code and Comment Sentence Embeddings
by: Varkey, Anthony, et al.
Published: (2024)

Utilizing Deep Learning to Optimize Software Development Processes
by: Li, Keqin, et al.
Published: (2024)

From Prompts to Templates: A Systematic Prompt Template Analysis for Real-world LLMapps
by: Mao, Yuetian, et al.
Published: (2025)

Detect--Repair--Verify for LLM-Generated Code: A Multi-Language, Multi-Granularity Empirical Study
by: Cheng, Cheng
Published: (2026)

Detect Repair Verify for Securing LLM Generated Code: A Multi-Language Empirical Study
by: Cheng, Cheng
Published: (2026)

aiXcoder-7B-v2: Training LLMs to Fully Utilize the Long Context in Repository-level Code Completion
by: Li, Jia, et al.
Published: (2025)

OptiLoop: Coordination-in-the-Loop Verification and Repair for LLM-Generated Optimization Agents
by: Xu, Yujia, et al.
Published: (2026)

Software Testing with Large Language Models: Survey, Landscape, and Vision
by: Wang, Junjie, et al.
Published: (2023)

Inconsistencies in TeX-Produced Documents
by: Tan, Jovyn, et al.
Published: (2024)

Large Language Models for Code: Security Hardening and Adversarial Testing
by: He, Jingxuan, et al.
Published: (2023)

Environment-in-the-Loop: Rethinking Code Migration with LLM-based Agents
by: Li, Xiang, et al.
Published: (2026)

CFCEval: Evaluating Security Aspects in Code Generated by Large Language Models
by: Cheng, Cheng, et al.
Published: (2025)

Dependency-Aware Code Naturalness
by: Yang, Chen, et al.
Published: (2024)

An Exploratory Study on Fine-Tuning Large Language Models for Secure Code Generation
by: Li, Junjie, et al.
Published: (2024)

On the Effectiveness of Training Data Optimization for LLM-based Code Generation: An Empirical Study
by: Kuang, Shiqi, et al.
Published: (2025)

Exploring Scientific Debt: Harnessing AI for SATD Identification in Scientific Software
by: Melin, Eric L., et al.
Published: (2025)

Towards Richer Challenge Problems for Scientific Computing Correctness
by: Sottile, Matthew, et al.
Published: (2025)

ROCAS: Root Cause Analysis of Autonomous Driving Accidents via Cyber-Physical Co-mutation
by: Feng, Shiwei, et al.
Published: (2024)

Improving Code Translation with Syntax-Guided and Semantic-aware Preference Optimization
by: Wu, Yuhan, et al.
Published: (2026)

A Comprehensive Study on the Use of Word Embedding Models in Software Engineering Domain
by: Chen, Xiaohan, et al.
Published: (2025)

Boosting Automatic Java-to-Cangjie Translation with Multi-Stage LLM Training and Error Repair
by: Liang, Xinyue, et al.
Published: (2026)

Human in the Loop for Fuzz Testing: Literature Review and the Road Ahead
by: Yu, Jiongchi, et al.
Published: (2026)

Human-In-The-Loop Software Development Agents: Challenges and Future Directions
by: Pasuksmit, Jirat, et al.
Published: (2025)

LUNAR: Unsupervised LLM-based Log Parsing
by: Huang, Junjie, et al.
Published: (2024)

Sustaining Research Software: A Fitness Function Approach
by: Zech, Philipp, et al.
Published: (2025)

Evolution of Kernels: Automated RISC-V Kernel Optimization with Large Language Models
by: Chen, Siyuan, et al.
Published: (2025)