Saved in:
| Main Authors: | Yu, Bihui, Xu, Xinglong, Jiang, Junjie, Cheng, Jiabei, Jia, Caijun, Li, Siyuan, He, Conghui, Wei, Jingxuan, Tan, Cheng |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.10341 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from Raw Corpora
by: Pan, Chenkai, et al.
Published: (2026)
by: Pan, Chenkai, et al.
Published: (2026)
The Impact of Documentation on Test Engagement in Pull Requests in OSS
by: Amore, Teal, et al.
Published: (2026)
by: Amore, Teal, et al.
Published: (2026)
GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models
by: Wei, Jingxuan, et al.
Published: (2025)
by: Wei, Jingxuan, et al.
Published: (2025)
Automated Modernization of Machine Learning Engineering Notebooks for Reproducibility
by: Jin, Bihui, et al.
Published: (2026)
by: Jin, Bihui, et al.
Published: (2026)
One Documentation Does Not Fit All: Case Study of TensorFlow Documentation
by: Thirimanne, Sharuka Promodya, et al.
Published: (2025)
by: Thirimanne, Sharuka Promodya, et al.
Published: (2025)
Energy-Efficient Software Development: A Multi-dimensional Empirical Analysis of Stack Overflow
by: Jin, Bihui, et al.
Published: (2024)
by: Jin, Bihui, et al.
Published: (2024)
Towards a Human-in-the-Loop Framework for Reliable Patch Evaluation Using an LLM-as-a-Judge
by: Shi, Sherry, et al.
Published: (2025)
by: Shi, Sherry, et al.
Published: (2025)
Seeing is Believing: Vision-driven Non-crash Functional Bug Detection for Mobile Apps
by: Liu, Zhe, et al.
Published: (2024)
by: Liu, Zhe, et al.
Published: (2024)
One Size Does Not Fit All: Investigating Efficacy of Perplexity in Detecting LLM-Generated Code
by: Xu, Jinwei, et al.
Published: (2024)
by: Xu, Jinwei, et al.
Published: (2024)
PAGER: Bridging the Semantic-Execution Gap in Point-Precise Geometric GUI Control
by: Wei, Jingxuan, et al.
Published: (2026)
by: Wei, Jingxuan, et al.
Published: (2026)
Symbol Preference Aware Generative Models for Recovering Variable Names from Stripped Binary
by: Xu, Xiangzhe, et al.
Published: (2023)
by: Xu, Xiangzhe, et al.
Published: (2023)
Bridging the Programming Language Gap: Constructing a Multilingual Shared Semantic Space through AST Unification and Graph Matching
by: Chen, Junhao, et al.
Published: (2026)
by: Chen, Junhao, et al.
Published: (2026)
Suggesting Code Edits in Interactive Machine Learning Notebooks Using Large Language Models
by: Jin, Bihui, et al.
Published: (2025)
by: Jin, Bihui, et al.
Published: (2025)
Impact of Extensions on Browser Performance: An Empirical Study on Google Chrome
by: Jin, Bihui, et al.
Published: (2024)
by: Jin, Bihui, et al.
Published: (2024)
CodeCSE: A Simple Multilingual Model for Code and Comment Sentence Embeddings
by: Varkey, Anthony, et al.
Published: (2024)
by: Varkey, Anthony, et al.
Published: (2024)
Utilizing Deep Learning to Optimize Software Development Processes
by: Li, Keqin, et al.
Published: (2024)
by: Li, Keqin, et al.
Published: (2024)
From Prompts to Templates: A Systematic Prompt Template Analysis for Real-world LLMapps
by: Mao, Yuetian, et al.
Published: (2025)
by: Mao, Yuetian, et al.
Published: (2025)
Detect--Repair--Verify for LLM-Generated Code: A Multi-Language, Multi-Granularity Empirical Study
by: Cheng, Cheng
Published: (2026)
by: Cheng, Cheng
Published: (2026)
Detect Repair Verify for Securing LLM Generated Code: A Multi-Language Empirical Study
by: Cheng, Cheng
Published: (2026)
by: Cheng, Cheng
Published: (2026)
aiXcoder-7B-v2: Training LLMs to Fully Utilize the Long Context in Repository-level Code Completion
by: Li, Jia, et al.
Published: (2025)
by: Li, Jia, et al.
Published: (2025)
OptiLoop: Coordination-in-the-Loop Verification and Repair for LLM-Generated Optimization Agents
by: Xu, Yujia, et al.
Published: (2026)
by: Xu, Yujia, et al.
Published: (2026)
Software Testing with Large Language Models: Survey, Landscape, and Vision
by: Wang, Junjie, et al.
Published: (2023)
by: Wang, Junjie, et al.
Published: (2023)
Inconsistencies in TeX-Produced Documents
by: Tan, Jovyn, et al.
Published: (2024)
by: Tan, Jovyn, et al.
Published: (2024)
Large Language Models for Code: Security Hardening and Adversarial Testing
by: He, Jingxuan, et al.
Published: (2023)
by: He, Jingxuan, et al.
Published: (2023)
Environment-in-the-Loop: Rethinking Code Migration with LLM-based Agents
by: Li, Xiang, et al.
Published: (2026)
by: Li, Xiang, et al.
Published: (2026)
CFCEval: Evaluating Security Aspects in Code Generated by Large Language Models
by: Cheng, Cheng, et al.
Published: (2025)
by: Cheng, Cheng, et al.
Published: (2025)
Dependency-Aware Code Naturalness
by: Yang, Chen, et al.
Published: (2024)
by: Yang, Chen, et al.
Published: (2024)
An Exploratory Study on Fine-Tuning Large Language Models for Secure Code Generation
by: Li, Junjie, et al.
Published: (2024)
by: Li, Junjie, et al.
Published: (2024)
On the Effectiveness of Training Data Optimization for LLM-based Code Generation: An Empirical Study
by: Kuang, Shiqi, et al.
Published: (2025)
by: Kuang, Shiqi, et al.
Published: (2025)
Exploring Scientific Debt: Harnessing AI for SATD Identification in Scientific Software
by: Melin, Eric L., et al.
Published: (2025)
by: Melin, Eric L., et al.
Published: (2025)
Towards Richer Challenge Problems for Scientific Computing Correctness
by: Sottile, Matthew, et al.
Published: (2025)
by: Sottile, Matthew, et al.
Published: (2025)
ROCAS: Root Cause Analysis of Autonomous Driving Accidents via Cyber-Physical Co-mutation
by: Feng, Shiwei, et al.
Published: (2024)
by: Feng, Shiwei, et al.
Published: (2024)
Improving Code Translation with Syntax-Guided and Semantic-aware Preference Optimization
by: Wu, Yuhan, et al.
Published: (2026)
by: Wu, Yuhan, et al.
Published: (2026)
A Comprehensive Study on the Use of Word Embedding Models in Software Engineering Domain
by: Chen, Xiaohan, et al.
Published: (2025)
by: Chen, Xiaohan, et al.
Published: (2025)
Boosting Automatic Java-to-Cangjie Translation with Multi-Stage LLM Training and Error Repair
by: Liang, Xinyue, et al.
Published: (2026)
by: Liang, Xinyue, et al.
Published: (2026)
Human in the Loop for Fuzz Testing: Literature Review and the Road Ahead
by: Yu, Jiongchi, et al.
Published: (2026)
by: Yu, Jiongchi, et al.
Published: (2026)
Human-In-The-Loop Software Development Agents: Challenges and Future Directions
by: Pasuksmit, Jirat, et al.
Published: (2025)
by: Pasuksmit, Jirat, et al.
Published: (2025)
LUNAR: Unsupervised LLM-based Log Parsing
by: Huang, Junjie, et al.
Published: (2024)
by: Huang, Junjie, et al.
Published: (2024)
Sustaining Research Software: A Fitness Function Approach
by: Zech, Philipp, et al.
Published: (2025)
by: Zech, Philipp, et al.
Published: (2025)
Evolution of Kernels: Automated RISC-V Kernel Optimization with Large Language Models
by: Chen, Siyuan, et al.
Published: (2025)
by: Chen, Siyuan, et al.
Published: (2025)
Similar Items
-
Programming with Data: Test-Driven Data Engineering for Self-Improving LLMs from Raw Corpora
by: Pan, Chenkai, et al.
Published: (2026) -
The Impact of Documentation on Test Engagement in Pull Requests in OSS
by: Amore, Teal, et al.
Published: (2026) -
GGBench: A Geometric Generative Reasoning Benchmark for Unified Multimodal Models
by: Wei, Jingxuan, et al.
Published: (2025) -
Automated Modernization of Machine Learning Engineering Notebooks for Reproducibility
by: Jin, Bihui, et al.
Published: (2026) -
One Documentation Does Not Fit All: Case Study of TensorFlow Documentation
by: Thirimanne, Sharuka Promodya, et al.
Published: (2025)