:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Abdollahi, Mohammad, Tasnia, Khandaker Rifah, Saha, Soumit Kanti, Yang, Jinqiu, Wang, Song, Hemmati, Hadi
Formato:	Preprint
Publicado:	2025
Materias:	Software Engineering
Acceso en línea:	https://arxiv.org/abs/2512.00215
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

Beyond Translation Accuracy: Addressing False Failures in LLM-Based Code Translation
por: Rabbi, Fazle, et al.
Publicado: (2026)

Specification-Driven Code Translation Powered by Large Language Models: How Far Are We?
por: Saha, Soumit Kanti, et al.
Publicado: (2024)

BabelCoder: Agentic Code Translation with Specification Alignment
por: Rabbi, Fazle, et al.
Publicado: (2025)

An Empirical Study on Bug Severity Estimation using Source Code Metrics and Static Analysis
por: Mashhadi, Ehsan, et al.
Publicado: (2022)

Prompt Engineering or Fine-Tuning: An Empirical Assessment of LLMs for Code
por: Shin, Jiho, et al.
Publicado: (2023)

Enhancing LLM-Based Code Generation with Complexity Metrics: A Feedback-Driven Approach
por: Sepidband, Melika, et al.
Publicado: (2025)

Engineering Pitfalls in AI Coding Tools: An Empirical Study of Bugs in Claude Code, Codex, and Gemini CLI
por: Zhang, Ruixin, et al.
Publicado: (2026)

Bias Unveiled: Investigating Social Bias in LLM-Generated Code
por: Ling, Lin, et al.
Publicado: (2024)

Deep-Bench: Deep Learning Benchmark Dataset for Code Generation
por: Daghighfarsoodeh, Alireza, et al.
Publicado: (2025)

Domain Adaptation for Code Model-based Unit Test Case Generation
por: Shin, Jiho, et al.
Publicado: (2023)

A Multi-Language Perspective on the Robustness of LLM Code Generation
por: Rabbi, Fazle, et al.
Publicado: (2025)

Demystifying Faulty Code with LLM: Step-by-Step Reasoning for Explainable Fault Localization
por: Widyasari, Ratnadira, et al.
Publicado: (2024)

SWE-Refactor: A Repository-Level Benchmark for Real-World LLM-Based Code Refactoring
por: Xu, Yisen, et al.
Publicado: (2026)

CFCEval: Evaluating Security Aspects in Code Generated by Large Language Models
por: Cheng, Cheng, et al.
Publicado: (2025)

COBOLAssist: Analyzing and Fixing Compilation Errors for LLM-Powered COBOL Code Generation
por: Dau, Anh T. V., et al.
Publicado: (2026)

Tracking the Evolution of Static Code Warnings: the State-of-the-Art and a Better Approach
por: Li, Junjie, et al.
Publicado: (2022)

RGFL: Reasoning Guided Fault Localization for Automated Program Repair Using Large Language Models
por: Sepidband, Melika, et al.
Publicado: (2026)

On the Role of Fault Localization Context for LLM-Based Program Repair
por: Sepidband, Melika, et al.
Publicado: (2026)

NeuroFlake: A Neuro-Symbolic LLM Framework for Flaky Test Classification
por: Hoque, Khondaker Tasnia, et al.
Publicado: (2026)

Social Bias in LLM-Generated Code: Benchmark and Mitigation
por: Rabbi, Fazle, et al.
Publicado: (2026)

HEJ-Robust: A Robustness Benchmark for LLM-Based Automated Program Repair
por: Rabbi, Fazle, et al.
Publicado: (2026)

Program Slicing in the Era of Large Language Models
por: Shahandashti, Kimya Khakzad, et al.
Publicado: (2024)

FlakyFix: Using Large Language Models for Predicting Flaky Test Fix Categories and Test Code Repair
por: Fatima, Sakina, et al.
Publicado: (2023)

StepCodeReasoner: Aligning Code Reasoning with Stepwise Execution Traces via Reinforcement Learning
por: Wang, Hao, et al.
Publicado: (2026)

Can ChatGPT Support Developers? An Empirical Evaluation of Large Language Models for Code Generation
por: Jin, Kailun, et al.
Publicado: (2024)

Secure-Instruct: An Automated Pipeline for Synthesizing Instruction-Tuning Datasets Using LLMs for Secure Code Generation
por: Li, Junjie, et al.
Publicado: (2025)

Assessing Evaluation Metrics for Neural Test Oracle Generation
por: Shin, Jiho, et al.
Publicado: (2023)

A Systematic Mapping Study of Crowd Knowledge Enhanced Software Engineering Research Using Stack Overflow
por: Tanzil, Minaoar, et al.
Publicado: (2024)

Consistency Meets Verification: Enhancing Test Generation Quality in Large Language Models Without Ground-Truth Solutions
por: Taherkhani, Hamed, et al.
Publicado: (2026)

EnvTrace: Simulation-Based Semantic Evaluation of LLM Code via Execution Trace Alignment -- Demonstrated at Synchrotron Beamlines
por: van der Vleuten, Noah, et al.
Publicado: (2025)

Perception-Guided Fuzzing for Simulated Scenario-Based Testing of Autonomous Driving Systems
por: Pham, Tri Minh Triet, et al.
Publicado: (2024)

An Empirical Study of LLM-Based Code Clone Detection
por: Zhu, Wenqing, et al.
Publicado: (2025)

Code Copycat Conundrum: Demystifying Repetition in LLM-based Code Generation
por: Liu, Mingwei, et al.
Publicado: (2025)

ABTest: Behavior-Driven Testing for AI Coding Agents
por: Dai, Wuyang, et al.
Publicado: (2026)

Automated Prompt Engineering for Cost-Effective Code Generation Using Evolutionary Algorithm
por: Taherkhani, Hamed, et al.
Publicado: (2024)

Decoding Human-LLM Collaboration in Coding: An Empirical Study of Multi-Turn Conversations in the Wild
por: Zhang, Binquan, et al.
Publicado: (2025)

Assessing Coherency and Consistency of Code Execution Reasoning by Large Language Models
por: Liu, Changshu, et al.
Publicado: (2025)

An Empirical Study of Interaction Smells in Multi-Turn Human-LLM Collaborative Code Generation
por: Zhang, Binquan, et al.
Publicado: (2026)

Rethinking Code Review Workflows with LLM Assistance: An Empirical Study
por: Aðalsteinsson, Fannar Steinn, et al.
Publicado: (2025)

RocketPPA: Code-Level Power, Performance, and Area Prediction via LLM and Mixture of Experts
por: Abdollahi, Armin, et al.
Publicado: (2025)