Guardado en:
| Autores principales: | Abdollahi, Mohammad, Tasnia, Khandaker Rifah, Saha, Soumit Kanti, Yang, Jinqiu, Wang, Song, Hemmati, Hadi |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2512.00215 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Beyond Translation Accuracy: Addressing False Failures in LLM-Based Code Translation
por: Rabbi, Fazle, et al.
Publicado: (2026)
por: Rabbi, Fazle, et al.
Publicado: (2026)
Specification-Driven Code Translation Powered by Large Language Models: How Far Are We?
por: Saha, Soumit Kanti, et al.
Publicado: (2024)
por: Saha, Soumit Kanti, et al.
Publicado: (2024)
BabelCoder: Agentic Code Translation with Specification Alignment
por: Rabbi, Fazle, et al.
Publicado: (2025)
por: Rabbi, Fazle, et al.
Publicado: (2025)
An Empirical Study on Bug Severity Estimation using Source Code Metrics and Static Analysis
por: Mashhadi, Ehsan, et al.
Publicado: (2022)
por: Mashhadi, Ehsan, et al.
Publicado: (2022)
Prompt Engineering or Fine-Tuning: An Empirical Assessment of LLMs for Code
por: Shin, Jiho, et al.
Publicado: (2023)
por: Shin, Jiho, et al.
Publicado: (2023)
Enhancing LLM-Based Code Generation with Complexity Metrics: A Feedback-Driven Approach
por: Sepidband, Melika, et al.
Publicado: (2025)
por: Sepidband, Melika, et al.
Publicado: (2025)
Engineering Pitfalls in AI Coding Tools: An Empirical Study of Bugs in Claude Code, Codex, and Gemini CLI
por: Zhang, Ruixin, et al.
Publicado: (2026)
por: Zhang, Ruixin, et al.
Publicado: (2026)
Bias Unveiled: Investigating Social Bias in LLM-Generated Code
por: Ling, Lin, et al.
Publicado: (2024)
por: Ling, Lin, et al.
Publicado: (2024)
Deep-Bench: Deep Learning Benchmark Dataset for Code Generation
por: Daghighfarsoodeh, Alireza, et al.
Publicado: (2025)
por: Daghighfarsoodeh, Alireza, et al.
Publicado: (2025)
Domain Adaptation for Code Model-based Unit Test Case Generation
por: Shin, Jiho, et al.
Publicado: (2023)
por: Shin, Jiho, et al.
Publicado: (2023)
A Multi-Language Perspective on the Robustness of LLM Code Generation
por: Rabbi, Fazle, et al.
Publicado: (2025)
por: Rabbi, Fazle, et al.
Publicado: (2025)
Demystifying Faulty Code with LLM: Step-by-Step Reasoning for Explainable Fault Localization
por: Widyasari, Ratnadira, et al.
Publicado: (2024)
por: Widyasari, Ratnadira, et al.
Publicado: (2024)
SWE-Refactor: A Repository-Level Benchmark for Real-World LLM-Based Code Refactoring
por: Xu, Yisen, et al.
Publicado: (2026)
por: Xu, Yisen, et al.
Publicado: (2026)
CFCEval: Evaluating Security Aspects in Code Generated by Large Language Models
por: Cheng, Cheng, et al.
Publicado: (2025)
por: Cheng, Cheng, et al.
Publicado: (2025)
COBOLAssist: Analyzing and Fixing Compilation Errors for LLM-Powered COBOL Code Generation
por: Dau, Anh T. V., et al.
Publicado: (2026)
por: Dau, Anh T. V., et al.
Publicado: (2026)
Tracking the Evolution of Static Code Warnings: the State-of-the-Art and a Better Approach
por: Li, Junjie, et al.
Publicado: (2022)
por: Li, Junjie, et al.
Publicado: (2022)
RGFL: Reasoning Guided Fault Localization for Automated Program Repair Using Large Language Models
por: Sepidband, Melika, et al.
Publicado: (2026)
por: Sepidband, Melika, et al.
Publicado: (2026)
On the Role of Fault Localization Context for LLM-Based Program Repair
por: Sepidband, Melika, et al.
Publicado: (2026)
por: Sepidband, Melika, et al.
Publicado: (2026)
NeuroFlake: A Neuro-Symbolic LLM Framework for Flaky Test Classification
por: Hoque, Khondaker Tasnia, et al.
Publicado: (2026)
por: Hoque, Khondaker Tasnia, et al.
Publicado: (2026)
Social Bias in LLM-Generated Code: Benchmark and Mitigation
por: Rabbi, Fazle, et al.
Publicado: (2026)
por: Rabbi, Fazle, et al.
Publicado: (2026)
HEJ-Robust: A Robustness Benchmark for LLM-Based Automated Program Repair
por: Rabbi, Fazle, et al.
Publicado: (2026)
por: Rabbi, Fazle, et al.
Publicado: (2026)
Program Slicing in the Era of Large Language Models
por: Shahandashti, Kimya Khakzad, et al.
Publicado: (2024)
por: Shahandashti, Kimya Khakzad, et al.
Publicado: (2024)
FlakyFix: Using Large Language Models for Predicting Flaky Test Fix Categories and Test Code Repair
por: Fatima, Sakina, et al.
Publicado: (2023)
por: Fatima, Sakina, et al.
Publicado: (2023)
StepCodeReasoner: Aligning Code Reasoning with Stepwise Execution Traces via Reinforcement Learning
por: Wang, Hao, et al.
Publicado: (2026)
por: Wang, Hao, et al.
Publicado: (2026)
Can ChatGPT Support Developers? An Empirical Evaluation of Large Language Models for Code Generation
por: Jin, Kailun, et al.
Publicado: (2024)
por: Jin, Kailun, et al.
Publicado: (2024)
Secure-Instruct: An Automated Pipeline for Synthesizing Instruction-Tuning Datasets Using LLMs for Secure Code Generation
por: Li, Junjie, et al.
Publicado: (2025)
por: Li, Junjie, et al.
Publicado: (2025)
Assessing Evaluation Metrics for Neural Test Oracle Generation
por: Shin, Jiho, et al.
Publicado: (2023)
por: Shin, Jiho, et al.
Publicado: (2023)
A Systematic Mapping Study of Crowd Knowledge Enhanced Software Engineering Research Using Stack Overflow
por: Tanzil, Minaoar, et al.
Publicado: (2024)
por: Tanzil, Minaoar, et al.
Publicado: (2024)
Consistency Meets Verification: Enhancing Test Generation Quality in Large Language Models Without Ground-Truth Solutions
por: Taherkhani, Hamed, et al.
Publicado: (2026)
por: Taherkhani, Hamed, et al.
Publicado: (2026)
EnvTrace: Simulation-Based Semantic Evaluation of LLM Code via Execution Trace Alignment -- Demonstrated at Synchrotron Beamlines
por: van der Vleuten, Noah, et al.
Publicado: (2025)
por: van der Vleuten, Noah, et al.
Publicado: (2025)
Perception-Guided Fuzzing for Simulated Scenario-Based Testing of Autonomous Driving Systems
por: Pham, Tri Minh Triet, et al.
Publicado: (2024)
por: Pham, Tri Minh Triet, et al.
Publicado: (2024)
An Empirical Study of LLM-Based Code Clone Detection
por: Zhu, Wenqing, et al.
Publicado: (2025)
por: Zhu, Wenqing, et al.
Publicado: (2025)
Code Copycat Conundrum: Demystifying Repetition in LLM-based Code Generation
por: Liu, Mingwei, et al.
Publicado: (2025)
por: Liu, Mingwei, et al.
Publicado: (2025)
ABTest: Behavior-Driven Testing for AI Coding Agents
por: Dai, Wuyang, et al.
Publicado: (2026)
por: Dai, Wuyang, et al.
Publicado: (2026)
Automated Prompt Engineering for Cost-Effective Code Generation Using Evolutionary Algorithm
por: Taherkhani, Hamed, et al.
Publicado: (2024)
por: Taherkhani, Hamed, et al.
Publicado: (2024)
Decoding Human-LLM Collaboration in Coding: An Empirical Study of Multi-Turn Conversations in the Wild
por: Zhang, Binquan, et al.
Publicado: (2025)
por: Zhang, Binquan, et al.
Publicado: (2025)
Assessing Coherency and Consistency of Code Execution Reasoning by Large Language Models
por: Liu, Changshu, et al.
Publicado: (2025)
por: Liu, Changshu, et al.
Publicado: (2025)
An Empirical Study of Interaction Smells in Multi-Turn Human-LLM Collaborative Code Generation
por: Zhang, Binquan, et al.
Publicado: (2026)
por: Zhang, Binquan, et al.
Publicado: (2026)
Rethinking Code Review Workflows with LLM Assistance: An Empirical Study
por: Aðalsteinsson, Fannar Steinn, et al.
Publicado: (2025)
por: Aðalsteinsson, Fannar Steinn, et al.
Publicado: (2025)
RocketPPA: Code-Level Power, Performance, and Area Prediction via LLM and Mixture of Experts
por: Abdollahi, Armin, et al.
Publicado: (2025)
por: Abdollahi, Armin, et al.
Publicado: (2025)
Ejemplares similares
-
Beyond Translation Accuracy: Addressing False Failures in LLM-Based Code Translation
por: Rabbi, Fazle, et al.
Publicado: (2026) -
Specification-Driven Code Translation Powered by Large Language Models: How Far Are We?
por: Saha, Soumit Kanti, et al.
Publicado: (2024) -
BabelCoder: Agentic Code Translation with Specification Alignment
por: Rabbi, Fazle, et al.
Publicado: (2025) -
An Empirical Study on Bug Severity Estimation using Source Code Metrics and Static Analysis
por: Mashhadi, Ehsan, et al.
Publicado: (2022) -
Prompt Engineering or Fine-Tuning: An Empirical Assessment of LLMs for Code
por: Shin, Jiho, et al.
Publicado: (2023)