Library Catalog

Bibliographic Details
Main Authors:	Eghbali, Aryaz; Pradel, Michael
Format:	Preprint
Published:	2024
Subjects:	Software Engineering
Online Access:	https://arxiv.org/abs/2401.01701

Similar Items

PoCGen: Generating Proof-of-Concept Exploits for Vulnerabilities in Npm Packages
by: Simsek, Deniz, et al.
Published: (2025)

Reducing Hallucinations in LLM-Generated Code via Semantic Triangulation
by: Dai, Yihan, et al.
Published: (2025)

LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation
by: Zhang, Ziyao, et al.
Published: (2024)

Evaluating LLM Agents on Automated Software Analysis Tasks
by: Bouzenia, Islem, et al.
Published: (2026)

Hallucination by Code Generation LLMs: Taxonomy, Benchmarks, Mitigation, and Challenges
by: Lee, Yunseo, et al.
Published: (2025)

Towards Mitigating API Hallucination in Code Generated by LLMs with Hierarchical Dependency Aware
by: Chen, Yujia, et al.
Published: (2025)

CodeMapper: A Language-Agnostic Approach to Mapping Code Regions Across Commits
by: Hu, Huimin, et al.
Published: (2025)

Code Hallucination
by: Rahman, Mirza Masfiqur, et al.
Published: (2024)

Detecting and Correcting Hallucinations in LLM-Generated Code via Deterministic AST Analysis
by: Khati, Dipin, et al.
Published: (2026)

Beyond Functional Correctness: Exploring Hallucinations in LLM-Generated Code
by: Liu, Fang, et al.
Published: (2024)

Hallucination in LLM-Based Code Generation: An Automotive Case Study
by: Pavel, Marc, et al.
Published: (2025)

ChangeGuard: Validating Code Changes via Pairwise Learning-Guided Execution
by: Gröninger, Lars, et al.
Published: (2024)

Eliminating Hallucination-Induced Errors in LLM Code Generation with Functional Clustering
by: Ravuri, Chaitanya, et al.
Published: (2025)

Citation-Grounded Code Comprehension: Preventing LLM Hallucination Through Hybrid Retrieval and Graph-Augmented Context
by: Arafat, Jahidul
Published: (2025)

Testora: Using Natural Language Intent to Detect Behavioral Regressions
by: Pradel, Michael
Published: (2025)

An Empirical Analysis of Static Analysis Methods for Detection and Mitigation of Code Library Hallucinations
by: Miranda-Pena, Clarissa, et al.
Published: (2026)

Hallucination Detection for LLM-based Text-to-SQL Generation via Two-Stage Metamorphic Testing
by: Yang, Bo, et al.
Published: (2025)

You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary Projects
by: Bouzenia, Islem, et al.
Published: (2024)

CodeHalu: Investigating Code Hallucinations in LLMs via Execution-based Verification
by: Tian, Yuchen, et al.
Published: (2024)

CodeMirage: Hallucinations in Code Generated by Large Language Models
by: Agarwal, Vibhor, et al.
Published: (2024)

A Systematic Literature Review of Code Hallucinations in LLMs: Characterization, Mitigation Methods, Challenges, and Future Directions for Reliable AI
by: Gao, Cuiyun, et al.
Published: (2025)

CodeCureAgent: Automatic Classification and Repair of Static Analysis Warnings
by: Joos, Pascal, et al.
Published: (2025)

Artisan: Agentic Artifact Evaluation
by: Baek, Doehyun, et al.
Published: (2026)

RepairAgent: An Autonomous, LLM-Based Agent for Program Repair
by: Bouzenia, Islem, et al.
Published: (2024)

Empirical Analysis and Detection of Hallucinations in LLM-Generated Bug Report Summaries
by: Nirujan, Hinduja, et al.
Published: (2026)

Hallucinations in Code Change to Natural Language Generation: Prevalence and Evaluation of Detection Metrics
by: Liu, Chunhua, et al.
Published: (2025)

NoCode-bench: A Benchmark for Evaluating Natural Language-Driven Feature Addition
by: Deng, Le, et al.
Published: (2025)

Issue2Test: Generating Reproducing Test Cases from Issue Reports
by: Nashid, Noor, et al.
Published: (2025)

Are "Solved Issues" in SWE-bench Really Solved Correctly? An Empirical Study
by: Wang, You, et al.
Published: (2025)

RippleGUItester: Change-Aware Exploratory Testing
by: Su, Yanqi, et al.
Published: (2026)

Names Are All You Need: Effective and Safe Regression Test Selection for Python
by: Wang, You, et al.
Published: (2026)

Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories
by: Bouzenia, Islem, et al.
Published: (2025)

Treefix: Enabling Execution with a Tree of Prefixes
by: Souza, Beatriz, et al.
Published: (2025)

AgentStepper: Interactive Debugging of Software Development Agents
by: Hutter, Robert, et al.
Published: (2026)

Hallucination to Consensus: Multi-Agent LLMs for End-to-End JUnit Test Generation
by: Xu, Qinghua, et al.
Published: (2025)

DyPyBench: A Benchmark of Executable Python Software
by: Bouzenia, Islem, et al.
Published: (2024)

LLM-Based Repair of Static Nullability Errors
by: Karimipour, Nima, et al.
Published: (2025)

Analyzing Quantum Programs with LintQ: A Static Analysis Framework for Qiskit
by: Paltenghi, Matteo, et al.
Published: (2023)

Automatic Generation of Benchmarks and Reliable LLM Judgment for Code Tasks
by: Farchi, Eitan, et al.
Published: (2024)

ETF: An Entity Tracing Framework for Hallucination Detection in Code Summaries
by: Maharaj, Kishan, et al.
Published: (2024)