| Main Authors: | Eghbali, Aryaz; Pradel, Michael |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.01701 |
Similar Items
PoCGen: Generating Proof-of-Concept Exploits for Vulnerabilities in Npm Packages
by: Simsek, Deniz, et al.
Published: (2025)
Reducing Hallucinations in LLM-Generated Code via Semantic Triangulation
by: Dai, Yihan, et al.
Published: (2025)
LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation
by: Zhang, Ziyao, et al.
Published: (2024)
Evaluating LLM Agents on Automated Software Analysis Tasks
by: Bouzenia, Islem, et al.
Published: (2026)
Hallucination by Code Generation LLMs: Taxonomy, Benchmarks, Mitigation, and Challenges
by: Lee, Yunseo, et al.
Published: (2025)
Towards Mitigating API Hallucination in Code Generated by LLMs with Hierarchical Dependency Aware
by: Chen, Yujia, et al.
Published: (2025)
CodeMapper: A Language-Agnostic Approach to Mapping Code Regions Across Commits
by: Hu, Huimin, et al.
Published: (2025)
Code Hallucination
by: Rahman, Mirza Masfiqur, et al.
Published: (2024)
Detecting and Correcting Hallucinations in LLM-Generated Code via Deterministic AST Analysis
by: Khati, Dipin, et al.
Published: (2026)
Beyond Functional Correctness: Exploring Hallucinations in LLM-Generated Code
by: Liu, Fang, et al.
Published: (2024)
Hallucination in LLM-Based Code Generation: An Automotive Case Study
by: Pavel, Marc, et al.
Published: (2025)
ChangeGuard: Validating Code Changes via Pairwise Learning-Guided Execution
by: Gröninger, Lars, et al.
Published: (2024)
Eliminating Hallucination-Induced Errors in LLM Code Generation with Functional Clustering
by: Ravuri, Chaitanya, et al.
Published: (2025)
Citation-Grounded Code Comprehension: Preventing LLM Hallucination Through Hybrid Retrieval and Graph-Augmented Context
by: Arafat, Jahidul
Published: (2025)
Testora: Using Natural Language Intent to Detect Behavioral Regressions
by: Pradel, Michael
Published: (2025)
An Empirical Analysis of Static Analysis Methods for Detection and Mitigation of Code Library Hallucinations
by: Miranda-Pena, Clarissa, et al.
Published: (2026)
Hallucination Detection for LLM-based Text-to-SQL Generation via Two-Stage Metamorphic Testing
by: Yang, Bo, et al.
Published: (2025)
You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary Projects
by: Bouzenia, Islem, et al.
Published: (2024)
CodeHalu: Investigating Code Hallucinations in LLMs via Execution-based Verification
by: Tian, Yuchen, et al.
Published: (2024)
CodeMirage: Hallucinations in Code Generated by Large Language Models
by: Agarwal, Vibhor, et al.
Published: (2024)
A Systematic Literature Review of Code Hallucinations in LLMs: Characterization, Mitigation Methods, Challenges, and Future Directions for Reliable AI
by: Gao, Cuiyun, et al.
Published: (2025)
CodeCureAgent: Automatic Classification and Repair of Static Analysis Warnings
by: Joos, Pascal, et al.
Published: (2025)
Artisan: Agentic Artifact Evaluation
by: Baek, Doehyun, et al.
Published: (2026)
RepairAgent: An Autonomous, LLM-Based Agent for Program Repair
by: Bouzenia, Islem, et al.
Published: (2024)
Empirical Analysis and Detection of Hallucinations in LLM-Generated Bug Report Summaries
by: Nirujan, Hinduja, et al.
Published: (2026)
Hallucinations in Code Change to Natural Language Generation: Prevalence and Evaluation of Detection Metrics
by: Liu, Chunhua, et al.
Published: (2025)
NoCode-bench: A Benchmark for Evaluating Natural Language-Driven Feature Addition
by: Deng, Le, et al.
Published: (2025)
Issue2Test: Generating Reproducing Test Cases from Issue Reports
by: Nashid, Noor, et al.
Published: (2025)
Are "Solved Issues" in SWE-bench Really Solved Correctly? An Empirical Study
by: Wang, You, et al.
Published: (2025)
RippleGUItester: Change-Aware Exploratory Testing
by: Su, Yanqi, et al.
Published: (2026)
Names Are All You Need: Effective and Safe Regression Test Selection for Python
by: Wang, You, et al.
Published: (2026)
Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories
by: Bouzenia, Islem, et al.
Published: (2025)
Treefix: Enabling Execution with a Tree of Prefixes
by: Souza, Beatriz, et al.
Published: (2025)
AgentStepper: Interactive Debugging of Software Development Agents
by: Hutter, Robert, et al.
Published: (2026)
Hallucination to Consensus: Multi-Agent LLMs for End-to-End JUnit Test Generation
by: Xu, Qinghua, et al.
Published: (2025)
DyPyBench: A Benchmark of Executable Python Software
by: Bouzenia, Islem, et al.
Published: (2024)
LLM-Based Repair of Static Nullability Errors
by: Karimipour, Nima, et al.
Published: (2025)
Analyzing Quantum Programs with LintQ: A Static Analysis Framework for Qiskit
by: Paltenghi, Matteo, et al.
Published: (2023)
Automatic Generation of Benchmarks and Reliable LLM Judgment for Code Tasks
by: Farchi, Eitan, et al.
Published: (2024)
ETF: An Entity Tracing Framework for Hallucination Detection in Code Summaries
by: Maharaj, Kishan, et al.
Published: (2024)