Saved in:
| Main Authors: | Xue, Zhantong, Ma, Pingchuan, Wang, Zhaoyu, Zhou, Yuguang, Zhang, Xiaoqin, Wang, Shuai, Rahmel, Juergen |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.11708 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ZK-Value: A Practical Zero-Knowledge System for Verifiable Data Valuation
by: Wang, Zhaoyu, et al.
Published: (2026)
by: Wang, Zhaoyu, et al.
Published: (2026)
RepoMark: A Data-Usage Auditing Framework for Code Large Language Models
by: Qu, Wenjie, et al.
Published: (2025)
by: Qu, Wenjie, et al.
Published: (2025)
A Survey on Evaluating Large Language Models in Code Generation Tasks
by: Chen, Liguo, et al.
Published: (2024)
by: Chen, Liguo, et al.
Published: (2024)
SkillReducer: Optimizing LLM Agent Skills for Token Efficiency
by: Gao, Yudong, et al.
Published: (2026)
by: Gao, Yudong, et al.
Published: (2026)
Self-planning Code Generation with Large Language Models
by: Jiang, Xue, et al.
Published: (2023)
by: Jiang, Xue, et al.
Published: (2023)
Towards Understanding the Characteristics of Code Generation Errors Made by Large Language Models
by: Wang, Zhijie, et al.
Published: (2024)
by: Wang, Zhijie, et al.
Published: (2024)
Exploring Multi-Lingual Bias of Large Code Models in Code Generation
by: Wang, Chaozheng, et al.
Published: (2024)
by: Wang, Chaozheng, et al.
Published: (2024)
Digging Into the Internal: Causality-Based Analysis of LLM Function Calling
by: Ji, Zhenlan, et al.
Published: (2025)
by: Ji, Zhenlan, et al.
Published: (2025)
How Multi-Modal LLMs Reshape Visual Deep Learning Testing? A Comprehensive Study Through the Lens of Image Mutation
by: Wang, Liwen, et al.
Published: (2024)
by: Wang, Liwen, et al.
Published: (2024)
Large Language Model Unlearning for Source Code
by: Jiang, Xue, et al.
Published: (2025)
by: Jiang, Xue, et al.
Published: (2025)
On the Evaluation of Large Language Models in Unit Test Generation
by: Yang, Lin, et al.
Published: (2024)
by: Yang, Lin, et al.
Published: (2024)
CrossPL: Evaluating Large Language Models on Cross Programming Language Code Generation
by: Xiong, Zhanhang, et al.
Published: (2025)
by: Xiong, Zhanhang, et al.
Published: (2025)
SpecEval: Evaluating Code Comprehension in Large Language Models via Program Specifications
by: Ma, Lezhi, et al.
Published: (2024)
by: Ma, Lezhi, et al.
Published: (2024)
ClassEval-T: Evaluating Large Language Models in Class-Level Code Translation
by: Xue, Pengyu, et al.
Published: (2024)
by: Xue, Pengyu, et al.
Published: (2024)
Optimizing Large Language Model Hyperparameters for Code Generation
by: Arora, Chetan, et al.
Published: (2024)
by: Arora, Chetan, et al.
Published: (2024)
CFCEval: Evaluating Security Aspects in Code Generated by Large Language Models
by: Cheng, Cheng, et al.
Published: (2025)
by: Cheng, Cheng, et al.
Published: (2025)
Failure-Aware Enhancements for Large Language Model (LLM) Code Generation: An Empirical Study on Decision Framework
by: Shen, Jianru, et al.
Published: (2026)
by: Shen, Jianru, et al.
Published: (2026)
Task Abstention for Large Language Models in Code Generation
by: Zhou, Yanke, et al.
Published: (2026)
by: Zhou, Yanke, et al.
Published: (2026)
Strengthening Programming Comprehension in Large Language Models through Code Generation
by: Ren, Xiaoning, et al.
Published: (2025)
by: Ren, Xiaoning, et al.
Published: (2025)
Knowledge-Aware Code Generation with Large Language Models
by: Huang, Tao, et al.
Published: (2024)
by: Huang, Tao, et al.
Published: (2024)
QuanBench: Benchmarking Quantum Code Generation with Large Language Models
by: Guo, Xiaoyu, et al.
Published: (2025)
by: Guo, Xiaoyu, et al.
Published: (2025)
Hotfixing Large Language Models for Code
by: Yang, Zhou, et al.
Published: (2024)
by: Yang, Zhou, et al.
Published: (2024)
Flow2Code: Evaluating Large Language Models for Flowchart-based Code Generation Capability
by: He, Mengliang, et al.
Published: (2025)
by: He, Mengliang, et al.
Published: (2025)
Evaluating the Impact of Post-Training Quantization on Large Language Models for Code Generation
by: Giagnorio, Alessandro, et al.
Published: (2025)
by: Giagnorio, Alessandro, et al.
Published: (2025)
Large Language Models as Test Case Generators: Performance Evaluation and Enhancement
by: Li, Kefan, et al.
Published: (2024)
by: Li, Kefan, et al.
Published: (2024)
A Hybrid Approach for EMF Code Generation:Code Templates Meet Large Language Models
by: He, Xiao, et al.
Published: (2025)
by: He, Xiao, et al.
Published: (2025)
Confidentiality-Preserving Verifiable Business Processes through Zero-Knowledge Proofs
by: Kiesel, Jannis, et al.
Published: (2025)
by: Kiesel, Jannis, et al.
Published: (2025)
On the Effectiveness of Large Language Models in Domain-Specific Code Generation
by: Gu, Xiaodong, et al.
Published: (2023)
by: Gu, Xiaodong, et al.
Published: (2023)
Is Your AI-Generated Code Really Safe? Evaluating Large Language Models on Secure Code Generation with CodeSecEval
by: Wang, Jiexin, et al.
Published: (2024)
by: Wang, Jiexin, et al.
Published: (2024)
Ecosystem of Large Language Models for Code
by: Yang, Zhou, et al.
Published: (2024)
by: Yang, Zhou, et al.
Published: (2024)
Chain-of-Thought in Neural Code Generation: From and For Lightweight Language Models
by: Yang, Guang, et al.
Published: (2023)
by: Yang, Guang, et al.
Published: (2023)
A Preliminary Study on the Robustness of Code Generation by Large Language Models
by: Li, Zike, et al.
Published: (2025)
by: Li, Zike, et al.
Published: (2025)
Advancing Code Coverage: Incorporating Program Analysis with Large Language Models
by: Yang, Chen, et al.
Published: (2024)
by: Yang, Chen, et al.
Published: (2024)
ROCODE: Integrating Backtracking Mechanism and Program Analysis in Large Language Models for Code Generation
by: Jiang, Xue, et al.
Published: (2024)
by: Jiang, Xue, et al.
Published: (2024)
Cross-Task Benchmarking and Evaluation of General-Purpose and Code-Specific Large Language Models
by: Das, Gunjan, et al.
Published: (2025)
by: Das, Gunjan, et al.
Published: (2025)
Greening Large Language Models of Code
by: Shi, Jieke, et al.
Published: (2023)
by: Shi, Jieke, et al.
Published: (2023)
Evaluating Generated Commit Messages with Large Language Models
by: Zeng, Qunhong, et al.
Published: (2025)
by: Zeng, Qunhong, et al.
Published: (2025)
Synergistic Enhancement of Requirement-to-Code Traceability: A Framework Combining Large Language Model based Data Augmentation and an Advanced Encoder
by: Zhang, Jianzhang, et al.
Published: (2025)
by: Zhang, Jianzhang, et al.
Published: (2025)
CodeScore: Evaluating Code Generation by Learning Code Execution
by: Dong, Yihong, et al.
Published: (2023)
by: Dong, Yihong, et al.
Published: (2023)
Automated Proof Generation for Rust Code via Self-Evolution
by: Chen, Tianyu, et al.
Published: (2024)
by: Chen, Tianyu, et al.
Published: (2024)
Similar Items
-
ZK-Value: A Practical Zero-Knowledge System for Verifiable Data Valuation
by: Wang, Zhaoyu, et al.
Published: (2026) -
RepoMark: A Data-Usage Auditing Framework for Code Large Language Models
by: Qu, Wenjie, et al.
Published: (2025) -
A Survey on Evaluating Large Language Models in Code Generation Tasks
by: Chen, Liguo, et al.
Published: (2024) -
SkillReducer: Optimizing LLM Agent Skills for Token Efficiency
by: Gao, Yudong, et al.
Published: (2026) -
Self-planning Code Generation with Large Language Models
by: Jiang, Xue, et al.
Published: (2023)