| Main Authors: | Zhang, Xiaoyu, Jiang, Weipeng, Shen, Chao, Li, Qi, Wang, Qian, Lin, Chenhao, Guan, Xiaohong |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2404.17871 |
Similar Items
The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries
by: Jiang, Weipeng, et al.
Published: (2025)
AutoEmpirical: LLM-Based Automated Research for Empirical Software Fault Analysis
by: Yu, Jiongchi, et al.
Published: (2025)
Efficient DNN-Powered Software with Fair Sparse Models
by: Gao, Xuanqi, et al.
Published: (2024)
The Invisible Hand: Unveiling Provider Bias in Large Language Models for Code Generation
by: Zhang, Xiaoyu, et al.
Published: (2025)
False Friends in the Shell: Unveiling the Emoticon Semantic Confusion in Large Language Models
by: Jiang, Weipeng, et al.
Published: (2026)
Deep Learning-Based Identification of Inconsistent Method Names: How Far Are We?
by: Wang, Taiming, et al.
Published: (2025)
DREAM: Debugging and Repairing AutoML Pipelines
by: Zhang, Xiaoyu, et al.
Published: (2023)
Explicating Tacit Regulatory Knowledge from LLMs to Auto-Formalize Requirements for Compliance Test Case Generation
by: Xue, Zhiyi, et al.
Published: (2026)
Klear-CodeTest: Scalable Test Case Generation for Code Reinforcement Learning
by: Fu, Jia, et al.
Published: (2025)
Navigating the Labyrinth: Path-Sensitive Unit Test Generation with Large Language Models
by: Liao, Dianshu, et al.
Published: (2025)
Rethinking Technology Stack Selection with AI Coding Proficiency
by: Zhang, Xiaoyu, et al.
Published: (2025)
ASSURE: Metamorphic Testing for AI-powered Browser Extensions
by: Gao, Xuanqi, et al.
Published: (2025)
Software Development Life Cycle Perspective: A Survey of Benchmarks for Code Large Language Models and Agents
by: Wang, Kaixin, et al.
Published: (2025)
Effective Code Membership Inference for Code Completion Models via Adversarial Prompts
by: Jiang, Yuan, et al.
Published: (2025)
Rethinking Testing for LLM Applications: Characteristics, Challenges, and a Lightweight Interaction Protocol
by: Ma, Wei, et al.
Published: (2025)
Rethinking Diversity in Deep Neural Network Testing
by: Wang, Zi, et al.
Published: (2023)
When Fuzzing Meets LLMs: Challenges and Opportunities
by: Jiang, Yu, et al.
Published: (2024)
Keeping Deep Learning Models in Check: A History-Based Approach to Mitigate Overfitting
by: Li, Hao, et al.
Published: (2024)
Open-Source AI-based SE Tools: Opportunities and Challenges of Collaborative Software Learning
by: Lin, Zhihao, et al.
Published: (2024)
Beyond Accuracy: An Empirical Study on Unit Testing in Open-source Deep Learning Projects
by: Wang, Han, et al.
Published: (2024)
Addressing Quality Challenges in Deep Learning: The Role of MLOps and Domain Knowledge
by: del Rey, Santiago, et al.
Published: (2025)
Enhancing High-Quality Code Generation in Large Language Models with Comparative Prefix-Tuning
by: Jiang, Yuan, et al.
Published: (2025)
The Hitchhiker's Guide to Program Analysis, Part II: Deep Thoughts by LLMs
by: Li, Haonan, et al.
Published: (2025)
Testing Storage-System Correctness: Challenges, Fuzzing Limitations, and AI-Augmented Opportunities
by: Wang, Ying, et al.
Published: (2026)
DeepCode: Open Agentic Coding
by: Li, Zongwei, et al.
Published: (2025)
Tool-integrated Reinforcement Learning for Repo Deep Search
by: Ma, Zexiong, et al.
Published: (2025)
QuanTest: Entanglement-Guided Testing of Quantum Neural Network Systems
by: Shi, Jinjing, et al.
Published: (2024)
TENET: Leveraging Tests Beyond Validation for Code Generation
by: Hu, Yiran, et al.
Published: (2025)
DeepKnowledge: Generalisation-Driven Deep Learning Testing
by: Missaoui, Sondess, et al.
Published: (2024)
Rethinking the Value of Agent-Generated Tests for LLM-Based Software Engineering Agents
by: Chen, Zhi, et al.
Published: (2026)
FVSpec: Real-World Property-Based Tests as Lean Challenges
by: Dougherty, Quinn, et al.
Published: (2026)
Enhancing Differential Testing With LLMs For Testing Deep Learning Libraries
by: Li, Meiziniu, et al.
Published: (2024)
DialogAgent: An Auto-engagement Agent for Code Question Answering Data Production
by: Liang, Xiaoyun, et al.
Published: (2024)
VSRQ: Quantitative Assessment Method for Safety Risk of Vehicle Intelligent Connected System
by: Zhang, Tian, et al.
Published: (2023)
Just-In-Time Software Defect Prediction via Bi-modal Change Representation Learning
by: Jiang, Yuze, et al.
Published: (2024)
FlaKat: A Machine Learning-Based Categorization Framework for Flaky Tests
by: Lin, Shizhe, et al.
Published: (2024)
Explore-Construct-Filter: An Automated Framework for Rich and Reliable API Knowledge Graph Construction
by: Sun, Yanbang, et al.
Published: (2025)
Empowering AI to Generate Better AI Code: Guided Generation of Deep Learning Projects with LLMs
by: Xie, Chen, et al.
Published: (2025)
Challenges in Testing Large Language Model Based Software: A Faceted Taxonomy
by: Dobslaw, Felix, et al.
Published: (2025)
Collu-Bench: A Benchmark for Predicting Language Model Hallucinations in Code
by: Jiang, Nan, et al.
Published: (2024)