Internformat: :: Library Catalog

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Geng, Jiahui, Cai, Fengyu, Cui, Shaobo, Li, Qing, Chen, Liangwei, Lyu, Chenyang, Li, Haonan, Zhu, Derui, Pretschner, Walter, Koeppl, Heinz, Karray, Fakhri
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Software Engineering Artificial Intelligence
Online-Zugang:	https://arxiv.org/abs/2506.11066
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

_version_	1866909754383663104
author	Geng, Jiahui Cai, Fengyu Cui, Shaobo Li, Qing Chen, Liangwei Lyu, Chenyang Li, Haonan Zhu, Derui Pretschner, Walter Koeppl, Heinz Karray, Fakhri
author_facet	Geng, Jiahui Cai, Fengyu Cui, Shaobo Li, Qing Chen, Liangwei Lyu, Chenyang Li, Haonan Zhu, Derui Pretschner, Walter Koeppl, Heinz Karray, Fakhri
contents	Code retrieval is essential in modern software development, as it boosts code reuse and accelerates debugging. However, current benchmarks primarily emphasize functional relevance while neglecting critical dimensions of software quality. Motivated by this gap, we introduce CoQuIR, the first large-scale, multilingual benchmark specifically designed to evaluate quality-aware code retrieval across four key dimensions: correctness, efficiency, security, and maintainability. CoQuIR provides fine-grained quality annotations for 42,725 queries and 134,907 code snippets in 11 programming languages, and is accompanied by two quality-centric evaluation metrics: Pairwise Preference Accuracy and Margin-based Ranking Score. Using CoQuIR, we benchmark 23 retrieval models, covering both open-source and proprietary systems, and find that even top-performing models frequently fail to distinguish buggy or insecure code from their more robust counterparts. Furthermore, we conduct preliminary investigations into training methods that explicitly encourage retrievers to recognize code quality. Using synthetic datasets, we demonstrate promising improvements in quality-aware metrics across various models, without sacrificing semantic relevance. Downstream code generation experiments further validate the effectiveness of our approach. Overall, our work highlights the importance of integrating quality signals into code retrieval systems, laying the groundwork for more trustworthy and robust software development tools.
format	Preprint
id	arxiv_https___arxiv_org_abs_2506_11066
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	CoQuIR: A Comprehensive Benchmark for Code Quality-Aware Information Retrieval Geng, Jiahui Cai, Fengyu Cui, Shaobo Li, Qing Chen, Liangwei Lyu, Chenyang Li, Haonan Zhu, Derui Pretschner, Walter Koeppl, Heinz Karray, Fakhri Software Engineering Artificial Intelligence Code retrieval is essential in modern software development, as it boosts code reuse and accelerates debugging. However, current benchmarks primarily emphasize functional relevance while neglecting critical dimensions of software quality. Motivated by this gap, we introduce CoQuIR, the first large-scale, multilingual benchmark specifically designed to evaluate quality-aware code retrieval across four key dimensions: correctness, efficiency, security, and maintainability. CoQuIR provides fine-grained quality annotations for 42,725 queries and 134,907 code snippets in 11 programming languages, and is accompanied by two quality-centric evaluation metrics: Pairwise Preference Accuracy and Margin-based Ranking Score. Using CoQuIR, we benchmark 23 retrieval models, covering both open-source and proprietary systems, and find that even top-performing models frequently fail to distinguish buggy or insecure code from their more robust counterparts. Furthermore, we conduct preliminary investigations into training methods that explicitly encourage retrievers to recognize code quality. Using synthetic datasets, we demonstrate promising improvements in quality-aware metrics across various models, without sacrificing semantic relevance. Downstream code generation experiments further validate the effectiveness of our approach. Overall, our work highlights the importance of integrating quality signals into code retrieval systems, laying the groundwork for more trustworthy and robust software development tools.
title	CoQuIR: A Comprehensive Benchmark for Code Quality-Aware Information Retrieval
topic	Software Engineering Artificial Intelligence
url	https://arxiv.org/abs/2506.11066

Ähnliche Einträge