| Main Authors: | Sorokin, Nikita, Sedykh, Ivan, Malykh, Valentin |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2504.09643 |
Similar Items
Searching by Code: a New SearchBySnippet Dataset and SnippeR Retrieval Model for Searching by Code Snippets
by: Sedykh, Ivan, et al.
Published: (2023)
CCT-Code: Cross-Consistency Training for Multilingual Clone Detection and Code Search
by: Tikhonov, Anton, et al.
Published: (2023)
Hierarchical Embedding Fusion for Retrieval-Augmented Code Generation
by: Sorokin, Nikita, et al.
Published: (2026)
ReCode: Updating Code API Knowledge with Reinforcement Learning
by: Wu, Haoze, et al.
Published: (2025)
CodeRAG: Finding Relevant and Necessary Knowledge for Retrieval-Augmented Repository-Level Code Completion
by: Zhang, Sheng, et al.
Published: (2025)
Selective Shot Learning for Code Explanation
by: Bhattacharya, Paheli, et al.
Published: (2024)
ProCQA: A Large-scale Community-based Programming Question Answering Dataset for Code Search
by: Li, Zehan, et al.
Published: (2024)
cAST: Enhancing Code Retrieval-Augmented Generation with Structural Chunking via Abstract Syntax Tree
by: Zhang, Yilin, et al.
Published: (2025)
Not All Denoising Steps Are Equal: Model Scheduling for Faster Masked Diffusion Language Models
by: Sedykh, Ivan, et al.
Published: (2026)
When "Better" Prompts Hurt: Evaluation-Driven Iteration for LLM Applications
by: Commey, Daniel
Published: (2026)
SaraCoder: Orchestrating Semantic and Structural Cues for Resource-Optimized Repository-Level Code Completion
by: Chen, Xiaohan, et al.
Published: (2025)
Rewriting the Code: A Simple Method for Large Language Model Augmented Code Search
by: Li, Haochen, et al.
Published: (2024)
LLM Agents Improve Semantic Code Search
by: Jain, Sarthak, et al.
Published: (2024)
CodeKGC: Code Language Model for Generative Knowledge Graph Construction
by: Bi, Zhen, et al.
Published: (2023)
Toward building next-generation Geocoding systems: a systematic review
by: Yin, Zhengcong, et al.
Published: (2025)
Assessing the Ability of ChatGPT to Screen Articles for Systematic Reviews
by: Syriani, Eugene, et al.
Published: (2023)
Evidence Absence Is Not Evidence Insufficiency: Diagnosing NEI Construction Artifacts in Fact Verification
by: Qiu, Jingxi, et al.
Published: (2026)
Automating Database-Native Function Code Synthesis with LLMs
by: Zhou, Wei, et al.
Published: (2026)
Domain-Specific Retrieval-Augmented Generation Using Vector Stores, Knowledge Graphs, and Tensor Factorization
by: Barron, Ryan C., et al.
Published: (2024)
SweRank: Software Issue Localization with Code Ranking
by: Reddy, Revanth Gangi, et al.
Published: (2025)
StRuCom: A Novel Dataset of Structured Code Comments in Russian
by: Dziuba, Maria, et al.
Published: (2025)
CIDRe: A Reference-Free Multi-Aspect Criterion for Code Comment Quality Measurement
by: Dziuba, Maria, et al.
Published: (2025)
Modular Layout Synthesis (MLS): Front-end Code via Structure Normalization and Constrained Generation
by: Liu, Chong, et al.
Published: (2025)
MGS3: A Multi-Granularity Self-Supervised Code Search Framework
by: Li, Rui, et al.
Published: (2025)
ORFuzz: Fuzzing the "Other Side" of LLM Safety -- Testing Over-Refusal
by: Zhang, Haonan, et al.
Published: (2025)
SemLink: A Semantic-Aware Automated Test Oracle for Hyperlink Verification using Siamese Sentence-BERT
by: Yang, Guan-Yan, et al.
Published: (2026)
Studying and Recommending Information Highlighting in Stack Overflow Answers
by: Ahmed, Shahla Shaan, et al.
Published: (2024)
Towards AI Evaluation in Domain-Specific RAG Systems: The AgriHubi Case Study
by: Hasan, Md. Toufique, et al.
Published: (2026)
Embedding-based search in JetBrains IDEs
by: Abramov, Evgeny, et al.
Published: (2024)
SBAN: A Framework & Multi-Dimensional Dataset for Large Language Model Pre-Training and Software Code Mining
by: Jelodar, Hamed, et al.
Published: (2025)
In-Context Learning as an Effective Estimator of Functional Correctness of LLM-Generated Code
by: Das, Susmita, et al.
Published: (2025)
The Invisible Hand of AI Libraries Shaping Open Source Projects and Communities
by: Esposito, Matteo, et al.
Published: (2026)
Descriptor: C++ Self-Admitted Technical Debt Dataset (CppSATD)
by: Pham, Phuoc, et al.
Published: (2025)
TreeRanker: Fast and Model-agnostic Ranking System for Code Suggestions in IDEs
by: Cipollone, Daniele, et al.
Published: (2025)
Code-Craft: Hierarchical Graph-Based Code Summarization for Enhanced Context Retrieval
by: Sounthiraraj, David, et al.
Published: (2025)
Source Code Clone Detection Using Unsupervised Similarity Measures
by: Martinez-Gil, Jorge
Published: (2024)
GNN-Coder: Boosting Semantic Code Retrieval with Combined GNNs and Transformer
by: Ye, Yufan, et al.
Published: (2025)
Deep Code Search with Naming-Agnostic Contrastive Multi-View Learning
by: Feng, Jiadong, et al.
Published: (2024)
DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation
by: Esakkiraja, Esakkivel, et al.
Published: (2025)
SQuaD: The Software Quality Dataset
by: Robredo, Mikel, et al.
Published: (2025)