| Main authors: | Sun, Hui; Zhang, Yun-Ji; Xie, Zheng; Liu, Ren-Biao; Du, Yali; Li, Xin-Ye; Li, Ming |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online access: | https://arxiv.org/abs/2604.03922 |
Similar items
Exploring Pass-Rate Reward in Reinforcement Learning for Code Generation
by: Li, Xin-Ye, et al.
Published: (2026)
Post-Incorporating Code Structural Knowledge into Pretrained Models via ICL for Code Translation
by: Du, Yali, et al.
Published: (2025)
Weakly Supervised AUC Optimization: A Unified Partial AUC Approach
by: Xie, Zheng, et al.
Published: (2023)
Design-Specification Tiling for ICL-based CAD Code Generation
by: Du, Yali, et al.
Published: (2026)
A Joint Learning Model with Variational Interaction for Multilingual Program Translation
by: Du, Yali, et al.
Published: (2024)
Top Pass: Improve Code Generation by Pass@k-Maximized Code Ranking
by: Lyu, Zhi-Cun, et al.
Published: (2024)
Leave-One-Out Prediction for General Hypothesis Classes
by: Qian, Jian, et al.
Published: (2026)
An Iterative Test-and-Repair Framework for Competitive Code Generation
by: Tang, Lingxiao, et al.
Published: (2026)
ACES: Accent Subspaces for Coupling, Explanations, and Stress-Testing in Automatic Speech Recognition
by: Parekh, Swapnil
Published: (2026)
Evaluating the Test Adequacy of Benchmarks for LLMs on Code Generation
by: Xiangyue Liu, et al.
Published: (2025)
Weighted Leave-One-Out Cross Validation
by: Pronzato, Luc, et al.
Published: (2025)
Leave-One-Out Stable Conformal Prediction
by: Lee, Kiljae, et al.
Published: (2025)
Leave-One-Out Learning with Log-Loss
by: Fogel, Yaniv, et al.
Published: (2025)
Asymptotically Optimal Tests for One- and Two-Sample Problems
by: Grootveld, Arick, et al.
Published: (2026)
Enhancing LLMs in Long Code Translation through Instrumentation and Program State Alignment
by: Xin-Ye, Li, et al.
Published: (2025)
Mutation-based Consistency Testing for Evaluating the Code Understanding Capability of LLMs
by: Li, Ziyu, et al.
Published: (2024)
Structural Evaluation Metrics for SVG Generation via Leave-One-Out Analysis
by: Zhu, Haonan, et al.
Published: (2026)
OODD: Test-time Out-of-Distribution Detection with Dynamic Dictionary
by: Yang, Yifeng, et al.
Published: (2025)
Interval-Based AUC (iAUC): Extending ROC Analysis to Uncertainty-Aware Classification
by: Li, Yuqi, et al.
Published: (2026)
CodeContests+: High-Quality Test Case Generation for Competitive Programming
by: Wang, Zihan, et al.
Published: (2025)
MUCOCO: Automated Consistency Testing of Code LLMs
by: Chou, Chua Jin, et al.
Published: (2026)
When LRP Diverges from Leave-One-Out in Transformers
by: You, Weiqiu, et al.
Published: (2025)
Leave-One-Out-, Bootstrap- and Cross-Conformal Anomaly Detectors
by: Hennhöfer, Oliver, et al.
Published: (2024)
ScaleRTL: Scaling LLMs with Reasoning Data and Test-Time Compute for Accurate RTL Code Generation
by: Deng, Chenhui, et al.
Published: (2025)
Leave-One-Out Analysis for Nonconvex Robust Matrix Completion with General Thresholding Functions
by: Wang, Tianming, et al.
Published: (2024)
S*: Test Time Scaling for Code Generation
by: Li, Dacheng, et al.
Published: (2025)
Navigating Pharmacogenomic Testing in Practice: Who to Test and When to Test
by: James M. Stevenson, et al.
Published: (2025)
ACES: Generating Diverse Programming Puzzles with Autotelic Generative Models
by: Pourcel, Julien, et al.
Published: (2023)
Generalizing Test Cases for Comprehensive Test Scenario Coverage
by: Qi, Binhang, et al.
Published: (2026)
Measuring the Influence of Incorrect Code on Test Generation
by: Huang, Dong, et al.
Published: (2024)
Lares: LLM-driven Code Slice Semantic Search for Patch Presence Testing
by: Li, Siyuan, et al.
Published: (2025)
Who Wrote this Code? Watermarking for Code Generation
by: Lee, Taehyun, et al.
Published: (2023)
MCTS-Judge: Test-Time Scaling in LLM-as-a-Judge for Code Correctness Evaluation
by: Wang, Yutong, et al.
Published: (2025)
Preserving AUC Fairness in Learning with Noisy Protected Groups
by: Wu, Mingyang, et al.
Published: (2025)
Confidence Intervals for AUC and pAUC by Empirical Likelihood
by: Yumin Zhao, et al.
Published: (2025)
Cross-validating causal discovery via Leave-One-Variable-Out
by: Schkoda, Daniela, et al.
Published: (2024)
Klear-CodeTest: Scalable Test Case Generation for Code Reinforcement Learning
by: Fu, Jia, et al.
Published: (2025)
Test-time GNN Model Evaluation on Dynamic Graphs
by: Li, Bo, et al.
Published: (2025)
Leaving No One Behind, Leaving No One Unaccountable
by: Glušac, Luka
Published: (2023)
VeriScale: Adversarial Test-Suite Scaling for Verifiable Code Generation
by: Bai, Yifan, et al.
Published: (2026)