Saved in:
| Main Authors: | Scaffi, Enzo, Bonneau, Antoine, Mouël, Frédéric Le, Mieyeville, Fabien |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.07948 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Automatic Generation of High-Performance RL Environments
by: Karten, Seth, et al.
Published: (2026)
by: Karten, Seth, et al.
Published: (2026)
The Model Openness Framework: Promoting Completeness and Openness for Reproducibility, Transparency, and Usability in Artificial Intelligence
by: White, Matt, et al.
Published: (2024)
by: White, Matt, et al.
Published: (2024)
SetupBench: Assessing Software Engineering Agents' Ability to Bootstrap Development Environments
by: Arora, Avi, et al.
Published: (2025)
by: Arora, Avi, et al.
Published: (2025)
Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis
by: Deshpande, Darshan, et al.
Published: (2026)
by: Deshpande, Darshan, et al.
Published: (2026)
PIPer: On-Device Environment Setup via Online Reinforcement Learning
by: Kovrigin, Alexander, et al.
Published: (2025)
by: Kovrigin, Alexander, et al.
Published: (2025)
Predicting Configuration Performance in Multiple Environments with Sequential Meta-learning
by: Gong, Jingzhi, et al.
Published: (2024)
by: Gong, Jingzhi, et al.
Published: (2024)
LLM Benchmarking with LLaMA2: Evaluating Code Development Performance Across Multiple Programming Languages
by: Diehl, Patrick, et al.
Published: (2025)
by: Diehl, Patrick, et al.
Published: (2025)
One Model, Many Skills: Parameter-Efficient Fine-Tuning for Multitask Code Analysis
by: Akli, Amal, et al.
Published: (2026)
by: Akli, Amal, et al.
Published: (2026)
Deep Configuration Performance Learning: A Systematic Survey and Taxonomy
by: Gong, Jingzhi, et al.
Published: (2024)
by: Gong, Jingzhi, et al.
Published: (2024)
The Impact of Environment Configurations on the Stability of AI-Enabled Systems
by: Rahman, Musfiqur, et al.
Published: (2024)
by: Rahman, Musfiqur, et al.
Published: (2024)
Learning Performance-Improving Code Edits
by: Shypula, Alexander, et al.
Published: (2023)
by: Shypula, Alexander, et al.
Published: (2023)
Intuition to Evidence: Measuring AI's True Impact on Developer Productivity
by: Kumar, Anand, et al.
Published: (2025)
by: Kumar, Anand, et al.
Published: (2025)
SynthTools: A Framework for Scaling Synthetic Tools for Agent Development
by: Castellani, Tommaso, et al.
Published: (2025)
by: Castellani, Tommaso, et al.
Published: (2025)
Hardness, Structural Knowledge, and Opportunity: An Analytical Framework for Modular Performance Modeling
by: Gheibi, Omid, et al.
Published: (2025)
by: Gheibi, Omid, et al.
Published: (2025)
A Regression Framework for Understanding Prompt Component Impact on LLM Performance
by: Lauziere, Andrew, et al.
Published: (2026)
by: Lauziere, Andrew, et al.
Published: (2026)
DevBench: A Realistic, Developer-Informed Benchmark for Code Generation Models
by: Kumarappan, Adarsh, et al.
Published: (2026)
by: Kumarappan, Adarsh, et al.
Published: (2026)
Protocol-Driven Development: Governing Generated Software Through Invariants and Continuous Evidence
by: He, Jun, et al.
Published: (2026)
by: He, Jun, et al.
Published: (2026)
On the Impact of Black-box Deployment Strategies for Edge AI on Latency and Model Performance
by: Singh, Jaskirat, et al.
Published: (2024)
by: Singh, Jaskirat, et al.
Published: (2024)
Explainable Artificial Intelligence Techniques for Software Development Lifecycle: A Phase-specific Survey
by: Arora, Lakshit, et al.
Published: (2025)
by: Arora, Lakshit, et al.
Published: (2025)
DrugPlayGround: Benchmarking Large Language Models and Embeddings for Drug Discovery
by: Liu, Tianyu, et al.
Published: (2026)
by: Liu, Tianyu, et al.
Published: (2026)
Experiential Co-Learning of Software-Developing Agents
by: Qian, Chen, et al.
Published: (2023)
by: Qian, Chen, et al.
Published: (2023)
Beyond Synthetic Benchmarks: Evaluating LLM Performance on Real-World Class-Level Code Generation
by: Rahman, Musfiqur, et al.
Published: (2025)
by: Rahman, Musfiqur, et al.
Published: (2025)
Understanding the Helpfulness of Stale Bot for Pull-based Development: An Empirical Study of 20 Large Open-Source Projects
by: Khatoonabadi, SayedHassan, et al.
Published: (2023)
by: Khatoonabadi, SayedHassan, et al.
Published: (2023)
Are Large Language Models Memorizing Bug Benchmarks?
by: Ramos, Daniel, et al.
Published: (2024)
by: Ramos, Daniel, et al.
Published: (2024)
SPELL: Synthesis of Programmatic Edits using LLMs
by: Ramos, Daniel, et al.
Published: (2026)
by: Ramos, Daniel, et al.
Published: (2026)
Utilizing Deep Learning to Optimize Software Development Processes
by: Li, Keqin, et al.
Published: (2024)
by: Li, Keqin, et al.
Published: (2024)
Redundancy and Concept Analysis for Code-trained Language Models
by: Sharma, Arushi, et al.
Published: (2023)
by: Sharma, Arushi, et al.
Published: (2023)
A Theoretical Analysis of Test-Driven Code Generation
by: Menet, Nicolas, et al.
Published: (2026)
by: Menet, Nicolas, et al.
Published: (2026)
Comparative Evaluation of Embedding Representations for Financial News Sentiment Analysis
by: Roy, Joyjit, et al.
Published: (2025)
by: Roy, Joyjit, et al.
Published: (2025)
Debugging and Runtime Analysis of Neural Networks with VLMs (A Case Study)
by: Hu, Boyue Caroline, et al.
Published: (2025)
by: Hu, Boyue Caroline, et al.
Published: (2025)
What's documented in AI? Systematic Analysis of 32K AI Model Cards
by: Liang, Weixin, et al.
Published: (2024)
by: Liang, Weixin, et al.
Published: (2024)
Machine Learning Robustness: A Primer
by: Braiek, Houssem Ben, et al.
Published: (2024)
by: Braiek, Houssem Ben, et al.
Published: (2024)
Machine Learning with Requirements: a Manifesto
by: Giunchiglia, Eleonora, et al.
Published: (2023)
by: Giunchiglia, Eleonora, et al.
Published: (2023)
SLIM: a Scalable Light-weight Root Cause Analysis for Imbalanced Data in Microservice
by: Ren, Rui, et al.
Published: (2024)
by: Ren, Rui, et al.
Published: (2024)
AgentTrace: Causal Graph Tracing for Root Cause Analysis in Deployed Multi-Agent Systems
by: Wang, Zhaohui Geoffrey
Published: (2026)
by: Wang, Zhaohui Geoffrey
Published: (2026)
Bayesian Program Learning by Decompiling Amortized Knowledge
by: Palmarini, Alessandro B., et al.
Published: (2023)
by: Palmarini, Alessandro B., et al.
Published: (2023)
On the Replicability and Reproducibility of Deep Learning in Software Engineering
by: Liu, Chao, et al.
Published: (2020)
by: Liu, Chao, et al.
Published: (2020)
A Reference Architecture of Reinforcement Learning Frameworks
by: Liu, Xiaoran, et al.
Published: (2026)
by: Liu, Xiaoran, et al.
Published: (2026)
Does Few-Shot Learning Help LLM Performance in Code Synthesis?
by: Xu, Derek, et al.
Published: (2024)
by: Xu, Derek, et al.
Published: (2024)
An Empirical Study of Fault Localisation Techniques for Deep Learning
by: Humbatova, Nargiz, et al.
Published: (2024)
by: Humbatova, Nargiz, et al.
Published: (2024)
Similar Items
-
Automatic Generation of High-Performance RL Environments
by: Karten, Seth, et al.
Published: (2026) -
The Model Openness Framework: Promoting Completeness and Openness for Reproducibility, Transparency, and Usability in Artificial Intelligence
by: White, Matt, et al.
Published: (2024) -
SetupBench: Assessing Software Engineering Agents' Ability to Bootstrap Development Environments
by: Arora, Avi, et al.
Published: (2025) -
Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis
by: Deshpande, Darshan, et al.
Published: (2026) -
PIPer: On-Device Environment Setup via Online Reinforcement Learning
by: Kovrigin, Alexander, et al.
Published: (2025)