:: Library Catalog

Bibliographic Details
Main Authors:	Scaffi, Enzo; Bonneau, Antoine; Le Mouël, Frédéric; Mieyeville, Fabien
Format:	Preprint
Published:	2024
Subjects:	Software Engineering; Artificial Intelligence; Machine Learning
Online Access:	https://arxiv.org/abs/2404.07948

Similar Items

Automatic Generation of High-Performance RL Environments
by: Karten, Seth, et al.
Published: (2026)

The Model Openness Framework: Promoting Completeness and Openness for Reproducibility, Transparency, and Usability in Artificial Intelligence
by: White, Matt, et al.
Published: (2024)

SetupBench: Assessing Software Engineering Agents' Ability to Bootstrap Development Environments
by: Arora, Avi, et al.
Published: (2025)

Benchmarking Reward Hack Detection in Code Environments via Contrastive Analysis
by: Deshpande, Darshan, et al.
Published: (2026)

PIPer: On-Device Environment Setup via Online Reinforcement Learning
by: Kovrigin, Alexander, et al.
Published: (2025)

Predicting Configuration Performance in Multiple Environments with Sequential Meta-learning
by: Gong, Jingzhi, et al.
Published: (2024)

LLM Benchmarking with LLaMA2: Evaluating Code Development Performance Across Multiple Programming Languages
by: Diehl, Patrick, et al.
Published: (2025)

One Model, Many Skills: Parameter-Efficient Fine-Tuning for Multitask Code Analysis
by: Akli, Amal, et al.
Published: (2026)

Deep Configuration Performance Learning: A Systematic Survey and Taxonomy
by: Gong, Jingzhi, et al.
Published: (2024)

The Impact of Environment Configurations on the Stability of AI-Enabled Systems
by: Rahman, Musfiqur, et al.
Published: (2024)

Learning Performance-Improving Code Edits
by: Shypula, Alexander, et al.
Published: (2023)

Intuition to Evidence: Measuring AI's True Impact on Developer Productivity
by: Kumar, Anand, et al.
Published: (2025)

SynthTools: A Framework for Scaling Synthetic Tools for Agent Development
by: Castellani, Tommaso, et al.
Published: (2025)

Hardness, Structural Knowledge, and Opportunity: An Analytical Framework for Modular Performance Modeling
by: Gheibi, Omid, et al.
Published: (2025)

A Regression Framework for Understanding Prompt Component Impact on LLM Performance
by: Lauziere, Andrew, et al.
Published: (2026)

DevBench: A Realistic, Developer-Informed Benchmark for Code Generation Models
by: Kumarappan, Adarsh, et al.
Published: (2026)

Protocol-Driven Development: Governing Generated Software Through Invariants and Continuous Evidence
by: He, Jun, et al.
Published: (2026)

On the Impact of Black-box Deployment Strategies for Edge AI on Latency and Model Performance
by: Singh, Jaskirat, et al.
Published: (2024)

Explainable Artificial Intelligence Techniques for Software Development Lifecycle: A Phase-specific Survey
by: Arora, Lakshit, et al.
Published: (2025)

DrugPlayGround: Benchmarking Large Language Models and Embeddings for Drug Discovery
by: Liu, Tianyu, et al.
Published: (2026)

Experiential Co-Learning of Software-Developing Agents
by: Qian, Chen, et al.
Published: (2023)

Beyond Synthetic Benchmarks: Evaluating LLM Performance on Real-World Class-Level Code Generation
by: Rahman, Musfiqur, et al.
Published: (2025)

Understanding the Helpfulness of Stale Bot for Pull-based Development: An Empirical Study of 20 Large Open-Source Projects
by: Khatoonabadi, SayedHassan, et al.
Published: (2023)

Are Large Language Models Memorizing Bug Benchmarks?
by: Ramos, Daniel, et al.
Published: (2024)

SPELL: Synthesis of Programmatic Edits using LLMs
by: Ramos, Daniel, et al.
Published: (2026)

Utilizing Deep Learning to Optimize Software Development Processes
by: Li, Keqin, et al.
Published: (2024)

Redundancy and Concept Analysis for Code-trained Language Models
by: Sharma, Arushi, et al.
Published: (2023)

A Theoretical Analysis of Test-Driven Code Generation
by: Menet, Nicolas, et al.
Published: (2026)

Comparative Evaluation of Embedding Representations for Financial News Sentiment Analysis
by: Roy, Joyjit, et al.
Published: (2025)

Debugging and Runtime Analysis of Neural Networks with VLMs (A Case Study)
by: Hu, Boyue Caroline, et al.
Published: (2025)

What's documented in AI? Systematic Analysis of 32K AI Model Cards
by: Liang, Weixin, et al.
Published: (2024)

Machine Learning Robustness: A Primer
by: Ben Braiek, Houssem, et al.
Published: (2024)

Machine Learning with Requirements: a Manifesto
by: Giunchiglia, Eleonora, et al.
Published: (2023)

SLIM: a Scalable Light-weight Root Cause Analysis for Imbalanced Data in Microservice
by: Ren, Rui, et al.
Published: (2024)

AgentTrace: Causal Graph Tracing for Root Cause Analysis in Deployed Multi-Agent Systems
by: Wang, Zhaohui Geoffrey
Published: (2026)

Bayesian Program Learning by Decompiling Amortized Knowledge
by: Palmarini, Alessandro B., et al.
Published: (2023)

On the Replicability and Reproducibility of Deep Learning in Software Engineering
by: Liu, Chao, et al.
Published: (2020)

A Reference Architecture of Reinforcement Learning Frameworks
by: Liu, Xiaoran, et al.
Published: (2026)

Does Few-Shot Learning Help LLM Performance in Code Synthesis?
by: Xu, Derek, et al.
Published: (2024)

An Empirical Study of Fault Localisation Techniques for Deep Learning
by: Humbatova, Nargiz, et al.
Published: (2024)