Library Catalog

Bibliographic Details
Main Author:	Szeider, Stefan
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence; Computation and Language; Machine Learning; Software Engineering
Online Access:	https://arxiv.org/abs/2508.07468

Similar Items

MCP-Solver: Integrating Language Models with Constraint Programming Systems
by: Szeider, Stefan
Published: (2024)

Generating Streamlining Constraints with Large Language Models
by: Voboril, Florentina, et al.
Published: (2024)

Lita: Light Agent Uncovers the Agentic Coding Capabilities of LLMs
by: Dai, Hankun, et al.
Published: (2025)

AFlow: Automating Agentic Workflow Generation
by: Zhang, Jiayi, et al.
Published: (2024)

Scaling Test-Time Compute for Agentic Coding
by: Kim, Joongwon, et al.
Published: (2026)

Confucius Code Agent: Scalable Agent Scaffolding for Real-World Codebases
by: Wong, Sherman, et al.
Published: (2025)

DocAgent: A Multi-Agent System for Automated Code Documentation Generation
by: Yang, Dayu, et al.
Published: (2025)

AgentArmor: Enforcing Program Analysis on Agent Runtime Trace to Defend Against Prompt Injection
by: Wang, Peiran, et al.
Published: (2025)

The Impact of Fine-tuning Large Language Models on Automated Program Repair
by: Macháček, Roman, et al.
Published: (2025)

The Art of Repair: Optimizing Iterative Program Repair with Instruction-Tuned Models
by: Ruiz, Fernando Vallecillos, et al.
Published: (2025)

Do LLMs Consider Security? An Empirical Study on Responses to Programming Questions
by: Sajadi, Amirali, et al.
Published: (2025)

Experiential Co-Learning of Software-Developing Agents
by: Qian, Chen, et al.
Published: (2023)

From I/O to Code with Discovery Agent
by: Dong, Yihong, et al.
Published: (2026)

Is Programming by Example solved by LLMs?
by: Li, Wen-Ding, et al.
Published: (2024)

A Survey on Code Generation with LLM-based Agents
by: Dong, Yihong, et al.
Published: (2025)

On Problems of Implicit Context Compression for Software Engineering Agents
by: Gelvan, Kirill, et al.
Published: (2026)

Agentless: Demystifying LLM-based Software Engineering Agents
by: Xia, Chunqiu Steven, et al.
Published: (2024)

AlgoTune: Can Language Models Speed Up General-Purpose Numerical Programs?
by: Press, Ori, et al.
Published: (2025)

Codehacks: A Dataset of Adversarial Tests for Competitive Programming Problems Obtained from Codeforces
by: Hort, Max, et al.
Published: (2025)

LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?
by: Zheng, Zihan, et al.
Published: (2025)

GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents
by: Shetty, Manish, et al.
Published: (2025)

Maestro: Joint Graph & Config Optimization for Reliable AI Agents
by: Wang, Wenxiao, et al.
Published: (2025)

Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
by: Zhang, Kexun, et al.
Published: (2024)

SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning
by: Chi, Yizhou, et al.
Published: (2024)

Toward Training Superintelligent Software Agents through Self-Play SWE-RL
by: Wei, Yuxiang, et al.
Published: (2025)

Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly?
by: Xia, Chunqiu Steven, et al.
Published: (2025)

Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems
by: Liu, Jiacheng, et al.
Published: (2026)

ReGAL: Refactoring Programs to Discover Generalizable Abstractions
by: Stengel-Eskin, Elias, et al.
Published: (2024)

CodeVisionary: An Agent-based Framework for Evaluating Large Language Models in Code Generation
by: Wang, Xinchen, et al.
Published: (2025)

Solution-oriented Agent-based Models Generation with Verifier-assisted Iterative In-context Learning
by: Niu, Tong, et al.
Published: (2024)

AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents
by: Trivedi, Harsh, et al.
Published: (2024)

SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents
by: Kon, Patrick Tser Jern, et al.
Published: (2026)

EquiBench: Benchmarking Large Language Models' Reasoning about Program Semantics via Equivalence Checking
by: Wei, Anjiang, et al.
Published: (2025)

Lean Refactor: Multi-Objective Controllable Proof Optimization via Agentic Strategy Search
by: Lu, Jialin, et al.
Published: (2026)

Functional Programming Paradigm of Python for Scientific Computation Pipeline Integration
by: Zhang, Chen, et al.
Published: (2024)

Operationalizing AI: Empirical Evidence on MLOps Practices, User Satisfaction, and Organizational Context
by: Pasch, Stefan
Published: (2025)

Self-Organized Agents: A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization
by: Ishibashi, Yoichi, et al.
Published: (2024)

Natural Language Outlines for Code: Literate Programming in the LLM Era
by: Shi, Kensen, et al.
Published: (2024)

A Multi-Agent Framework for Stateful Inference-Time Search
by: Lalan, Arshika, et al.
Published: (2025)

Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
by: Hubinger, Evan, et al.
Published: (2024)