Saved in:
| Main Author: | Szeider, Stefan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.07468 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
MCP-Solver: Integrating Language Models with Constraint Programming Systems
by: Szeider, Stefan
Published: (2024)
by: Szeider, Stefan
Published: (2024)
Generating Streamlining Constraints with Large Language Models
by: Voboril, Florentina, et al.
Published: (2024)
by: Voboril, Florentina, et al.
Published: (2024)
Lita: Light Agent Uncovers the Agentic Coding Capabilities of LLMs
by: Dai, Hankun, et al.
Published: (2025)
by: Dai, Hankun, et al.
Published: (2025)
AFlow: Automating Agentic Workflow Generation
by: Zhang, Jiayi, et al.
Published: (2024)
by: Zhang, Jiayi, et al.
Published: (2024)
Scaling Test-Time Compute for Agentic Coding
by: Kim, Joongwon, et al.
Published: (2026)
by: Kim, Joongwon, et al.
Published: (2026)
Confucius Code Agent: Scalable Agent Scaffolding for Real-World Codebases
by: Wong, Sherman, et al.
Published: (2025)
by: Wong, Sherman, et al.
Published: (2025)
DocAgent: A Multi-Agent System for Automated Code Documentation Generation
by: Yang, Dayu, et al.
Published: (2025)
by: Yang, Dayu, et al.
Published: (2025)
AgentArmor: Enforcing Program Analysis on Agent Runtime Trace to Defend Against Prompt Injection
by: Wang, Peiran, et al.
Published: (2025)
by: Wang, Peiran, et al.
Published: (2025)
The Impact of Fine-tuning Large Language Models on Automated Program Repair
by: Macháček, Roman, et al.
Published: (2025)
by: Macháček, Roman, et al.
Published: (2025)
The Art of Repair: Optimizing Iterative Program Repair with Instruction-Tuned Models
by: Ruiz, Fernando Vallecillos, et al.
Published: (2025)
by: Ruiz, Fernando Vallecillos, et al.
Published: (2025)
Do LLMs Consider Security? An Empirical Study on Responses to Programming Questions
by: Sajadi, Amirali, et al.
Published: (2025)
by: Sajadi, Amirali, et al.
Published: (2025)
Experiential Co-Learning of Software-Developing Agents
by: Qian, Chen, et al.
Published: (2023)
by: Qian, Chen, et al.
Published: (2023)
From I/O to Code with Discovery Agent
by: Dong, Yihong, et al.
Published: (2026)
by: Dong, Yihong, et al.
Published: (2026)
Is Programming by Example solved by LLMs?
by: Li, Wen-Ding, et al.
Published: (2024)
by: Li, Wen-Ding, et al.
Published: (2024)
A Survey on Code Generation with LLM-based Agents
by: Dong, Yihong, et al.
Published: (2025)
by: Dong, Yihong, et al.
Published: (2025)
On Problems of Implicit Context Compression for Software Engineering Agents
by: Gelvan, Kirill, et al.
Published: (2026)
by: Gelvan, Kirill, et al.
Published: (2026)
Agentless: Demystifying LLM-based Software Engineering Agents
by: Xia, Chunqiu Steven, et al.
Published: (2024)
by: Xia, Chunqiu Steven, et al.
Published: (2024)
AlgoTune: Can Language Models Speed Up General-Purpose Numerical Programs?
by: Press, Ori, et al.
Published: (2025)
by: Press, Ori, et al.
Published: (2025)
Codehacks: A Dataset of Adversarial Tests for Competitive Programming Problems Obtained from Codeforces
by: Hort, Max, et al.
Published: (2025)
by: Hort, Max, et al.
Published: (2025)
LiveCodeBench Pro: How Do Olympiad Medalists Judge LLMs in Competitive Programming?
by: Zheng, Zihan, et al.
Published: (2025)
by: Zheng, Zihan, et al.
Published: (2025)
GSO: Challenging Software Optimization Tasks for Evaluating SWE-Agents
by: Shetty, Manish, et al.
Published: (2025)
by: Shetty, Manish, et al.
Published: (2025)
Maestro: Joint Graph & Config Optimization for Reliable AI Agents
by: Wang, Wenxiao, et al.
Published: (2025)
by: Wang, Wenxiao, et al.
Published: (2025)
Diversity Empowers Intelligence: Integrating Expertise of Software Engineering Agents
by: Zhang, Kexun, et al.
Published: (2024)
by: Zhang, Kexun, et al.
Published: (2024)
SELA: Tree-Search Enhanced LLM Agents for Automated Machine Learning
by: Chi, Yizhou, et al.
Published: (2024)
by: Chi, Yizhou, et al.
Published: (2024)
Toward Training Superintelligent Software Agents through Self-Play SWE-RL
by: Wei, Yuxiang, et al.
Published: (2025)
by: Wei, Yuxiang, et al.
Published: (2025)
Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly?
by: Xia, Chunqiu Steven, et al.
Published: (2025)
by: Xia, Chunqiu Steven, et al.
Published: (2025)
Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems
by: Liu, Jiacheng, et al.
Published: (2026)
by: Liu, Jiacheng, et al.
Published: (2026)
ReGAL: Refactoring Programs to Discover Generalizable Abstractions
by: Stengel-Eskin, Elias, et al.
Published: (2024)
by: Stengel-Eskin, Elias, et al.
Published: (2024)
CodeVisionary: An Agent-based Framework for Evaluating Large Language Models in Code Generation
by: Wang, Xinchen, et al.
Published: (2025)
by: Wang, Xinchen, et al.
Published: (2025)
Solution-oriented Agent-based Models Generation with Verifier-assisted Iterative In-context Learning
by: Niu, Tong, et al.
Published: (2024)
by: Niu, Tong, et al.
Published: (2024)
AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents
by: Trivedi, Harsh, et al.
Published: (2024)
by: Trivedi, Harsh, et al.
Published: (2024)
SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents
by: Kon, Patrick Tser Jern, et al.
Published: (2026)
by: Kon, Patrick Tser Jern, et al.
Published: (2026)
EquiBench: Benchmarking Large Language Models' Reasoning about Program Semantics via Equivalence Checking
by: Wei, Anjiang, et al.
Published: (2025)
by: Wei, Anjiang, et al.
Published: (2025)
Lean Refactor: Multi-Objective Controllable Proof Optimization via Agentic Strategy Search
by: Lu, Jialin, et al.
Published: (2026)
by: Lu, Jialin, et al.
Published: (2026)
Functional Programming Paradigm of Python for Scientific Computation Pipeline Integration
by: Zhang, Chen, et al.
Published: (2024)
by: Zhang, Chen, et al.
Published: (2024)
Operationalizing AI: Empirical Evidence on MLOps Practices, User Satisfaction, and Organizational Context
by: Pasch, Stefan
Published: (2025)
by: Pasch, Stefan
Published: (2025)
Self-Organized Agents: A LLM Multi-Agent Framework toward Ultra Large-Scale Code Generation and Optimization
by: Ishibashi, Yoichi, et al.
Published: (2024)
by: Ishibashi, Yoichi, et al.
Published: (2024)
Natural Language Outlines for Code: Literate Programming in the LLM Era
by: Shi, Kensen, et al.
Published: (2024)
by: Shi, Kensen, et al.
Published: (2024)
A Multi-Agent Framework for Stateful Inference-Time Search
by: Lalan, Arshika, et al.
Published: (2025)
by: Lalan, Arshika, et al.
Published: (2025)
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training
by: Hubinger, Evan, et al.
Published: (2024)
by: Hubinger, Evan, et al.
Published: (2024)
Similar Items
-
MCP-Solver: Integrating Language Models with Constraint Programming Systems
by: Szeider, Stefan
Published: (2024) -
Generating Streamlining Constraints with Large Language Models
by: Voboril, Florentina, et al.
Published: (2024) -
Lita: Light Agent Uncovers the Agentic Coding Capabilities of LLMs
by: Dai, Hankun, et al.
Published: (2025) -
AFlow: Automating Agentic Workflow Generation
by: Zhang, Jiayi, et al.
Published: (2024) -
Scaling Test-Time Compute for Agentic Coding
by: Kim, Joongwon, et al.
Published: (2026)