Bibliographic Details
Main Authors: Kniazev, Evgenii, Kravchenko, Arseny, Rekun, Igor, Broadhead, James, Shamgunov, Nikita, Sah, Pranav, Nichite, Pratik, Yamshchikov, Ivan
Format: Preprint
Published: 2025
Subjects:
Online Access: https://arxiv.org/abs/2509.03310
Abstract:
  • We present app.build (https://github.com/neondatabase/appdotbuild-agent), an open-source framework that improves LLM-based application generation through systematic validation and structured environments. Our approach combines multi-layered validation pipelines, stack-specific orchestration, and a model-agnostic architecture, implemented across three reference stacks. In an evaluation on 30 generation tasks, we demonstrate that comprehensive validation achieves a 73.3% viability rate, with 30% reaching perfect quality scores, while open-weights models achieve 80.8% of closed-model performance when provided with structured environments. The open-source framework has been adopted by the community, with over 3,000 applications generated to date. This work demonstrates that scaling reliable AI agents requires scaling environments, not just models, and it provides empirical insights and complete reference implementations for production-oriented agent systems.