Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Du, Shangheng, Yan, Xiangchao, Jiang, Dengyang, Yuan, Jiakang, Hu, Yusong, Li, Xin, He, Liang, Zhang, Bo, Bai, Lei
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2510.08511
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915542602874880
author	Du, Shangheng Yan, Xiangchao Jiang, Dengyang Yuan, Jiakang Hu, Yusong Li, Xin He, Liang Zhang, Bo Bai, Lei
author_facet	Du, Shangheng Yan, Xiangchao Jiang, Dengyang Yuan, Jiakang Hu, Yusong Li, Xin He, Liang Zhang, Bo Bai, Lei
contents	Large language models (LLMs) have shown impressive performance in general programming tasks. However, in Machine Learning Engineering (MLE) scenarios such as AutoML and Kaggle competitions, achieving high performance depends heavily on expert intervention and repeated adjustments rather than simply generating correct code. When applied directly to these tasks, LLMs often lack fine-grained domain priors, and existing MLE approaches that use linear or tree-structured searches limit knowledge transfer to adjacent hierarchical links. As a result, they cannot leverage past full trajectories or share information across branches, limiting self-evolving ability and search space diversity. To address these limitations, we introduce AutoMLGen, an LLM-based coding agent that integrates a domain knowledge base for high-quality prior guidance and Monte Carlo Graph Search (MCGS) for efficient exploration. MCGS retains the tree-guided exploration of MCTS while embedding a graph structure into the expansion stage to enable dynamic path reorganization, historical trajectory reuse, and multi-solution fusion to support both self-evolution and collaborative learning. Combined with fine-grained operator sets, this design improves stability and accelerates convergence. Evaluation on the MLE-Bench shows that AutoMLGen achieves state-of-the-art performance in numerous dimensions, such as the average medal rate and the valid submission rate, under a 12-hour budget (half the standard runtime). The code is available at https://github.com/Alpha-Innovator/InternAgent.
format	Preprint
id	arxiv_https___arxiv_org_abs_2510_08511
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	AutoMLGen: Navigating Fine-Grained Optimization for Coding Agents Du, Shangheng Yan, Xiangchao Jiang, Dengyang Yuan, Jiakang Hu, Yusong Li, Xin He, Liang Zhang, Bo Bai, Lei Artificial Intelligence Computation and Language Machine Learning Large language models (LLMs) have shown impressive performance in general programming tasks. However, in Machine Learning Engineering (MLE) scenarios such as AutoML and Kaggle competitions, achieving high performance depends heavily on expert intervention and repeated adjustments rather than simply generating correct code. When applied directly to these tasks, LLMs often lack fine-grained domain priors, and existing MLE approaches that use linear or tree-structured searches limit knowledge transfer to adjacent hierarchical links. As a result, they cannot leverage past full trajectories or share information across branches, limiting self-evolving ability and search space diversity. To address these limitations, we introduce AutoMLGen, an LLM-based coding agent that integrates a domain knowledge base for high-quality prior guidance and Monte Carlo Graph Search (MCGS) for efficient exploration. MCGS retains the tree-guided exploration of MCTS while embedding a graph structure into the expansion stage to enable dynamic path reorganization, historical trajectory reuse, and multi-solution fusion to support both self-evolution and collaborative learning. Combined with fine-grained operator sets, this design improves stability and accelerates convergence. Evaluation on the MLE-Bench shows that AutoMLGen achieves state-of-the-art performance in numerous dimensions, such as the average medal rate and the valid submission rate, under a 12-hour budget (half the standard runtime). The code is available at https://github.com/Alpha-Innovator/InternAgent.
title	AutoMLGen: Navigating Fine-Grained Optimization for Coding Agents
topic	Artificial Intelligence Computation and Language Machine Learning
url	https://arxiv.org/abs/2510.08511

Similar Items