Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Zhang, Yunxiang, Zhou, Kang, Xu, Zhichao, Ramnath, Kiran, Zhou, Yun, Woo, Sangmin, Ding, Haibo, Cheong, Lin Lee
Format: Preprint
Veröffentlicht: 2026
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2601.17596
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866908786163187712
author Zhang, Yunxiang
Zhou, Kang
Xu, Zhichao
Ramnath, Kiran
Zhou, Yun
Woo, Sangmin
Ding, Haibo
Cheong, Lin Lee
author_facet Zhang, Yunxiang
Zhou, Kang
Xu, Zhichao
Ramnath, Kiran
Zhou, Yun
Woo, Sangmin
Ding, Haibo
Cheong, Lin Lee
contents Existing machine learning engineering (MLE) agents struggle to iteratively optimize their implemented algorithms for effectiveness. To address this, we introduce MLE-Ideator, a dual-agent framework that separates ideation from implementation. In our system, an implementation agent can request strategic help from a dedicated Ideator. We show this approach is effective in two ways. First, in a training-free setup, our framework significantly outperforms implementation-only agent baselines on MLE-Bench. Second, we demonstrate that the Ideator can be trained with reinforcement learning (RL) to generate more effective ideas. With only 1K training samples from 10 MLE tasks, our RL-trained Qwen3-8B Ideator achieves an 11.5% relative improvement compared to its untrained counterpart and surpasses Claude Sonnet 3.5. These results highlights a promising path toward training strategic AI systems for scientific discovery.
format Preprint
id arxiv_https___arxiv_org_abs_2601_17596
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Learning to Ideate for Machine Learning Engineering Agents
Zhang, Yunxiang
Zhou, Kang
Xu, Zhichao
Ramnath, Kiran
Zhou, Yun
Woo, Sangmin
Ding, Haibo
Cheong, Lin Lee
Computation and Language
Existing machine learning engineering (MLE) agents struggle to iteratively optimize their implemented algorithms for effectiveness. To address this, we introduce MLE-Ideator, a dual-agent framework that separates ideation from implementation. In our system, an implementation agent can request strategic help from a dedicated Ideator. We show this approach is effective in two ways. First, in a training-free setup, our framework significantly outperforms implementation-only agent baselines on MLE-Bench. Second, we demonstrate that the Ideator can be trained with reinforcement learning (RL) to generate more effective ideas. With only 1K training samples from 10 MLE tasks, our RL-trained Qwen3-8B Ideator achieves an 11.5% relative improvement compared to its untrained counterpart and surpasses Claude Sonnet 3.5. These results highlights a promising path toward training strategic AI systems for scientific discovery.
title Learning to Ideate for Machine Learning Engineering Agents
topic Computation and Language
url https://arxiv.org/abs/2601.17596