Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Zhang, Genghan, Liang, Weixin, Hsu, Olivia, Olukotun, Kunle
Format: Preprint
Veröffentlicht: 2025
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2502.02534
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866916956817326080
author Zhang, Genghan
Liang, Weixin
Hsu, Olivia
Olukotun, Kunle
author_facet Zhang, Genghan
Liang, Weixin
Hsu, Olivia
Olukotun, Kunle
contents ML libraries, often written in architecture-specific programming languages (ASPLs) that target domain-specific architectures, are key to efficient ML systems. However, writing these high-performance ML libraries is challenging because it requires expert knowledge of ML algorithms and the ASPL. Large language models (LLMs), on the other hand, have shown general coding capabilities. However, challenges remain when using LLMs for generating ML libraries using ASPLs because 1) this task is complicated even for experienced human programmers and 2) there are limited code examples because of the esoteric and evolving nature of ASPLs. Therefore, LLMs need complex reasoning with limited data in order to complete this task. To address these challenges, we introduce an adaptive self-improvement agentic system. In order to evaluate the effectiveness of our system, we construct a benchmark of a typical ML library and generate ASPL code with both open and closed-source LLMs on this benchmark. Our results show improvements of up to $3.9\times$ over a baseline single LLM.
format Preprint
id arxiv_https___arxiv_org_abs_2502_02534
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Adaptive Self-improvement LLM Agentic System for ML Library Development
Zhang, Genghan
Liang, Weixin
Hsu, Olivia
Olukotun, Kunle
Computation and Language
ML libraries, often written in architecture-specific programming languages (ASPLs) that target domain-specific architectures, are key to efficient ML systems. However, writing these high-performance ML libraries is challenging because it requires expert knowledge of ML algorithms and the ASPL. Large language models (LLMs), on the other hand, have shown general coding capabilities. However, challenges remain when using LLMs for generating ML libraries using ASPLs because 1) this task is complicated even for experienced human programmers and 2) there are limited code examples because of the esoteric and evolving nature of ASPLs. Therefore, LLMs need complex reasoning with limited data in order to complete this task. To address these challenges, we introduce an adaptive self-improvement agentic system. In order to evaluate the effectiveness of our system, we construct a benchmark of a typical ML library and generate ASPL code with both open and closed-source LLMs on this benchmark. Our results show improvements of up to $3.9\times$ over a baseline single LLM.
title Adaptive Self-improvement LLM Agentic System for ML Library Development
topic Computation and Language
url https://arxiv.org/abs/2502.02534