MARC21: :: Library Catalog

Salvato in:

Dettagli Bibliografici
Autori principali:	Kim, Yongwan, Park, Sungchul
Natura:	Preprint
Pubblicazione:	2026
Soggetti:	Machine Learning Artificial Intelligence
Accesso online:	https://arxiv.org/abs/2603.25813
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

_version_	1866912984190681088
author	Kim, Yongwan Park, Sungchul
author_facet	Kim, Yongwan Park, Sungchul
contents	We present MAGNET (Model Autonomously Growing Network), a decentralized system for autonomous generation, training, and serving of domain-expert language models across commodity hardware. MAGNET integrates four components: (1) autoresearch, an autonomous ML research pipeline that automates dataset generation, hyperparameter exploration, evaluation, and error-driven iteration; (2) BitNet b1.58 ternary training, enabling CPU-native inference via bitnet.cpp without GPU hardware; (3) DiLoCo-based distributed merging for communication-efficient aggregation of domain specialists; and (4) on-chain contribution tracking on the HOOTi EVM chain. We validate autoresearch through three case studies: video safety classification (balanced accuracy 0.9287 to 0.9851), cryptocurrency directional prediction (41% to 54.9% hit rate), and BitNet hyperparameter optimization (10-phase sweep, -16.7% validation loss).
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_25813
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	MAGNET: Autonomous Expert Model Generation via Decentralized Autoresearch and BitNet Training Kim, Yongwan Park, Sungchul Machine Learning Artificial Intelligence We present MAGNET (Model Autonomously Growing Network), a decentralized system for autonomous generation, training, and serving of domain-expert language models across commodity hardware. MAGNET integrates four components: (1) autoresearch, an autonomous ML research pipeline that automates dataset generation, hyperparameter exploration, evaluation, and error-driven iteration; (2) BitNet b1.58 ternary training, enabling CPU-native inference via bitnet.cpp without GPU hardware; (3) DiLoCo-based distributed merging for communication-efficient aggregation of domain specialists; and (4) on-chain contribution tracking on the HOOTi EVM chain. We validate autoresearch through three case studies: video safety classification (balanced accuracy 0.9287 to 0.9851), cryptocurrency directional prediction (41% to 54.9% hit rate), and BitNet hyperparameter optimization (10-phase sweep, -16.7% validation loss).
title	MAGNET: Autonomous Expert Model Generation via Decentralized Autoresearch and BitNet Training
topic	Machine Learning Artificial Intelligence
url	https://arxiv.org/abs/2603.25813

Documenti analoghi