Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Kotar, Klemen, Tuckute, Greta
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2504.21047
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913812507000832
author	Kotar, Klemen Tuckute, Greta
author_facet	Kotar, Klemen Tuckute, Greta
contents	Biological neural networks are shaped both by evolution across generations and by individual learning within an organism's lifetime, whereas standard artificial neural networks undergo a single, large training procedure without inherited constraints. In this preliminary work, we propose a framework that incorporates this crucial generational dimension - an "outer loop" of evolution that shapes the "inner loop" of learning - so that artificial networks better mirror the effects of evolution and individual learning in biological organisms. Focusing on language, we train a model that inherits a "model connectome" from the outer evolution loop before exposing it to a developmental-scale corpus of 100M tokens. Compared with two closely matched control models, we show that the connectome model performs better or on par on natural language processing tasks as well as alignment to human behavior and brain data. These findings suggest that a model connectome serves as an efficient prior for learning in low-data regimes - narrowing the gap between single-generation artificial models and biologically evolved neural networks.
format	Preprint
id	arxiv_https___arxiv_org_abs_2504_21047
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Model Connectomes: A Generational Approach to Data-Efficient Language Models Kotar, Klemen Tuckute, Greta Machine Learning Artificial Intelligence Biological neural networks are shaped both by evolution across generations and by individual learning within an organism's lifetime, whereas standard artificial neural networks undergo a single, large training procedure without inherited constraints. In this preliminary work, we propose a framework that incorporates this crucial generational dimension - an "outer loop" of evolution that shapes the "inner loop" of learning - so that artificial networks better mirror the effects of evolution and individual learning in biological organisms. Focusing on language, we train a model that inherits a "model connectome" from the outer evolution loop before exposing it to a developmental-scale corpus of 100M tokens. Compared with two closely matched control models, we show that the connectome model performs better or on par on natural language processing tasks as well as alignment to human behavior and brain data. These findings suggest that a model connectome serves as an efficient prior for learning in low-data regimes - narrowing the gap between single-generation artificial models and biologically evolved neural networks.
title	Model Connectomes: A Generational Approach to Data-Efficient Language Models
topic	Machine Learning Artificial Intelligence
url	https://arxiv.org/abs/2504.21047

Similar Items