Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Traylor, Aaron, Merullo, Jack, Frank, Michael J., Pavlick, Ellie
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence I.2.6
Online Access:	https://arxiv.org/abs/2402.08211
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910328816664576
author	Traylor, Aaron Merullo, Jack Frank, Michael J. Pavlick, Ellie
author_facet	Traylor, Aaron Merullo, Jack Frank, Michael J. Pavlick, Ellie
contents	Models based on the Transformer neural network architecture have seen success on a wide variety of tasks that appear to require complex "cognitive branching" -- or the ability to maintain pursuit of one goal while accomplishing others. In cognitive neuroscience, success on such tasks is thought to rely on sophisticated frontostriatal mechanisms for selective \textit{gating}, which enable role-addressable updating -- and later readout -- of information to and from distinct "addresses" of memory, in the form of clusters of neurons. However, Transformer models have no such mechanisms intentionally built-in. It is thus an open question how Transformers solve such tasks, and whether the mechanisms that emerge to help them to do so bear any resemblance to the gating mechanisms in the human brain. In this work, we analyze the mechanisms that emerge within a vanilla attention-only Transformer trained on a simple sequence modeling task inspired by a task explicitly designed to study working memory gating in computational cognitive neuroscience. We find that, as a result of training, the self-attention mechanism within the Transformer specializes in a way that mirrors the input and output gating mechanisms which were explicitly incorporated into earlier, more biologically-inspired architectures. These results suggest opportunities for future research on computational similarities between modern AI architectures and models of the human brain.
format	Preprint
id	arxiv_https___arxiv_org_abs_2402_08211
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Transformer Mechanisms Mimic Frontostriatal Gating Operations When Trained on Human Working Memory Tasks Traylor, Aaron Merullo, Jack Frank, Michael J. Pavlick, Ellie Artificial Intelligence I.2.6 Models based on the Transformer neural network architecture have seen success on a wide variety of tasks that appear to require complex "cognitive branching" -- or the ability to maintain pursuit of one goal while accomplishing others. In cognitive neuroscience, success on such tasks is thought to rely on sophisticated frontostriatal mechanisms for selective \textit{gating}, which enable role-addressable updating -- and later readout -- of information to and from distinct "addresses" of memory, in the form of clusters of neurons. However, Transformer models have no such mechanisms intentionally built-in. It is thus an open question how Transformers solve such tasks, and whether the mechanisms that emerge to help them to do so bear any resemblance to the gating mechanisms in the human brain. In this work, we analyze the mechanisms that emerge within a vanilla attention-only Transformer trained on a simple sequence modeling task inspired by a task explicitly designed to study working memory gating in computational cognitive neuroscience. We find that, as a result of training, the self-attention mechanism within the Transformer specializes in a way that mirrors the input and output gating mechanisms which were explicitly incorporated into earlier, more biologically-inspired architectures. These results suggest opportunities for future research on computational similarities between modern AI architectures and models of the human brain.
title	Transformer Mechanisms Mimic Frontostriatal Gating Operations When Trained on Human Working Memory Tasks
topic	Artificial Intelligence I.2.6
url	https://arxiv.org/abs/2402.08211

Similar Items