Saved in:
Bibliographic Details
Main Authors: Zou, Will Y., Zhang, Jennifer Y.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2507.03221
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909675668111360
author Zou, Will Y.
Zhang, Jennifer Y.
author_facet Zou, Will Y.
Zhang, Jennifer Y.
contents To be effective, efficient, and diverse, deep learning models need to dynamically choose its architecture based on signals from a population of neurons. We hypothesize dynamic routing models can be improved with neural inhibition in those neural populations. This means signals commonly shared among the various modes of data statistics can be inhibited so that the routing model can choose a specialized expert path for each data sample. Only through inhibition is the routing mechanism able to effectively select neural pathways. We believe this is an under-studied and under-verified implementation methodology for Mixture-of-Experts, dynamic routing, and transformer language models. We provide experimental evidence that the neural inhibition algorithm significantly boosts the performance of general tasks and motivates more effort to be invested in this research direction.
format Preprint
id arxiv_https___arxiv_org_abs_2507_03221
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Neural Inhibition Improves Dynamic Routing and Mixture of Experts
Zou, Will Y.
Zhang, Jennifer Y.
Machine Learning
Artificial Intelligence
To be effective, efficient, and diverse, deep learning models need to dynamically choose its architecture based on signals from a population of neurons. We hypothesize dynamic routing models can be improved with neural inhibition in those neural populations. This means signals commonly shared among the various modes of data statistics can be inhibited so that the routing model can choose a specialized expert path for each data sample. Only through inhibition is the routing mechanism able to effectively select neural pathways. We believe this is an under-studied and under-verified implementation methodology for Mixture-of-Experts, dynamic routing, and transformer language models. We provide experimental evidence that the neural inhibition algorithm significantly boosts the performance of general tasks and motivates more effort to be invested in this research direction.
title Neural Inhibition Improves Dynamic Routing and Mixture of Experts
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2507.03221