Saved in:
Bibliographic Details
Main Authors: Pasteris, Stephen, Hicks, Chris, Mavroudis, Vasilios
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.15883
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Running backpropagation end to end on large neural networks is fraught with difficulties like vanishing gradients and degradation. In this paper we present an alternative architecture composed of many small neural networks that interact with one another. Instead of propagating gradients back through the architecture we propagate vector-valued messages computed via forward passes, which are then used to update the parameters. Currently the performance is conjectured as we are yet to implement the architecture. However, we do back it up with some theory. A previous version of this paper was entitled "Fusion encoder networks" and detailed a slightly different architecture.