Saved in:
Bibliographic Details
Main Authors: Mali, Ankur, Salvatori, Tommaso, Ororbia, Alexander
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.04708
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916425239625728
author Mali, Ankur
Salvatori, Tommaso
Ororbia, Alexander
author_facet Mali, Ankur
Salvatori, Tommaso
Ororbia, Alexander
contents Energy-based learning algorithms, such as predictive coding (PC), have garnered significant attention in the machine learning community due to their theoretical properties, such as local operations and biologically plausible mechanisms for error correction. In this work, we rigorously analyze the stability, robustness, and convergence of PC through the lens of dynamical systems theory. We show that, first, PC is Lyapunov stable under mild assumptions on its loss and residual energy functions, which implies intrinsic robustness to small random perturbations due to its well-defined energy-minimizing dynamics. Second, we formally establish that the PC updates approximate quasi-Newton methods by incorporating higher-order curvature information, which makes them more stable and able to converge with fewer iterations compared to models trained via backpropagation (BP). Furthermore, using this dynamical framework, we provide new theoretical bounds on the similarity between PC and other algorithms, i.e., BP and target propagation (TP), by precisely characterizing the role of higher-order derivatives. These bounds, derived through detailed analysis of the Hessian structures, show that PC is significantly closer to quasi-Newton updates than TP, providing a deeper understanding of the stability and efficiency of PC compared to conventional learning methods.
format Preprint
id arxiv_https___arxiv_org_abs_2410_04708
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Tight Stability, Convergence, and Robustness Bounds for Predictive Coding Networks
Mali, Ankur
Salvatori, Tommaso
Ororbia, Alexander
Machine Learning
Artificial Intelligence
Neural and Evolutionary Computing
Optimization and Control
Energy-based learning algorithms, such as predictive coding (PC), have garnered significant attention in the machine learning community due to their theoretical properties, such as local operations and biologically plausible mechanisms for error correction. In this work, we rigorously analyze the stability, robustness, and convergence of PC through the lens of dynamical systems theory. We show that, first, PC is Lyapunov stable under mild assumptions on its loss and residual energy functions, which implies intrinsic robustness to small random perturbations due to its well-defined energy-minimizing dynamics. Second, we formally establish that the PC updates approximate quasi-Newton methods by incorporating higher-order curvature information, which makes them more stable and able to converge with fewer iterations compared to models trained via backpropagation (BP). Furthermore, using this dynamical framework, we provide new theoretical bounds on the similarity between PC and other algorithms, i.e., BP and target propagation (TP), by precisely characterizing the role of higher-order derivatives. These bounds, derived through detailed analysis of the Hessian structures, show that PC is significantly closer to quasi-Newton updates than TP, providing a deeper understanding of the stability and efficiency of PC compared to conventional learning methods.
title Tight Stability, Convergence, and Robustness Bounds for Predictive Coding Networks
topic Machine Learning
Artificial Intelligence
Neural and Evolutionary Computing
Optimization and Control
url https://arxiv.org/abs/2410.04708