Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Mali, Ankur, Salvatori, Tommaso, Ororbia, Alexander
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence Neural and Evolutionary Computing Optimization and Control
Online Access:	https://arxiv.org/abs/2410.04708
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916425239625728
author	Mali, Ankur Salvatori, Tommaso Ororbia, Alexander
author_facet	Mali, Ankur Salvatori, Tommaso Ororbia, Alexander
contents	Energy-based learning algorithms, such as predictive coding (PC), have garnered significant attention in the machine learning community due to their theoretical properties, such as local operations and biologically plausible mechanisms for error correction. In this work, we rigorously analyze the stability, robustness, and convergence of PC through the lens of dynamical systems theory. We show that, first, PC is Lyapunov stable under mild assumptions on its loss and residual energy functions, which implies intrinsic robustness to small random perturbations due to its well-defined energy-minimizing dynamics. Second, we formally establish that the PC updates approximate quasi-Newton methods by incorporating higher-order curvature information, which makes them more stable and able to converge with fewer iterations compared to models trained via backpropagation (BP). Furthermore, using this dynamical framework, we provide new theoretical bounds on the similarity between PC and other algorithms, i.e., BP and target propagation (TP), by precisely characterizing the role of higher-order derivatives. These bounds, derived through detailed analysis of the Hessian structures, show that PC is significantly closer to quasi-Newton updates than TP, providing a deeper understanding of the stability and efficiency of PC compared to conventional learning methods.
format	Preprint
id	arxiv_https___arxiv_org_abs_2410_04708
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Tight Stability, Convergence, and Robustness Bounds for Predictive Coding Networks Mali, Ankur Salvatori, Tommaso Ororbia, Alexander Machine Learning Artificial Intelligence Neural and Evolutionary Computing Optimization and Control Energy-based learning algorithms, such as predictive coding (PC), have garnered significant attention in the machine learning community due to their theoretical properties, such as local operations and biologically plausible mechanisms for error correction. In this work, we rigorously analyze the stability, robustness, and convergence of PC through the lens of dynamical systems theory. We show that, first, PC is Lyapunov stable under mild assumptions on its loss and residual energy functions, which implies intrinsic robustness to small random perturbations due to its well-defined energy-minimizing dynamics. Second, we formally establish that the PC updates approximate quasi-Newton methods by incorporating higher-order curvature information, which makes them more stable and able to converge with fewer iterations compared to models trained via backpropagation (BP). Furthermore, using this dynamical framework, we provide new theoretical bounds on the similarity between PC and other algorithms, i.e., BP and target propagation (TP), by precisely characterizing the role of higher-order derivatives. These bounds, derived through detailed analysis of the Hessian structures, show that PC is significantly closer to quasi-Newton updates than TP, providing a deeper understanding of the stability and efficiency of PC compared to conventional learning methods.
title	Tight Stability, Convergence, and Robustness Bounds for Predictive Coding Networks
topic	Machine Learning Artificial Intelligence Neural and Evolutionary Computing Optimization and Control
url	https://arxiv.org/abs/2410.04708

Similar Items