Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Author:	Tamim, Abdulrahman
Format:	Preprint
Published:	2025
Subjects:	Machine Learning 68T07, 18B99, 55N35 I.2.6; F.4.1; G.2.2
Online Access:	https://arxiv.org/abs/2510.04376
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909825874526208
author	Tamim, Abdulrahman
author_facet	Tamim, Abdulrahman
contents	Neural network training is typically viewed as gradient descent on a loss surface. We propose a fundamentally different perspective: learning is a structure-preserving transformation (a functor L) between the space of network parameters (Param) and the space of learned representations (Rep). This categorical framework reveals that different training runs producing similar test performance often belong to the same homotopy class (continuous deformation family) of optimization paths. We show experimentally that networks converging via homotopic trajectories generalize within 0.5% accuracy of each other, while non-homotopic paths differ by over 3%. The theory provides practical tools: persistent homology identifies stable minima predictive of generalization (R^2 = 0.82 correlation), pullback constructions formalize transfer learning, and 2-categorical structures explain when different optimization algorithms yield functionally equivalent models. These categorical invariants offer both theoretical insight into why deep learning works and concrete algorithmic principles for training more robust networks.
format	Preprint
id	arxiv_https___arxiv_org_abs_2510_04376
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Categorical Invariants of Learning Dynamics Tamim, Abdulrahman Machine Learning 68T07, 18B99, 55N35 I.2.6; F.4.1; G.2.2 Neural network training is typically viewed as gradient descent on a loss surface. We propose a fundamentally different perspective: learning is a structure-preserving transformation (a functor L) between the space of network parameters (Param) and the space of learned representations (Rep). This categorical framework reveals that different training runs producing similar test performance often belong to the same homotopy class (continuous deformation family) of optimization paths. We show experimentally that networks converging via homotopic trajectories generalize within 0.5% accuracy of each other, while non-homotopic paths differ by over 3%. The theory provides practical tools: persistent homology identifies stable minima predictive of generalization (R^2 = 0.82 correlation), pullback constructions formalize transfer learning, and 2-categorical structures explain when different optimization algorithms yield functionally equivalent models. These categorical invariants offer both theoretical insight into why deep learning works and concrete algorithmic principles for training more robust networks.
title	Categorical Invariants of Learning Dynamics
topic	Machine Learning 68T07, 18B99, 55N35 I.2.6; F.4.1; G.2.2
url	https://arxiv.org/abs/2510.04376

Similar Items