Saved in:
Bibliographic Details
Main Author: Tamim, Abdulrahman
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.04376
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909825874526208
author Tamim, Abdulrahman
author_facet Tamim, Abdulrahman
contents Neural network training is typically viewed as gradient descent on a loss surface. We propose a fundamentally different perspective: learning is a structure-preserving transformation (a functor L) between the space of network parameters (Param) and the space of learned representations (Rep). This categorical framework reveals that different training runs producing similar test performance often belong to the same homotopy class (continuous deformation family) of optimization paths. We show experimentally that networks converging via homotopic trajectories generalize within 0.5% accuracy of each other, while non-homotopic paths differ by over 3%. The theory provides practical tools: persistent homology identifies stable minima predictive of generalization (R^2 = 0.82 correlation), pullback constructions formalize transfer learning, and 2-categorical structures explain when different optimization algorithms yield functionally equivalent models. These categorical invariants offer both theoretical insight into why deep learning works and concrete algorithmic principles for training more robust networks.
format Preprint
id arxiv_https___arxiv_org_abs_2510_04376
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Categorical Invariants of Learning Dynamics
Tamim, Abdulrahman
Machine Learning
68T07, 18B99, 55N35
I.2.6; F.4.1; G.2.2
Neural network training is typically viewed as gradient descent on a loss surface. We propose a fundamentally different perspective: learning is a structure-preserving transformation (a functor L) between the space of network parameters (Param) and the space of learned representations (Rep). This categorical framework reveals that different training runs producing similar test performance often belong to the same homotopy class (continuous deformation family) of optimization paths. We show experimentally that networks converging via homotopic trajectories generalize within 0.5% accuracy of each other, while non-homotopic paths differ by over 3%. The theory provides practical tools: persistent homology identifies stable minima predictive of generalization (R^2 = 0.82 correlation), pullback constructions formalize transfer learning, and 2-categorical structures explain when different optimization algorithms yield functionally equivalent models. These categorical invariants offer both theoretical insight into why deep learning works and concrete algorithmic principles for training more robust networks.
title Categorical Invariants of Learning Dynamics
topic Machine Learning
68T07, 18B99, 55N35
I.2.6; F.4.1; G.2.2
url https://arxiv.org/abs/2510.04376