MARC21: :: Library Catalog

Salvato in:

Dettagli Bibliografici
Autori principali:	Sule, Shashank, Spencer, Richard G., Czaja, Wojciech
Natura:	Preprint
Pubblicazione:	2023
Soggetti:	Machine Learning Numerical Analysis Signal Processing
Accesso online:	https://arxiv.org/abs/2301.07820
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

_version_	1866916377899565056
author	Sule, Shashank Spencer, Richard G. Czaja, Wojciech
author_facet	Sule, Shashank Spencer, Richard G. Czaja, Wojciech
contents	We characterize the exact solutions to neural network descrambling--a mathematical model for explaining the fully connected layers of trained neural networks (NNs). By reformulating the problem to the minimization of the Brockett function arising in graph matching and complexity theory we show that the principal components of the hidden layer preactivations can be characterized as the optimal explainers or descramblers for the layer weights, leading to descrambled weight matrices. We show that in typical deep learning contexts these descramblers take diverse and interesting forms including (1) matching largest principal components with the lowest frequency modes of the Fourier basis for isotropic hidden data, (2) discovering the semantic development in two-layer linear NNs for signal recovery problems, and (3) explaining CNNs by optimally permuting the neurons. Our numerical experiments indicate that the eigendecompositions of the hidden layer data--now understood as the descramblers--can also reveal the layer's underlying transformation. These results illustrate that the SVD is more directly related to the explainability of NNs than previously thought and offers a promising avenue for discovering interpretable motifs for the hidden action of NNs, especially in contexts of operator learning or physics-informed NNs, where the input/output data has limited human readability.
format	Preprint
id	arxiv_https___arxiv_org_abs_2301_07820
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	On the limits of neural network explainability via descrambling Sule, Shashank Spencer, Richard G. Czaja, Wojciech Machine Learning Numerical Analysis Signal Processing We characterize the exact solutions to neural network descrambling--a mathematical model for explaining the fully connected layers of trained neural networks (NNs). By reformulating the problem to the minimization of the Brockett function arising in graph matching and complexity theory we show that the principal components of the hidden layer preactivations can be characterized as the optimal explainers or descramblers for the layer weights, leading to descrambled weight matrices. We show that in typical deep learning contexts these descramblers take diverse and interesting forms including (1) matching largest principal components with the lowest frequency modes of the Fourier basis for isotropic hidden data, (2) discovering the semantic development in two-layer linear NNs for signal recovery problems, and (3) explaining CNNs by optimally permuting the neurons. Our numerical experiments indicate that the eigendecompositions of the hidden layer data--now understood as the descramblers--can also reveal the layer's underlying transformation. These results illustrate that the SVD is more directly related to the explainability of NNs than previously thought and offers a promising avenue for discovering interpretable motifs for the hidden action of NNs, especially in contexts of operator learning or physics-informed NNs, where the input/output data has limited human readability.
title	On the limits of neural network explainability via descrambling
topic	Machine Learning Numerical Analysis Signal Processing
url	https://arxiv.org/abs/2301.07820

Documenti analoghi