Saved in:
Bibliographic Details
Main Authors: Wenzel, Tizian, Santin, Gabriele, Haasdonk, Bernard
Format: Preprint
Published: 2021
Subjects:
Online Access:https://arxiv.org/abs/2105.07228
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914043171700736
author Wenzel, Tizian
Santin, Gabriele
Haasdonk, Bernard
author_facet Wenzel, Tizian
Santin, Gabriele
Haasdonk, Bernard
contents In this paper, we leverage a recent deep kernel representer theorem to connect kernel based learning and (deep) neural networks in order to understand their interplay. In particular, we show that the use of special types of kernels yields models reminiscent of neural networks that are founded in the same theoretical framework of classical kernel methods, while benefiting from the computational advantages of deep neural networks. Especially the introduced Structured Deep Kernel Networks (SDKNs) can be viewed as neural networks (NNs) with optimizable activation functions obeying a representer theorem. This link allows us to analyze also NNs within the framework of kernel networks. We prove analytic properties of the SDKNs which show their universal approximation properties in three different asymptotic regimes of unbounded number of centers, width and depth. Especially in the case of unbounded depth, more accurate constructions can be achieved using fewer layers compared to corresponding constructions for ReLU neural networks. This is made possible by leveraging properties of kernel approximation.
format Preprint
id arxiv_https___arxiv_org_abs_2105_07228
institution arXiv
publishDate 2021
record_format arxiv
spellingShingle Analysis of Structured Deep Kernel Networks
Wenzel, Tizian
Santin, Gabriele
Haasdonk, Bernard
Machine Learning
In this paper, we leverage a recent deep kernel representer theorem to connect kernel based learning and (deep) neural networks in order to understand their interplay. In particular, we show that the use of special types of kernels yields models reminiscent of neural networks that are founded in the same theoretical framework of classical kernel methods, while benefiting from the computational advantages of deep neural networks. Especially the introduced Structured Deep Kernel Networks (SDKNs) can be viewed as neural networks (NNs) with optimizable activation functions obeying a representer theorem. This link allows us to analyze also NNs within the framework of kernel networks. We prove analytic properties of the SDKNs which show their universal approximation properties in three different asymptotic regimes of unbounded number of centers, width and depth. Especially in the case of unbounded depth, more accurate constructions can be achieved using fewer layers compared to corresponding constructions for ReLU neural networks. This is made possible by leveraging properties of kernel approximation.
title Analysis of Structured Deep Kernel Networks
topic Machine Learning
url https://arxiv.org/abs/2105.07228