Saved in:
Bibliographic Details
Main Authors: Eringis, Deividas, Leth, John, Tan, Zheng-Hua, Wisniewski, Rafal, Petreczky, Mihaly
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2312.09793
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917636690935808
author Eringis, Deividas
Leth, John
Tan, Zheng-Hua
Wisniewski, Rafal
Petreczky, Mihaly
author_facet Eringis, Deividas
Leth, John
Tan, Zheng-Hua
Wisniewski, Rafal
Petreczky, Mihaly
contents In this paper, we derive a PAC-Bayes bound on the generalisation gap, in a supervised time-series setting for a special class of discrete-time non-linear dynamical systems. This class includes stable recurrent neural networks (RNN), and the motivation for this work was its application to RNNs. In order to achieve the results, we impose some stability constraints, on the allowed models. Here, stability is understood in the sense of dynamical systems. For RNNs, these stability conditions can be expressed in terms of conditions on the weights. We assume the processes involved are essentially bounded and the loss functions are Lipschitz. The proposed bound on the generalisation gap depends on the mixing coefficient of the data distribution, and the essential supremum of the data. Furthermore, the bound converges to zero as the dataset size increases. In this paper, we 1) formalize the learning problem, 2) derive a PAC-Bayesian error bound for such systems, 3) discuss various consequences of this error bound, and 4) show an illustrative example, with discussions on computing the proposed bound. Unlike other available bounds the derived bound holds for non i.i.d. data (time-series) and it does not grow with the number of steps of the RNN.
format Preprint
id arxiv_https___arxiv_org_abs_2312_09793
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle PAC-Bayes Generalisation Bounds for Dynamical Systems Including Stable RNNs
Eringis, Deividas
Leth, John
Tan, Zheng-Hua
Wisniewski, Rafal
Petreczky, Mihaly
Machine Learning
In this paper, we derive a PAC-Bayes bound on the generalisation gap, in a supervised time-series setting for a special class of discrete-time non-linear dynamical systems. This class includes stable recurrent neural networks (RNN), and the motivation for this work was its application to RNNs. In order to achieve the results, we impose some stability constraints, on the allowed models. Here, stability is understood in the sense of dynamical systems. For RNNs, these stability conditions can be expressed in terms of conditions on the weights. We assume the processes involved are essentially bounded and the loss functions are Lipschitz. The proposed bound on the generalisation gap depends on the mixing coefficient of the data distribution, and the essential supremum of the data. Furthermore, the bound converges to zero as the dataset size increases. In this paper, we 1) formalize the learning problem, 2) derive a PAC-Bayesian error bound for such systems, 3) discuss various consequences of this error bound, and 4) show an illustrative example, with discussions on computing the proposed bound. Unlike other available bounds the derived bound holds for non i.i.d. data (time-series) and it does not grow with the number of steps of the RNN.
title PAC-Bayes Generalisation Bounds for Dynamical Systems Including Stable RNNs
topic Machine Learning
url https://arxiv.org/abs/2312.09793