Saved in:
Bibliographic Details
Main Authors: Kengne, William, Wade, Modou
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2406.08321
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913930801053696
author Kengne, William
Wade, Modou
author_facet Kengne, William
Wade, Modou
contents The explicit regularization and optimality of deep neural networks estimators from independent data have made considerable progress recently. The study of such properties on dependent data is still a challenge. In this paper, we carry out deep learning from strongly mixing observations, and deal with the squared and a broad class of loss functions. We consider sparse-penalized regularization for deep neural network predictor. For a general framework that includes, regression estimation, classification, time series prediction,$\cdots$, oracle inequality for the expected excess risk is established and a bound on the class of Hölder smooth functions is provided. For nonparametric regression from strong mixing data and sub-exponentially error, we provide an oracle inequality for the $L_2$ error and investigate an upper bound of this error on a class of Hölder composition functions. For the specific case of nonparametric autoregression with Gaussian and Laplace errors, a lower bound of the $L_2$ error on this Hölder composition class is established. Up to logarithmic factor, this bound matches its upper bound; so, the deep neural network estimator attains the minimax optimal rate.
format Preprint
id arxiv_https___arxiv_org_abs_2406_08321
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Deep learning from strongly mixing observations: Sparse-penalized regularization and minimax optimality
Kengne, William
Wade, Modou
Machine Learning
The explicit regularization and optimality of deep neural networks estimators from independent data have made considerable progress recently. The study of such properties on dependent data is still a challenge. In this paper, we carry out deep learning from strongly mixing observations, and deal with the squared and a broad class of loss functions. We consider sparse-penalized regularization for deep neural network predictor. For a general framework that includes, regression estimation, classification, time series prediction,$\cdots$, oracle inequality for the expected excess risk is established and a bound on the class of Hölder smooth functions is provided. For nonparametric regression from strong mixing data and sub-exponentially error, we provide an oracle inequality for the $L_2$ error and investigate an upper bound of this error on a class of Hölder composition functions. For the specific case of nonparametric autoregression with Gaussian and Laplace errors, a lower bound of the $L_2$ error on this Hölder composition class is established. Up to logarithmic factor, this bound matches its upper bound; so, the deep neural network estimator attains the minimax optimal rate.
title Deep learning from strongly mixing observations: Sparse-penalized regularization and minimax optimality
topic Machine Learning
url https://arxiv.org/abs/2406.08321