Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Chen, Zhiheng, Fang, Guanhua, Yu, Wen
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2406.00630
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911898936541184
author	Chen, Zhiheng Fang, Guanhua Yu, Wen
author_facet	Chen, Zhiheng Fang, Guanhua Yu, Wen
contents	Temporal point process (TPP) is an important tool for modeling and predicting irregularly timed events across various domains. Recently, the recurrent neural network (RNN)-based TPPs have shown practical advantages over traditional parametric TPP models. However, in the current literature, it remains nascent in understanding neural TPPs from theoretical viewpoints. In this paper, we establish the excess risk bounds of RNN-TPPs under many well-known TPP settings. We especially show that an RNN-TPP with no more than four layers can achieve vanishing generalization errors. Our technical contributions include the characterization of the complexity of the multi-layer RNN class, the construction of $\tanh$ neural networks for approximating dynamic event intensity functions, and the truncation technique for alleviating the issue of unbounded event sequences. Our results bridge the gap between TPP's application and neural network theory.
format	Preprint
id	arxiv_https___arxiv_org_abs_2406_00630
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	On Non-asymptotic Theory of Recurrent Neural Networks in Temporal Point Processes Chen, Zhiheng Fang, Guanhua Yu, Wen Machine Learning Temporal point process (TPP) is an important tool for modeling and predicting irregularly timed events across various domains. Recently, the recurrent neural network (RNN)-based TPPs have shown practical advantages over traditional parametric TPP models. However, in the current literature, it remains nascent in understanding neural TPPs from theoretical viewpoints. In this paper, we establish the excess risk bounds of RNN-TPPs under many well-known TPP settings. We especially show that an RNN-TPP with no more than four layers can achieve vanishing generalization errors. Our technical contributions include the characterization of the complexity of the multi-layer RNN class, the construction of $\tanh$ neural networks for approximating dynamic event intensity functions, and the truncation technique for alleviating the issue of unbounded event sequences. Our results bridge the gap between TPP's application and neural network theory.
title	On Non-asymptotic Theory of Recurrent Neural Networks in Temporal Point Processes
topic	Machine Learning
url	https://arxiv.org/abs/2406.00630

Similar Items