Salvato in:
Dettagli Bibliografici
Autori principali: Fokam, Cabrel Teguemne, Jentsch, Carsten, Lang, Michel, Pauly, Markus
Natura: Preprint
Pubblicazione: 2024
Soggetti:
Accesso online:https://arxiv.org/abs/2410.00942
Tags: Aggiungi Tag
Nessun Tag, puoi essere il primo ad aggiungerne!!
_version_ 1866912055040147456
author Fokam, Cabrel Teguemne
Jentsch, Carsten
Lang, Michel
Pauly, Markus
author_facet Fokam, Cabrel Teguemne
Jentsch, Carsten
Lang, Michel
Pauly, Markus
contents The Random Forest (RF) algorithm can be applied to a broad spectrum of problems, including time series prediction. However, neither the classical IID (Independent and Identically distributed) bootstrap nor block bootstrapping strategies (as implemented in rangerts) completely account for the nature of the Data Generating Process (DGP) while resampling the observations. We propose the combination of RF with a residual bootstrapping technique where we replace the IID bootstrap with the AR-Sieve Bootstrap (ARSB), which assumes the DGP to be an autoregressive process. To assess the new model's predictive performance, we conduct a simulation study using synthetic data generated from different types of DGPs. It turns out that ARSB provides more variation amongst the trees in the forest. Moreover, RF with ARSB shows greater accuracy compared to RF with other bootstrap strategies. However, these improvements are achieved at some efficiency costs.
format Preprint
id arxiv_https___arxiv_org_abs_2410_00942
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle AR-Sieve Bootstrap for the Random Forest and a simulation-based comparison with rangerts time series prediction
Fokam, Cabrel Teguemne
Jentsch, Carsten
Lang, Michel
Pauly, Markus
Machine Learning
The Random Forest (RF) algorithm can be applied to a broad spectrum of problems, including time series prediction. However, neither the classical IID (Independent and Identically distributed) bootstrap nor block bootstrapping strategies (as implemented in rangerts) completely account for the nature of the Data Generating Process (DGP) while resampling the observations. We propose the combination of RF with a residual bootstrapping technique where we replace the IID bootstrap with the AR-Sieve Bootstrap (ARSB), which assumes the DGP to be an autoregressive process. To assess the new model's predictive performance, we conduct a simulation study using synthetic data generated from different types of DGPs. It turns out that ARSB provides more variation amongst the trees in the forest. Moreover, RF with ARSB shows greater accuracy compared to RF with other bootstrap strategies. However, these improvements are achieved at some efficiency costs.
title AR-Sieve Bootstrap for the Random Forest and a simulation-based comparison with rangerts time series prediction
topic Machine Learning
url https://arxiv.org/abs/2410.00942