Saved in:
Bibliographic Details
Main Authors: Kimpton, Louise, Challenor, Peter, Wynn, Henry
Format: Preprint
Published: 2022
Subjects:
Online Access:https://arxiv.org/abs/2211.16921
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917689371394048
author Kimpton, Louise
Challenor, Peter
Wynn, Henry
author_facet Kimpton, Louise
Challenor, Peter
Wynn, Henry
contents Binary time series data are very common in many applications, and are typically modelled independently via a Bernoulli process with a single probability of success. However, the probability of a success can be dependent on the outcome successes of past events. Presented here is a novel approach for modelling binary time series data called a binary de Bruijn process which takes into account temporal correlation. The structure is derived from de Bruijn Graphs - a directed graph, where given a set of symbols, V, and a 'word' length, m, the nodes of the graph consist of all possible sequences of V of length m. De Bruijn Graphs are equivalent to mth order Markov chains, where the 'word' length controls the number of states that each individual state is dependent on. This increases correlation over a wider area. To quantify how clustered a sequence generated from a de Bruijn process is, the run lengths of letters are observed along with run length properties. Inference is also presented along with two application examples: precipitation data and the Oxford and Cambridge boat race.
format Preprint
id arxiv_https___arxiv_org_abs_2211_16921
institution arXiv
publishDate 2022
record_format arxiv
spellingShingle Binary De Bruijn Processes
Kimpton, Louise
Challenor, Peter
Wynn, Henry
Methodology
Statistics Theory
Binary time series data are very common in many applications, and are typically modelled independently via a Bernoulli process with a single probability of success. However, the probability of a success can be dependent on the outcome successes of past events. Presented here is a novel approach for modelling binary time series data called a binary de Bruijn process which takes into account temporal correlation. The structure is derived from de Bruijn Graphs - a directed graph, where given a set of symbols, V, and a 'word' length, m, the nodes of the graph consist of all possible sequences of V of length m. De Bruijn Graphs are equivalent to mth order Markov chains, where the 'word' length controls the number of states that each individual state is dependent on. This increases correlation over a wider area. To quantify how clustered a sequence generated from a de Bruijn process is, the run lengths of letters are observed along with run length properties. Inference is also presented along with two application examples: precipitation data and the Oxford and Cambridge boat race.
title Binary De Bruijn Processes
topic Methodology
Statistics Theory
url https://arxiv.org/abs/2211.16921