Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Ali, Sarwan, Ali, Tamkanat E, Khan, Imdad Ullah, Patterson, Murray
Format: Preprint
Veröffentlicht: 2024
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2412.20617
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866916545730445312
author Ali, Sarwan
Ali, Tamkanat E
Khan, Imdad Ullah
Patterson, Murray
author_facet Ali, Sarwan
Ali, Tamkanat E
Khan, Imdad Ullah
Patterson, Murray
contents In the realm of data analysis and bioinformatics, representing time series data in a manner akin to biological sequences offers a novel approach to leverage sequence analysis techniques. Transforming time series signals into molecular sequence-type representations allows us to enhance pattern recognition by applying sophisticated sequence analysis techniques (e.g. $k$-mers based representation) developed in bioinformatics, uncovering hidden patterns and relationships in complex, non-linear time series data. This paper proposes a method to transform time series signals into biological/molecular sequence-type representations using a unique alphabetic mapping technique. By generating 26 ranges corresponding to the 26 letters of the English alphabet, each value within the time series is mapped to a specific character based on its range. This conversion facilitates the application of sequence analysis algorithms, typically used in bioinformatics, to analyze time series data. We demonstrate the effectiveness of this approach by converting real-world time series signals into character sequences and performing sequence classification. The resulting sequences can be utilized for various sequence-based analysis techniques, offering a new perspective on time series data representation and analysis.
format Preprint
id arxiv_https___arxiv_org_abs_2412_20617
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Converting Time Series Data to Numeric Representations Using Alphabetic Mapping and k-mer strategy
Ali, Sarwan
Ali, Tamkanat E
Khan, Imdad Ullah
Patterson, Murray
Machine Learning
In the realm of data analysis and bioinformatics, representing time series data in a manner akin to biological sequences offers a novel approach to leverage sequence analysis techniques. Transforming time series signals into molecular sequence-type representations allows us to enhance pattern recognition by applying sophisticated sequence analysis techniques (e.g. $k$-mers based representation) developed in bioinformatics, uncovering hidden patterns and relationships in complex, non-linear time series data. This paper proposes a method to transform time series signals into biological/molecular sequence-type representations using a unique alphabetic mapping technique. By generating 26 ranges corresponding to the 26 letters of the English alphabet, each value within the time series is mapped to a specific character based on its range. This conversion facilitates the application of sequence analysis algorithms, typically used in bioinformatics, to analyze time series data. We demonstrate the effectiveness of this approach by converting real-world time series signals into character sequences and performing sequence classification. The resulting sequences can be utilized for various sequence-based analysis techniques, offering a new perspective on time series data representation and analysis.
title Converting Time Series Data to Numeric Representations Using Alphabetic Mapping and k-mer strategy
topic Machine Learning
url https://arxiv.org/abs/2412.20617