Saved in:
Bibliographic Details
Main Authors: Gentile, David, Huang, Joshua, Murphy, James M.
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.01989
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917306036125696
author Gentile, David
Huang, Joshua
Murphy, James M.
author_facet Gentile, David
Huang, Joshua
Murphy, James M.
contents Change point detection for time series analysis is a difficult and important problem in applied statistics, for which a variety of approaches have been developed in the past several decades. Here, the Wasserstein metric is employed as a tool for change-point identification in multi-dimensional time series data in order to identify clusters in time series in an unsupervised way. We leverage the simplicity of the optimal transport cost in the 1-dimensional setting to quickly identify both a segmentation (family of change points for a trajectory) and a clustering for the data when the number of segments is much smaller than the number of data points, making no parametric assumptions about the particular distributions involved. Our change point detection method scales linearly in the size of the data and in the dimension of the samples. We test our approach on idealized synthetic data trajectories, as well as real world trajectories coming from the domain of molecular dynamics simulations and underwater acoustics. We find that segmenting these time series via change points obtained by estimating the Wasserstein metric derivative and then clustering the identified segments as measures with similarity measured by the Wasserstein metric, successfully identifies metastable states in the law of the processes.
format Preprint
id arxiv_https___arxiv_org_abs_2603_01989
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Wasserstein-based identification of metastable states in time series data via change point detection and segment clustering
Gentile, David
Huang, Joshua
Murphy, James M.
Statistics Theory
Change point detection for time series analysis is a difficult and important problem in applied statistics, for which a variety of approaches have been developed in the past several decades. Here, the Wasserstein metric is employed as a tool for change-point identification in multi-dimensional time series data in order to identify clusters in time series in an unsupervised way. We leverage the simplicity of the optimal transport cost in the 1-dimensional setting to quickly identify both a segmentation (family of change points for a trajectory) and a clustering for the data when the number of segments is much smaller than the number of data points, making no parametric assumptions about the particular distributions involved. Our change point detection method scales linearly in the size of the data and in the dimension of the samples. We test our approach on idealized synthetic data trajectories, as well as real world trajectories coming from the domain of molecular dynamics simulations and underwater acoustics. We find that segmenting these time series via change points obtained by estimating the Wasserstein metric derivative and then clustering the identified segments as measures with similarity measured by the Wasserstein metric, successfully identifies metastable states in the law of the processes.
title Wasserstein-based identification of metastable states in time series data via change point detection and segment clustering
topic Statistics Theory
url https://arxiv.org/abs/2603.01989