Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Calo, Sergio, Jonsson, Anders, Neu, Gergely, Schwartz, Ludovic, Segovia-Aguas, Javier
Format: Preprint
Veröffentlicht: 2025
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2505.18005
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866915301501698048
author Calo, Sergio
Jonsson, Anders
Neu, Gergely
Schwartz, Ludovic
Segovia-Aguas, Javier
author_facet Calo, Sergio
Jonsson, Anders
Neu, Gergely
Schwartz, Ludovic
Segovia-Aguas, Javier
contents Bisimulation metrics are powerful tools for measuring similarities between stochastic processes, and specifically Markov chains. Recent advances have uncovered that bisimulation metrics are, in fact, optimal-transport distances, which has enabled the development of fast algorithms for computing such metrics with provable accuracy and runtime guarantees. However, these recent methods, as well as all previously known methods, assume full knowledge of the transition dynamics. This is often an impractical assumption in most real-world scenarios, where typically only sample trajectories are available. In this work, we propose a stochastic optimization method that addresses this limitation and estimates bisimulation metrics based on sample access, without requiring explicit transition models. Our approach is derived from a new linear programming (LP) formulation of bisimulation metrics, which we solve using a stochastic primal-dual optimization method. We provide theoretical guarantees on the sample complexity of the algorithm and validate its effectiveness through a series of empirical evaluations.
format Preprint
id arxiv_https___arxiv_org_abs_2505_18005
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Distances for Markov chains from sample streams
Calo, Sergio
Jonsson, Anders
Neu, Gergely
Schwartz, Ludovic
Segovia-Aguas, Javier
Machine Learning
Bisimulation metrics are powerful tools for measuring similarities between stochastic processes, and specifically Markov chains. Recent advances have uncovered that bisimulation metrics are, in fact, optimal-transport distances, which has enabled the development of fast algorithms for computing such metrics with provable accuracy and runtime guarantees. However, these recent methods, as well as all previously known methods, assume full knowledge of the transition dynamics. This is often an impractical assumption in most real-world scenarios, where typically only sample trajectories are available. In this work, we propose a stochastic optimization method that addresses this limitation and estimates bisimulation metrics based on sample access, without requiring explicit transition models. Our approach is derived from a new linear programming (LP) formulation of bisimulation metrics, which we solve using a stochastic primal-dual optimization method. We provide theoretical guarantees on the sample complexity of the algorithm and validate its effectiveness through a series of empirical evaluations.
title Distances for Markov chains from sample streams
topic Machine Learning
url https://arxiv.org/abs/2505.18005