Saved in:
Bibliographic Details
Main Authors: Yi, Mingyang, Wang, Bohan
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2401.13530
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909884864266240
author Yi, Mingyang
Wang, Bohan
author_facet Yi, Mingyang
Wang, Bohan
contents Recently, optimization on the Riemannian manifold have provided valuable insights to the optimization community. In this regard, extending these methods to to the Wasserstein space is of particular interest, since optimization on Wasserstein space is closely connected to practical sampling processes. Generally, the standard (continuous) optimization method on Wasserstein space is Riemannian gradient flow (i.e., Langevin dynamics when minimizing KL divergence). In this paper, we aim to enrich the family of continuous optimization methods in the Wasserstein space, by extending the gradient flow on it into the stochastic gradient descent (SGD) flow and stochastic variance reduction gradient (SVRG) flow. By leveraging the property of Wasserstein space, we construct stochastic differential equations (SDEs) to approximate the corresponding discrete Euclidean dynamics of the desired Riemannian stochastic methods. Then, we obtain the flows in Wasserstein space by Fokker-Planck equation. Finally, we establish convergence rates of the proposed stochastic flows, which align with those known in the Euclidean setting.
format Preprint
id arxiv_https___arxiv_org_abs_2401_13530
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Continuous-time Riemannian SGD and SVRG Flows on Wasserstein Probabilistic Space
Yi, Mingyang
Wang, Bohan
Machine Learning
Recently, optimization on the Riemannian manifold have provided valuable insights to the optimization community. In this regard, extending these methods to to the Wasserstein space is of particular interest, since optimization on Wasserstein space is closely connected to practical sampling processes. Generally, the standard (continuous) optimization method on Wasserstein space is Riemannian gradient flow (i.e., Langevin dynamics when minimizing KL divergence). In this paper, we aim to enrich the family of continuous optimization methods in the Wasserstein space, by extending the gradient flow on it into the stochastic gradient descent (SGD) flow and stochastic variance reduction gradient (SVRG) flow. By leveraging the property of Wasserstein space, we construct stochastic differential equations (SDEs) to approximate the corresponding discrete Euclidean dynamics of the desired Riemannian stochastic methods. Then, we obtain the flows in Wasserstein space by Fokker-Planck equation. Finally, we establish convergence rates of the proposed stochastic flows, which align with those known in the Euclidean setting.
title Continuous-time Riemannian SGD and SVRG Flows on Wasserstein Probabilistic Space
topic Machine Learning
url https://arxiv.org/abs/2401.13530