Saved in:
Bibliographic Details
Main Authors: Jin, Kun, Mémoli, Facundo, Smith, Zane, Wan, Zhengchao
Format: Preprint
Published: 2018
Subjects:
Online Access:https://arxiv.org/abs/1810.07793
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915932388982784
author Jin, Kun
Mémoli, Facundo
Smith, Zane
Wan, Zhengchao
author_facet Jin, Kun
Mémoli, Facundo
Smith, Zane
Wan, Zhengchao
contents We introduce the Wasserstein Transform (WT), a general unsupervised framework for updating distance structures on given data sets with the purpose of enhancing features and denoising. Our framework represents each data point by a probability measure reflecting the neighborhood structure of the point, and then updates the distance by computing the Wasserstein distance between these probability measures. The Wasserstein Transform is a general method which extends the mean shift family of algorithms. We study several instances of WT, and in particular, in one of the instances which we call the Gaussian Transform (GT), we utilize Gaussian measures to model neighborhood structures of individual data points. GT is computationally cheaper than other instances of WT since there exists closed form solution for the $\ell^2$-Wasserstein distance between Gaussian measures. We study the relationship between different instances of WT and prove that each of the instances is stable under perturbations. We devise iterative algorithms for performing the above-mentioned WT and propose several strategies to accelerate GT, such as an observation from linear algebra for reducing the number of matrix square root computations. We examine the performance of the Wasserstein Transform method in many tasks, such as denoising, clustering, image segmentation and word embeddings.
format Preprint
id arxiv_https___arxiv_org_abs_1810_07793
institution arXiv
publishDate 2018
record_format arxiv
spellingShingle The Wasserstein transform
Jin, Kun
Mémoli, Facundo
Smith, Zane
Wan, Zhengchao
Machine Learning
We introduce the Wasserstein Transform (WT), a general unsupervised framework for updating distance structures on given data sets with the purpose of enhancing features and denoising. Our framework represents each data point by a probability measure reflecting the neighborhood structure of the point, and then updates the distance by computing the Wasserstein distance between these probability measures. The Wasserstein Transform is a general method which extends the mean shift family of algorithms. We study several instances of WT, and in particular, in one of the instances which we call the Gaussian Transform (GT), we utilize Gaussian measures to model neighborhood structures of individual data points. GT is computationally cheaper than other instances of WT since there exists closed form solution for the $\ell^2$-Wasserstein distance between Gaussian measures. We study the relationship between different instances of WT and prove that each of the instances is stable under perturbations. We devise iterative algorithms for performing the above-mentioned WT and propose several strategies to accelerate GT, such as an observation from linear algebra for reducing the number of matrix square root computations. We examine the performance of the Wasserstein Transform method in many tasks, such as denoising, clustering, image segmentation and word embeddings.
title The Wasserstein transform
topic Machine Learning
url https://arxiv.org/abs/1810.07793