Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Jin, Kun, Mémoli, Facundo, Smith, Zane, Wan, Zhengchao
Format:	Preprint
Published:	2018
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/1810.07793
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915932388982784
author	Jin, Kun Mémoli, Facundo Smith, Zane Wan, Zhengchao
author_facet	Jin, Kun Mémoli, Facundo Smith, Zane Wan, Zhengchao
contents	We introduce the Wasserstein Transform (WT), a general unsupervised framework for updating distance structures on given data sets with the purpose of enhancing features and denoising. Our framework represents each data point by a probability measure reflecting the neighborhood structure of the point, and then updates the distance by computing the Wasserstein distance between these probability measures. The Wasserstein Transform is a general method which extends the mean shift family of algorithms. We study several instances of WT, and in particular, in one of the instances which we call the Gaussian Transform (GT), we utilize Gaussian measures to model neighborhood structures of individual data points. GT is computationally cheaper than other instances of WT since there exists closed form solution for the $\ell^2$-Wasserstein distance between Gaussian measures. We study the relationship between different instances of WT and prove that each of the instances is stable under perturbations. We devise iterative algorithms for performing the above-mentioned WT and propose several strategies to accelerate GT, such as an observation from linear algebra for reducing the number of matrix square root computations. We examine the performance of the Wasserstein Transform method in many tasks, such as denoising, clustering, image segmentation and word embeddings.
format	Preprint
id	arxiv_https___arxiv_org_abs_1810_07793
institution	arXiv
publishDate	2018
record_format	arxiv
spellingShingle	The Wasserstein transform Jin, Kun Mémoli, Facundo Smith, Zane Wan, Zhengchao Machine Learning We introduce the Wasserstein Transform (WT), a general unsupervised framework for updating distance structures on given data sets with the purpose of enhancing features and denoising. Our framework represents each data point by a probability measure reflecting the neighborhood structure of the point, and then updates the distance by computing the Wasserstein distance between these probability measures. The Wasserstein Transform is a general method which extends the mean shift family of algorithms. We study several instances of WT, and in particular, in one of the instances which we call the Gaussian Transform (GT), we utilize Gaussian measures to model neighborhood structures of individual data points. GT is computationally cheaper than other instances of WT since there exists closed form solution for the $\ell^2$-Wasserstein distance between Gaussian measures. We study the relationship between different instances of WT and prove that each of the instances is stable under perturbations. We devise iterative algorithms for performing the above-mentioned WT and propose several strategies to accelerate GT, such as an observation from linear algebra for reducing the number of matrix square root computations. We examine the performance of the Wasserstein Transform method in many tasks, such as denoising, clustering, image segmentation and word embeddings.
title	The Wasserstein transform
topic	Machine Learning
url	https://arxiv.org/abs/1810.07793

Similar Items