Saved in:
Bibliographic Details
Main Authors: Hayashi, Koby, Aksoy, Sinan G., Ballard, Grey, Park, Haesun
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.08134
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909406876139520
author Hayashi, Koby
Aksoy, Sinan G.
Ballard, Grey
Park, Haesun
author_facet Hayashi, Koby
Aksoy, Sinan G.
Ballard, Grey
Park, Haesun
contents Symmetric Nonnegative Matrix Factorization (SymNMF) is a technique in data analysis and machine learning that approximates a symmetric matrix with a product of a nonnegative, low-rank matrix and its transpose. To design faster and more scalable algorithms for SymNMF we develop two randomized algorithms for its computation. The first algorithm uses randomized matrix sketching to compute an initial low-rank approximation to the input matrix and proceeds to rapidly compute a SymNMF of the approximation. The second algorithm uses randomized leverage score sampling to approximately solve constrained least squares problems. Many successful methods for SymNMF rely on (approximately) solving sequences of constrained least squares problems. We prove theoretically that leverage score sampling can approximately solve nonnegative least squares problems to a chosen accuracy with high probability. Additionally, we prove sampling complexity results for previously proposed hybrid sampling techniques which deterministically include high leverage score rows. This hybrid scheme is crucial for obtaining speeds ups in practice. Finally we demonstrate that both methods work well in practice by applying them to graph clustering tasks on large real world data sets. These experiments show that our methods approximately maintain solution quality and achieve significant speed ups for both large dense and large sparse problems.
format Preprint
id arxiv_https___arxiv_org_abs_2402_08134
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Randomized Algorithms for Symmetric Nonnegative Matrix Factorization
Hayashi, Koby
Aksoy, Sinan G.
Ballard, Grey
Park, Haesun
Machine Learning
Numerical Analysis
Optimization and Control
65F55, 65F20
Symmetric Nonnegative Matrix Factorization (SymNMF) is a technique in data analysis and machine learning that approximates a symmetric matrix with a product of a nonnegative, low-rank matrix and its transpose. To design faster and more scalable algorithms for SymNMF we develop two randomized algorithms for its computation. The first algorithm uses randomized matrix sketching to compute an initial low-rank approximation to the input matrix and proceeds to rapidly compute a SymNMF of the approximation. The second algorithm uses randomized leverage score sampling to approximately solve constrained least squares problems. Many successful methods for SymNMF rely on (approximately) solving sequences of constrained least squares problems. We prove theoretically that leverage score sampling can approximately solve nonnegative least squares problems to a chosen accuracy with high probability. Additionally, we prove sampling complexity results for previously proposed hybrid sampling techniques which deterministically include high leverage score rows. This hybrid scheme is crucial for obtaining speeds ups in practice. Finally we demonstrate that both methods work well in practice by applying them to graph clustering tasks on large real world data sets. These experiments show that our methods approximately maintain solution quality and achieve significant speed ups for both large dense and large sparse problems.
title Randomized Algorithms for Symmetric Nonnegative Matrix Factorization
topic Machine Learning
Numerical Analysis
Optimization and Control
65F55, 65F20
url https://arxiv.org/abs/2402.08134