Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Tepper, Mariano, Bhati, Ishwar Singh, Aguerrebere, Cecilia, Willke, Ted
Format:	Preprint
Published:	2024
Subjects:	Information Retrieval Machine Learning
Online Access:	https://arxiv.org/abs/2410.22347
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866910677768077312
author	Tepper, Mariano Bhati, Ishwar Singh Aguerrebere, Cecilia Willke, Ted
author_facet	Tepper, Mariano Bhati, Ishwar Singh Aguerrebere, Cecilia Willke, Ted
contents	Embedding models can generate high-dimensional vectors whose similarity reflects semantic affinities. Thus, accurately and timely retrieving those vectors in a large collection that are similar to a given query has become a critical component of a wide range of applications. In particular, cross-modal retrieval (e.g., where a text query is used to find images) is gaining momentum rapidly. Here, it is challenging to achieve high accuracy as the queries often have different statistical distributions than the database vectors. Moreover, the high vector dimensionality puts these search systems under compute and memory pressure, leading to subpar performance. In this work, we present new linear and nonlinear methods for dimensionality reduction to accelerate high-dimensional vector search while maintaining accuracy in settings with in-distribution (ID) and out-of-distribution (OOD) queries. The linear LeanVec-Sphering outperforms other linear methods, trains faster, comes with no hyperparameters, and allows to set the target dimensionality more flexibly. The nonlinear Generalized LeanVec (GleanVec) uses a piecewise linear scheme to further improve the search accuracy while remaining computationally nimble. Initial experimental results show that LeanVec-Sphering and GleanVec push the state of the art for vector search.
format	Preprint
id	arxiv_https___arxiv_org_abs_2410_22347
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	GleanVec: Accelerating vector search with minimalist nonlinear dimensionality reduction Tepper, Mariano Bhati, Ishwar Singh Aguerrebere, Cecilia Willke, Ted Information Retrieval Machine Learning Embedding models can generate high-dimensional vectors whose similarity reflects semantic affinities. Thus, accurately and timely retrieving those vectors in a large collection that are similar to a given query has become a critical component of a wide range of applications. In particular, cross-modal retrieval (e.g., where a text query is used to find images) is gaining momentum rapidly. Here, it is challenging to achieve high accuracy as the queries often have different statistical distributions than the database vectors. Moreover, the high vector dimensionality puts these search systems under compute and memory pressure, leading to subpar performance. In this work, we present new linear and nonlinear methods for dimensionality reduction to accelerate high-dimensional vector search while maintaining accuracy in settings with in-distribution (ID) and out-of-distribution (OOD) queries. The linear LeanVec-Sphering outperforms other linear methods, trains faster, comes with no hyperparameters, and allows to set the target dimensionality more flexibly. The nonlinear Generalized LeanVec (GleanVec) uses a piecewise linear scheme to further improve the search accuracy while remaining computationally nimble. Initial experimental results show that LeanVec-Sphering and GleanVec push the state of the art for vector search.
title	GleanVec: Accelerating vector search with minimalist nonlinear dimensionality reduction
topic	Information Retrieval Machine Learning
url	https://arxiv.org/abs/2410.22347

Similar Items