Staff View: Improved Random Features for Dot Product Kernels :: Library Catalog

Bibliographic Details
Main Authors:	Wacker, Jonas; Kanagawa, Motonobu; Filippone, Maurizio
Format:	Preprint
Published:	2022
Subjects:	Machine Learning; Computation
Online Access:	https://arxiv.org/abs/2201.08712

_version_	1866929456318251008
author	Wacker, Jonas; Kanagawa, Motonobu; Filippone, Maurizio
author_facet	Wacker, Jonas; Kanagawa, Motonobu; Filippone, Maurizio
contents	Dot product kernels, such as polynomial and exponential (softmax) kernels, are among the most widely used kernels in machine learning, as they enable modeling the interactions between input features, which is crucial in applications like computer vision, natural language processing, and recommender systems. We make several novel contributions for improving the efficiency of random feature approximations for dot product kernels, to make these kernels more useful in large scale learning. First, we present a generalization of existing random feature approximations for polynomial kernels, such as Rademacher and Gaussian sketches and TensorSRHT, using complex-valued random features. We show empirically that the use of complex features can significantly reduce the variances of these approximations. Second, we provide a theoretical analysis for understanding the factors affecting the efficiency of various random feature approximations, by deriving closed-form expressions for their variances. These variance formulas elucidate conditions under which certain approximations (e.g., TensorSRHT) achieve lower variances than others (e.g., Rademacher sketches), and conditions under which the use of complex features leads to lower variances than real features. Third, by using these variance formulas, which can be evaluated in practice, we develop a data-driven optimization approach to improve random feature approximations for general dot product kernels, which is also applicable to the Gaussian kernel. We describe the improvements brought by these contributions with extensive experiments on a variety of tasks and datasets.
format	Preprint
id	arxiv_https___arxiv_org_abs_2201_08712
institution	arXiv
publishDate	2022
record_format	arxiv
title	Improved Random Features for Dot Product Kernels
topic	Machine Learning; Computation
url	https://arxiv.org/abs/2201.08712
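
Illustrative note: the abstract in this record refers to Rademacher sketches for polynomial kernels and to their complex-valued generalization. The sketch below is a minimal illustration of that general idea, not the authors' implementation. It assumes the kernel k(x, y) = (x . y)^p with real inputs, and it assumes that the complex variant draws projection entries uniformly from {1, -1, i, -i}. The function names `rademacher_features` and `approx_kernel` are hypothetical.

```python
import numpy as np

def rademacher_features(X, degree, num_features, rng, complex_valued=False):
    """Random features for k(x, y) = (x . y) ** degree (illustrative sketch).

    Each feature is a product of `degree` independent random projections.
    Real case: projection entries are uniform on {+1, -1}.
    Complex case (assumed here): entries are uniform on {1, -1, 1j, -1j}.
    """
    n, d = X.shape
    dtype = complex if complex_valued else float
    Z = np.ones((n, num_features), dtype=dtype)
    for _ in range(degree):
        if complex_valued:
            W = rng.choice(np.array([1, -1, 1j, -1j]), size=(d, num_features))
        else:
            W = rng.choice(np.array([1.0, -1.0]), size=(d, num_features))
        Z *= X @ W  # multiply in one more independent projection per feature
    return Z / np.sqrt(num_features)

def approx_kernel(Zx, Zy):
    """Kernel estimate; the conjugate is a no-op for real features."""
    return np.real(Zx @ Zy.conj().T)

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(5, 4))
    X /= np.linalg.norm(X, axis=1, keepdims=True)  # unit-norm rows
    K_exact = (X @ X.T) ** 2
    for flag in (False, True):
        Z = rademacher_features(X, 2, 20000, rng, complex_valued=flag)
        err = np.max(np.abs(approx_kernel(Z, Z) - K_exact))
        print("complex" if flag else "real", "max abs error:", err)
```

Both variants are unbiased because the projection vectors satisfy E[w w^H] = I, so each factor has expectation x . y and the independent factors multiply to (x . y)^p. Which variant has lower variance depends on the data, which is the question the record's abstract says the paper analyzes.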

Similar Items