Saved in:
Bibliographic Details
Main Authors: Tuo, Rui, Zou, Lu
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2403.04248
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914000092004352
author Tuo, Rui
Zou, Lu
author_facet Tuo, Rui
Zou, Lu
contents An asymptotic theory is established for linear functionals of the predictive function given by kernel ridge regression, when the reproducing kernel Hilbert space is equivalent to a Sobolev space. The theory covers a wide variety of linear functionals, including point evaluations, evaluation of derivatives, $L_2$ inner products, etc. We establish the upper and lower bounds of the estimates and their asymptotic normality. It is shown that $λ\sim n^{-1}$ is the universal optimal order of magnitude for the smoothing parameter to balance the variance and the worst-case bias. The theory also implies that the optimal $L_\infty$ error of kernel ridge regression can be attained under the optimal smoothing parameter $λ\sim n^{-1}\log n$. These optimal rates for the smoothing parameter differ from the known optimal rate $λ\sim n^{-\frac{2m}{2m+d}}$ that minimizes the $L_2$ error of the kernel ridge regression.
format Preprint
id arxiv_https___arxiv_org_abs_2403_04248
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Asymptotic Theory for Linear Functionals of Kernel Ridge Regression
Tuo, Rui
Zou, Lu
Statistics Theory
An asymptotic theory is established for linear functionals of the predictive function given by kernel ridge regression, when the reproducing kernel Hilbert space is equivalent to a Sobolev space. The theory covers a wide variety of linear functionals, including point evaluations, evaluation of derivatives, $L_2$ inner products, etc. We establish the upper and lower bounds of the estimates and their asymptotic normality. It is shown that $λ\sim n^{-1}$ is the universal optimal order of magnitude for the smoothing parameter to balance the variance and the worst-case bias. The theory also implies that the optimal $L_\infty$ error of kernel ridge regression can be attained under the optimal smoothing parameter $λ\sim n^{-1}\log n$. These optimal rates for the smoothing parameter differ from the known optimal rate $λ\sim n^{-\frac{2m}{2m+d}}$ that minimizes the $L_2$ error of the kernel ridge regression.
title Asymptotic Theory for Linear Functionals of Kernel Ridge Regression
topic Statistics Theory
url https://arxiv.org/abs/2403.04248