Enregistré dans:
Détails bibliographiques
Auteurs principaux: Yoon, Dongsun, Jung, Sungkyu
Format: Preprint
Publié: 2024
Sujets:
Accès en ligne:https://arxiv.org/abs/2411.15899
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866913981469294592
author Yoon, Dongsun
Jung, Sungkyu
author_facet Yoon, Dongsun
Jung, Sungkyu
contents We propose a novel estimator for the principal component (PC) subspace tailored to the high-dimension, low-sample size (HDLSS) context. The method, termed Adaptive Reference-Guided (ARG) estimator, is designed for data exhibiting spiked covariance structures and seeks to improve upon the conventional sample PC subspace by leveraging auxiliary information from reference vectors, presumed to carry prior knowledge about the true PC subspace. The estimator is constructed by first identifying vectors asymptotically orthogonal to the true PC subspace within a signal subspace, the subspace spanned by the leading sample PC directions and the references, and then taking the orthogonal complement. The estimator is adaptive, as it automatically selects the subspace asymptotically closest to the true PC subspace inside the signal subspace, without requiring parameter tuning. We show that when the reference vectors carry nontrivial information, the proposed estimator asymptotically reduces all principal angles between the estimated and true PC subspaces compared to the naive sample-based estimator. Interestingly, despite being derived from a completely different rationale, the ARG estimator is theoretically equivalent to an estimator based on James-Stein shrinkage. Our results thus establish a theoretical foundation that unifies these two distinct approaches.
format Preprint
id arxiv_https___arxiv_org_abs_2411_15899
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Adaptive Reference-Guided Estimation of Principal Component Subspace in High Dimensions
Yoon, Dongsun
Jung, Sungkyu
Statistics Theory
62H25
We propose a novel estimator for the principal component (PC) subspace tailored to the high-dimension, low-sample size (HDLSS) context. The method, termed Adaptive Reference-Guided (ARG) estimator, is designed for data exhibiting spiked covariance structures and seeks to improve upon the conventional sample PC subspace by leveraging auxiliary information from reference vectors, presumed to carry prior knowledge about the true PC subspace. The estimator is constructed by first identifying vectors asymptotically orthogonal to the true PC subspace within a signal subspace, the subspace spanned by the leading sample PC directions and the references, and then taking the orthogonal complement. The estimator is adaptive, as it automatically selects the subspace asymptotically closest to the true PC subspace inside the signal subspace, without requiring parameter tuning. We show that when the reference vectors carry nontrivial information, the proposed estimator asymptotically reduces all principal angles between the estimated and true PC subspaces compared to the naive sample-based estimator. Interestingly, despite being derived from a completely different rationale, the ARG estimator is theoretically equivalent to an estimator based on James-Stein shrinkage. Our results thus establish a theoretical foundation that unifies these two distinct approaches.
title Adaptive Reference-Guided Estimation of Principal Component Subspace in High Dimensions
topic Statistics Theory
62H25
url https://arxiv.org/abs/2411.15899