Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Tian, Ye, Wu, Sanyou, Feng, Long
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2503.21608
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866908287117557760
author	Tian, Ye Wu, Sanyou Feng, Long
author_facet	Tian, Ye Wu, Sanyou Feng, Long
contents	Identifying low-dimensional latent structures within high-dimensional data has long been a central topic in the machine learning community, driven by the need for data compression, storage, transmission, and deeper data understanding. Traditional methods, such as principal component analysis (PCA) and autoencoders (AE), operate in an unsupervised manner, ignoring label information even when it is available. In this work, we introduce a unified method capable of learning latent spaces in both unsupervised and supervised settings. We formulate the problem as a nonlinear multiple-response regression within an index model context. By applying the generalized Stein's lemma, the latent space can be estimated without knowing the nonlinear link functions. Our method can be viewed as a nonlinear generalization of PCA. Moreover, unlike AE and other neural network methods that operate as "black boxes", our approach not only offers better interpretability but also reduces computational complexity while providing strong theoretical guarantees. Comprehensive numerical experiments and real data analyses demonstrate the superior performance of our method.
format	Preprint
id	arxiv_https___arxiv_org_abs_2503_21608
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Nonlinear Multiple Response Regression and Learning of Latent Spaces Tian, Ye Wu, Sanyou Feng, Long Machine Learning Identifying low-dimensional latent structures within high-dimensional data has long been a central topic in the machine learning community, driven by the need for data compression, storage, transmission, and deeper data understanding. Traditional methods, such as principal component analysis (PCA) and autoencoders (AE), operate in an unsupervised manner, ignoring label information even when it is available. In this work, we introduce a unified method capable of learning latent spaces in both unsupervised and supervised settings. We formulate the problem as a nonlinear multiple-response regression within an index model context. By applying the generalized Stein's lemma, the latent space can be estimated without knowing the nonlinear link functions. Our method can be viewed as a nonlinear generalization of PCA. Moreover, unlike AE and other neural network methods that operate as "black boxes", our approach not only offers better interpretability but also reduces computational complexity while providing strong theoretical guarantees. Comprehensive numerical experiments and real data analyses demonstrate the superior performance of our method.
title	Nonlinear Multiple Response Regression and Learning of Latent Spaces
topic	Machine Learning
url	https://arxiv.org/abs/2503.21608

Similar Items