Saved in:
Bibliographic Details
Main Authors: Shi, Jiaxin, Gao, Yuan, Pan, Rui, Wang, Hansheng
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.08457
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911837622108160
author Shi, Jiaxin
Gao, Yuan
Pan, Rui
Wang, Hansheng
author_facet Shi, Jiaxin
Gao, Yuan
Pan, Rui
Wang, Hansheng
contents In this study, we develop a latent factor model for analysing high-dimensional binary data. Specifically, a standard probit model is used to describe the regression relationship between the observed binary data and the continuous latent variables. Our method assumes that the dependency structure of the observed binary data can be fully captured by the continuous latent factors. To estimate the model, a moment-based estimation method is developed. The proposed method is able to deal with both discontinuity and high dimensionality. Most importantly, the asymptotic properties of the resulting estimators are rigorously established. Extensive simulation studies are presented to demonstrate the proposed methodology. A real dataset about product descriptions is analysed for illustration.
format Preprint
id arxiv_https___arxiv_org_abs_2404_08457
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle A Latent Factor Model for High-Dimensional Binary Data
Shi, Jiaxin
Gao, Yuan
Pan, Rui
Wang, Hansheng
Methodology
In this study, we develop a latent factor model for analysing high-dimensional binary data. Specifically, a standard probit model is used to describe the regression relationship between the observed binary data and the continuous latent variables. Our method assumes that the dependency structure of the observed binary data can be fully captured by the continuous latent factors. To estimate the model, a moment-based estimation method is developed. The proposed method is able to deal with both discontinuity and high dimensionality. Most importantly, the asymptotic properties of the resulting estimators are rigorously established. Extensive simulation studies are presented to demonstrate the proposed methodology. A real dataset about product descriptions is analysed for illustration.
title A Latent Factor Model for High-Dimensional Binary Data
topic Methodology
url https://arxiv.org/abs/2404.08457