Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.08457 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1866911837622108160 |
|---|---|
| author | Shi, Jiaxin Gao, Yuan Pan, Rui Wang, Hansheng |
| author_facet | Shi, Jiaxin Gao, Yuan Pan, Rui Wang, Hansheng |
| contents | In this study, we develop a latent factor model for analysing high-dimensional binary data. Specifically, a standard probit model is used to describe the regression relationship between the observed binary data and the continuous latent variables. Our method assumes that the dependency structure of the observed binary data can be fully captured by the continuous latent factors. To estimate the model, a moment-based estimation method is developed. The proposed method is able to deal with both discontinuity and high dimensionality. Most importantly, the asymptotic properties of the resulting estimators are rigorously established. Extensive simulation studies are presented to demonstrate the proposed methodology. A real dataset about product descriptions is analysed for illustration. |
| format | Preprint |
| id |
arxiv_https___arxiv_org_abs_2404_08457 |
| institution | arXiv |
| publishDate | 2024 |
| record_format | arxiv |
| spellingShingle | A Latent Factor Model for High-Dimensional Binary Data Shi, Jiaxin Gao, Yuan Pan, Rui Wang, Hansheng Methodology In this study, we develop a latent factor model for analysing high-dimensional binary data. Specifically, a standard probit model is used to describe the regression relationship between the observed binary data and the continuous latent variables. Our method assumes that the dependency structure of the observed binary data can be fully captured by the continuous latent factors. To estimate the model, a moment-based estimation method is developed. The proposed method is able to deal with both discontinuity and high dimensionality. Most importantly, the asymptotic properties of the resulting estimators are rigorously established. Extensive simulation studies are presented to demonstrate the proposed methodology. A real dataset about product descriptions is analysed for illustration. |
| title | A Latent Factor Model for High-Dimensional Binary Data |
| topic | Methodology |
| url | https://arxiv.org/abs/2404.08457 |