Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Li, Zhongyang, Yang, Sichen
Format: Preprint
Veröffentlicht: 2022
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2209.14859
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866917453384122368
author Li, Zhongyang
Yang, Sichen
author_facet Li, Zhongyang
Yang, Sichen
contents We study exact recovery for community detection in a Gaussian mixture model with dependent and heterogeneous Gaussian noise. The noise covariance matrix $Σ$ may be non-diagonal and, in the general formulation, singular. In the singular case, we write the Gaussian likelihood on the support of the induced measure and show that the maximum likelihood estimator (MLE) is a constrained quadratic optimization problem involving the Moore--Penrose inverse. For general covariance structures, we obtain sufficient conditions for exact recovery of the MLE when the community sizes are unknown and when they are known. These conditions are driven by the $Σ$-whitened separation $L_Σ(x,y)$ together with local one-step comparison inequalities in the near-truth regime. Under the additional assumption that $Σ$ is invertible, we derive converse results showing failure of exact recovery when a large family of local perturbations has sufficiently nondegenerate Gaussian comparison statistics. We then analyze a full-rank non-diagonal block-covariance model, prove a sharp exact-recovery threshold in the unknown-size setting, and identify a general no-gap mechanism under which the sufficient and necessary conditions coincide asymptotically.
format Preprint
id arxiv_https___arxiv_org_abs_2209_14859
institution arXiv
publishDate 2022
record_format arxiv
spellingShingle Exact Recovery of Community Detection in dependent Gaussian Mixture Models
Li, Zhongyang
Yang, Sichen
Statistics Theory
Probability
Machine Learning
91D30
We study exact recovery for community detection in a Gaussian mixture model with dependent and heterogeneous Gaussian noise. The noise covariance matrix $Σ$ may be non-diagonal and, in the general formulation, singular. In the singular case, we write the Gaussian likelihood on the support of the induced measure and show that the maximum likelihood estimator (MLE) is a constrained quadratic optimization problem involving the Moore--Penrose inverse. For general covariance structures, we obtain sufficient conditions for exact recovery of the MLE when the community sizes are unknown and when they are known. These conditions are driven by the $Σ$-whitened separation $L_Σ(x,y)$ together with local one-step comparison inequalities in the near-truth regime. Under the additional assumption that $Σ$ is invertible, we derive converse results showing failure of exact recovery when a large family of local perturbations has sufficiently nondegenerate Gaussian comparison statistics. We then analyze a full-rank non-diagonal block-covariance model, prove a sharp exact-recovery threshold in the unknown-size setting, and identify a general no-gap mechanism under which the sufficient and necessary conditions coincide asymptotically.
title Exact Recovery of Community Detection in dependent Gaussian Mixture Models
topic Statistics Theory
Probability
Machine Learning
91D30
url https://arxiv.org/abs/2209.14859