Salvato in:
Dettagli Bibliografici
Autori principali: Ye, Hanxuan, Qian, Zachary, Li, Hongzhe
Natura: Preprint
Pubblicazione: 2025
Soggetti:
Accesso online:https://arxiv.org/abs/2510.24948
Tags: Aggiungi Tag
Nessun Tag, puoi essere il primo ad aggiungerne!!
_version_ 1866918177543290880
author Ye, Hanxuan
Qian, Zachary
Li, Hongzhe
author_facet Ye, Hanxuan
Qian, Zachary
Li, Hongzhe
contents Recent advances in high-resolution sequencing have paved the way for population-scale analysis in single-cell RNA-sequencing (scRNA-seq) data. scRNA-seq data, in particular, have proven to be extremely powerful in profiling a variety of outcomes such as disease and aging. The abundance of scRNA-seq data makes it possible to model each individual's gene expression as a probability density across cells, offering a richer representation than summary statistics such as means or variances, and allowing for more nuanced group comparisons. To this end, we propose a model-agnostic framework for density estimation and inference based on specially designed exponential families~(SEF), which accommodates diverse underlying models without requiring prior specifications. The proposed method enables estimation and visualization for both individual-specific and group-level gene expression densities, as well as conducting formal hypothesis testing for expression density difference across groups of interest. It relies on relaxed assumptions with established asymptotic properties and a consistent covariance estimator for valid inference. Through simulation under various scenarios, the SEF-based approach demonstrates good error control and improved statistical power over competing methods,including pseudo-bulk tests and moment estimators. Application to a population-scale scRNA-seq dataset from patients with systemic lupus erythematosus identified genes and gene sets that are missed from pseudo-bulk based tests.
format Preprint
id arxiv_https___arxiv_org_abs_2510_24948
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Differential Density Analysis in Single-Cell Genomics Using Specially Designed Exponential Families
Ye, Hanxuan
Qian, Zachary
Li, Hongzhe
Methodology
Recent advances in high-resolution sequencing have paved the way for population-scale analysis in single-cell RNA-sequencing (scRNA-seq) data. scRNA-seq data, in particular, have proven to be extremely powerful in profiling a variety of outcomes such as disease and aging. The abundance of scRNA-seq data makes it possible to model each individual's gene expression as a probability density across cells, offering a richer representation than summary statistics such as means or variances, and allowing for more nuanced group comparisons. To this end, we propose a model-agnostic framework for density estimation and inference based on specially designed exponential families~(SEF), which accommodates diverse underlying models without requiring prior specifications. The proposed method enables estimation and visualization for both individual-specific and group-level gene expression densities, as well as conducting formal hypothesis testing for expression density difference across groups of interest. It relies on relaxed assumptions with established asymptotic properties and a consistent covariance estimator for valid inference. Through simulation under various scenarios, the SEF-based approach demonstrates good error control and improved statistical power over competing methods,including pseudo-bulk tests and moment estimators. Application to a population-scale scRNA-seq dataset from patients with systemic lupus erythematosus identified genes and gene sets that are missed from pseudo-bulk based tests.
title Differential Density Analysis in Single-Cell Genomics Using Specially Designed Exponential Families
topic Methodology
url https://arxiv.org/abs/2510.24948