Saved in:
Bibliographic Details
Main Authors: Gerard, David, Stephens, Matthew
Format: Preprint
Published: 2017
Subjects:
Online Access:https://arxiv.org/abs/1709.10066
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911407472115712
author Gerard, David
Stephens, Matthew
author_facet Gerard, David
Stephens, Matthew
contents We combine two important ideas in the analysis of large-scale genomics experiments (e.g. experiments that aim to identify genes that are differentially expressed between two conditions). The first is use of Empirical Bayes (EB) methods to handle the large number of potentially-sparse effects, and estimate false discovery rates and related quantities. The second is use of factor analysis methods to deal with sources of unwanted variation such as batch effects and unmeasured confounders. We describe a simple modular fitting procedure that combines key ideas from both these lines of research. This yields new, powerful EB methods for analyzing genomics experiments that account for both sparse effects and unwanted variation. In realistic simulations, these new methods provide significant gains in power and calibration over competing methods. In real data analysis we find that different methods, while often conceptually similar, can vary widely in their assessments of statistical significance. This highlights the need for care in both choice of methods and interpretation of results. All methods introduced in this paper are implemented in the R package vicar available at https://github.com/dcgerard/vicar .
format Preprint
id arxiv_https___arxiv_org_abs_1709_10066
institution arXiv
publishDate 2017
record_format arxiv
spellingShingle Empirical Bayes Shrinkage and False Discovery Rate Estimation, Allowing For Unwanted Variation
Gerard, David
Stephens, Matthew
Methodology
We combine two important ideas in the analysis of large-scale genomics experiments (e.g. experiments that aim to identify genes that are differentially expressed between two conditions). The first is use of Empirical Bayes (EB) methods to handle the large number of potentially-sparse effects, and estimate false discovery rates and related quantities. The second is use of factor analysis methods to deal with sources of unwanted variation such as batch effects and unmeasured confounders. We describe a simple modular fitting procedure that combines key ideas from both these lines of research. This yields new, powerful EB methods for analyzing genomics experiments that account for both sparse effects and unwanted variation. In realistic simulations, these new methods provide significant gains in power and calibration over competing methods. In real data analysis we find that different methods, while often conceptually similar, can vary widely in their assessments of statistical significance. This highlights the need for care in both choice of methods and interpretation of results. All methods introduced in this paper are implemented in the R package vicar available at https://github.com/dcgerard/vicar .
title Empirical Bayes Shrinkage and False Discovery Rate Estimation, Allowing For Unwanted Variation
topic Methodology
url https://arxiv.org/abs/1709.10066