Saved in:
Bibliographic Details
Main Authors: Clinch, Madelyn, Bradley, Jonathan R.
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.02655
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913717325660160
author Clinch, Madelyn
Bradley, Jonathan R.
author_facet Clinch, Madelyn
Bradley, Jonathan R.
contents Fine particulate matter and aerosol optical thickness are of interest to atmospheric scientists for understanding air quality and its various health/environmental impacts. The available data are extremely large, making uncertainty quantification in a fully Bayesian framework quite difficult, as traditional implementations do not scale reasonably to the size of the data. We specifically consider roughly 8 million observations obtained from NASA's Moderate Resolution Imaging Spectroradiometer (MODIS) instrument. To analyze data on this scale, we introduce Scalable Multivariate Exact Posterior Regression (SM-EPR) which combines the recently introduced data subset approach and Exact Posterior Regression (EPR). EPR is a new Bayesian hierarchical model where it is possible to sample independent replicates of fixed and random effects directly from the posterior without the use of Markov chain Monte Carlo (MCMC). We extend EPR to the multivariate spatial context, where the multiple variables may be distributed according to different distributions. The combination of the data subset approach with EPR allows one to perform exact Bayesian inference without MCMC for effectively any sample size. Additional motivation is provided via technical results illustrating favorable Kullback-Leibler and covariance properties. We demonstrate SM-EPR using a motivating big remote sensing data application and provide several simulations.
format Preprint
id arxiv_https___arxiv_org_abs_2410_02655
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Exact Bayesian Inference for Multivariate Spatial Data of Any Size with Application to Air Pollution Monitoring
Clinch, Madelyn
Bradley, Jonathan R.
Methodology
Fine particulate matter and aerosol optical thickness are of interest to atmospheric scientists for understanding air quality and its various health/environmental impacts. The available data are extremely large, making uncertainty quantification in a fully Bayesian framework quite difficult, as traditional implementations do not scale reasonably to the size of the data. We specifically consider roughly 8 million observations obtained from NASA's Moderate Resolution Imaging Spectroradiometer (MODIS) instrument. To analyze data on this scale, we introduce Scalable Multivariate Exact Posterior Regression (SM-EPR) which combines the recently introduced data subset approach and Exact Posterior Regression (EPR). EPR is a new Bayesian hierarchical model where it is possible to sample independent replicates of fixed and random effects directly from the posterior without the use of Markov chain Monte Carlo (MCMC). We extend EPR to the multivariate spatial context, where the multiple variables may be distributed according to different distributions. The combination of the data subset approach with EPR allows one to perform exact Bayesian inference without MCMC for effectively any sample size. Additional motivation is provided via technical results illustrating favorable Kullback-Leibler and covariance properties. We demonstrate SM-EPR using a motivating big remote sensing data application and provide several simulations.
title Exact Bayesian Inference for Multivariate Spatial Data of Any Size with Application to Air Pollution Monitoring
topic Methodology
url https://arxiv.org/abs/2410.02655