Saved in:
Bibliographic Details
Main Authors: Wang, Shijie, Chakraborty, Saptarshi, Qin, Qian, Bai, Ray
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2406.05986
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914830558953472
author Wang, Shijie
Chakraborty, Saptarshi
Qin, Qian
Bai, Ray
author_facet Wang, Shijie
Chakraborty, Saptarshi
Qin, Qian
Bai, Ray
contents Mixing (or prior) density estimation is an important problem in machine learning and statistics, especially in empirical Bayes $g$-modeling where accurately estimating the prior is necessary for making good posterior inferences. In this paper, we propose neural-$g$, a new neural network-based estimator for $g$-modeling. Neural-$g$ uses a softmax output layer to ensure that the estimated prior is a valid probability density. Under default hyperparameters, we show that neural-$g$ is very flexible and capable of capturing many unknown densities, including those with flat regions, heavy tails, and/or discontinuities. In contrast, existing methods struggle to capture all of these prior shapes. We provide justification for neural-$g$ by establishing a new universal approximation theorem regarding the capability of neural networks to learn arbitrary probability mass functions. To accelerate convergence of our numerical implementation, we utilize a weighted average gradient descent approach to update the network parameters. Finally, we extend neural-$g$ to multivariate prior density estimation. We illustrate the efficacy of our approach through simulations and analyses of real datasets. A software package to implement neural-$g$ is publicly available at https://github.com/shijiew97/neuralG.
format Preprint
id arxiv_https___arxiv_org_abs_2406_05986
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Neural-g: A Deep Learning Framework for Mixing Density Estimation
Wang, Shijie
Chakraborty, Saptarshi
Qin, Qian
Bai, Ray
Machine Learning
Mixing (or prior) density estimation is an important problem in machine learning and statistics, especially in empirical Bayes $g$-modeling where accurately estimating the prior is necessary for making good posterior inferences. In this paper, we propose neural-$g$, a new neural network-based estimator for $g$-modeling. Neural-$g$ uses a softmax output layer to ensure that the estimated prior is a valid probability density. Under default hyperparameters, we show that neural-$g$ is very flexible and capable of capturing many unknown densities, including those with flat regions, heavy tails, and/or discontinuities. In contrast, existing methods struggle to capture all of these prior shapes. We provide justification for neural-$g$ by establishing a new universal approximation theorem regarding the capability of neural networks to learn arbitrary probability mass functions. To accelerate convergence of our numerical implementation, we utilize a weighted average gradient descent approach to update the network parameters. Finally, we extend neural-$g$ to multivariate prior density estimation. We illustrate the efficacy of our approach through simulations and analyses of real datasets. A software package to implement neural-$g$ is publicly available at https://github.com/shijiew97/neuralG.
title Neural-g: A Deep Learning Framework for Mixing Density Estimation
topic Machine Learning
url https://arxiv.org/abs/2406.05986