Saved in:
Bibliographic Details
Main Authors: Rügamer, David, Pfisterer, Florian, Bischl, Bernd, Grün, Bettina
Format: Preprint
Published: 2022
Subjects:
Online Access:https://arxiv.org/abs/2211.09875
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911531004854272
author Rügamer, David
Pfisterer, Florian
Bischl, Bernd
Grün, Bettina
author_facet Rügamer, David
Pfisterer, Florian
Bischl, Bernd
Grün, Bettina
contents In this work, we propose an efficient implementation of mixtures of experts distributional regression models which exploits robust estimation by using stochastic first-order optimization techniques with adaptive learning rate schedulers. We take advantage of the flexibility and scalability of neural network software and implement the proposed framework in mixdistreg, an R software package that allows for the definition of mixtures of many different families, estimation in high-dimensional and large sample size settings and robust optimization based on TensorFlow. Numerical experiments with simulated and real-world data applications show that optimization is as reliable as estimation via classical approaches in many different settings and that results may be obtained for complicated scenarios where classical approaches consistently fail.
format Preprint
id arxiv_https___arxiv_org_abs_2211_09875
institution arXiv
publishDate 2022
record_format arxiv
spellingShingle Mixture of Experts Distributional Regression: Implementation Using Robust Estimation with Adaptive First-order Methods
Rügamer, David
Pfisterer, Florian
Bischl, Bernd
Grün, Bettina
Computation
In this work, we propose an efficient implementation of mixtures of experts distributional regression models which exploits robust estimation by using stochastic first-order optimization techniques with adaptive learning rate schedulers. We take advantage of the flexibility and scalability of neural network software and implement the proposed framework in mixdistreg, an R software package that allows for the definition of mixtures of many different families, estimation in high-dimensional and large sample size settings and robust optimization based on TensorFlow. Numerical experiments with simulated and real-world data applications show that optimization is as reliable as estimation via classical approaches in many different settings and that results may be obtained for complicated scenarios where classical approaches consistently fail.
title Mixture of Experts Distributional Regression: Implementation Using Robust Estimation with Adaptive First-order Methods
topic Computation
url https://arxiv.org/abs/2211.09875