Saved in:
Bibliographic Details
Main Authors: Dunbar, Oliver R. A., Nelsen, Nicholas H., Mutic, Maya
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.00584
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916629531590656
author Dunbar, Oliver R. A.
Nelsen, Nicholas H.
Mutic, Maya
author_facet Dunbar, Oliver R. A.
Nelsen, Nicholas H.
Mutic, Maya
contents Randomized algorithms exploit stochasticity to reduce computational complexity. One important example is random feature regression (RFR) that accelerates Gaussian process regression (GPR). RFR approximates an unknown function with a random neural network whose hidden weights and biases are sampled from a probability distribution. Only the final output layer is fit to data. In randomized algorithms like RFR, the hyperparameters that characterize the sampling distribution greatly impact performance, yet are not directly accessible from samples. This makes optimization of hyperparameters via standard (gradient-based) optimization tools inapplicable. Inspired by Bayesian ideas from GPR, this paper introduces a random objective function that is tailored for hyperparameter tuning of vector-valued random features. The objective is minimized with ensemble Kalman inversion (EKI). EKI is a gradient-free particle-based optimizer that is scalable to high-dimensions and robust to randomness in objective functions. A numerical study showcases the new black-box methodology to learn hyperparameter distributions in several problems that are sensitive to the hyperparameter selection: two global sensitivity analyses, integrating a chaotic dynamical system, and solving a Bayesian inverse problem from atmospheric dynamics. The success of the proposed EKI-based algorithm for RFR suggests its potential for automated optimization of hyperparameters arising in other randomized algorithms.
format Preprint
id arxiv_https___arxiv_org_abs_2407_00584
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Hyperparameter Optimization for Randomized Algorithms: A Case Study on Random Features
Dunbar, Oliver R. A.
Nelsen, Nicholas H.
Mutic, Maya
Machine Learning
Computation
Randomized algorithms exploit stochasticity to reduce computational complexity. One important example is random feature regression (RFR) that accelerates Gaussian process regression (GPR). RFR approximates an unknown function with a random neural network whose hidden weights and biases are sampled from a probability distribution. Only the final output layer is fit to data. In randomized algorithms like RFR, the hyperparameters that characterize the sampling distribution greatly impact performance, yet are not directly accessible from samples. This makes optimization of hyperparameters via standard (gradient-based) optimization tools inapplicable. Inspired by Bayesian ideas from GPR, this paper introduces a random objective function that is tailored for hyperparameter tuning of vector-valued random features. The objective is minimized with ensemble Kalman inversion (EKI). EKI is a gradient-free particle-based optimizer that is scalable to high-dimensions and robust to randomness in objective functions. A numerical study showcases the new black-box methodology to learn hyperparameter distributions in several problems that are sensitive to the hyperparameter selection: two global sensitivity analyses, integrating a chaotic dynamical system, and solving a Bayesian inverse problem from atmospheric dynamics. The success of the proposed EKI-based algorithm for RFR suggests its potential for automated optimization of hyperparameters arising in other randomized algorithms.
title Hyperparameter Optimization for Randomized Algorithms: A Case Study on Random Features
topic Machine Learning
Computation
url https://arxiv.org/abs/2407.00584