Saved in:
Bibliographic Details
Main Authors: Rathnakumar, Rahul, Huang, Jiayu, Yan, Hao, Liu, Yongming
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.01015
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914853209243648
author Rathnakumar, Rahul
Huang, Jiayu
Yan, Hao
Liu, Yongming
author_facet Rathnakumar, Rahul
Huang, Jiayu
Yan, Hao
Liu, Yongming
contents This paper addresses the need for deep learning models to integrate well-defined constraints into their outputs, driven by their application in surrogate models, learning with limited data and partial information, and scenarios requiring flexible model behavior to incorporate non-data sample information. We introduce Bayesian Entropy Neural Networks (BENN), a framework grounded in Maximum Entropy (MaxEnt) principles, designed to impose constraints on Bayesian Neural Network (BNN) predictions. BENN is capable of constraining not only the predicted values but also their derivatives and variances, ensuring a more robust and reliable model output. To achieve simultaneous uncertainty quantification and constraint satisfaction, we employ the method of multipliers approach. This allows for the concurrent estimation of neural network parameters and the Lagrangian multipliers associated with the constraints. Our experiments, spanning diverse applications such as beam deflection modeling and microstructure generation, demonstrate the effectiveness of BENN. The results highlight significant improvements over traditional BNNs and showcase competitive performance relative to contemporary constrained deep learning methods.
format Preprint
id arxiv_https___arxiv_org_abs_2407_01015
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Bayesian Entropy Neural Networks for Physics-Aware Prediction
Rathnakumar, Rahul
Huang, Jiayu
Yan, Hao
Liu, Yongming
Machine Learning
I.5.1
This paper addresses the need for deep learning models to integrate well-defined constraints into their outputs, driven by their application in surrogate models, learning with limited data and partial information, and scenarios requiring flexible model behavior to incorporate non-data sample information. We introduce Bayesian Entropy Neural Networks (BENN), a framework grounded in Maximum Entropy (MaxEnt) principles, designed to impose constraints on Bayesian Neural Network (BNN) predictions. BENN is capable of constraining not only the predicted values but also their derivatives and variances, ensuring a more robust and reliable model output. To achieve simultaneous uncertainty quantification and constraint satisfaction, we employ the method of multipliers approach. This allows for the concurrent estimation of neural network parameters and the Lagrangian multipliers associated with the constraints. Our experiments, spanning diverse applications such as beam deflection modeling and microstructure generation, demonstrate the effectiveness of BENN. The results highlight significant improvements over traditional BNNs and showcase competitive performance relative to contemporary constrained deep learning methods.
title Bayesian Entropy Neural Networks for Physics-Aware Prediction
topic Machine Learning
I.5.1
url https://arxiv.org/abs/2407.01015