Saved in:
Bibliographic Details
Main Authors: De Luca, G. Bruno, Nachman, Benjamin, Silverstein, Eva, Zheng, Henry
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2501.18419
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910853379391488
author De Luca, G. Bruno
Nachman, Benjamin
Silverstein, Eva
Zheng, Henry
author_facet De Luca, G. Bruno
Nachman, Benjamin
Silverstein, Eva
Zheng, Henry
contents A growing number of applications in particle physics and beyond use neural networks as unbinned likelihood ratio estimators applied to real or simulated data. Precision requirements on the inference tasks demand a high-level of stability from these networks, which are affected by the stochastic nature of training. We show how physics concepts can be used to stabilize network training through a physics-inspired optimizer. In particular, the Energy Conserving Descent (ECD) optimization framework uses classical Hamiltonian dynamics on the space of network parameters to reduce the dependence on the initial conditions while also stabilizing the result near the minimum of the loss function. We develop a version of this optimizer known as $ECD_{q=1}$, which has few free hyperparameters with limited ranges guided by physical reasoning. We apply $ECD_{q=1}$ to representative likelihood-ratio estimation tasks in particle physics and find that it out-performs the widely-used Adam optimizer. We expect that ECD will be a useful tool for wide array of data-limited problems, where it is computationally expensive to exhaustively optimize hyperparameters and mitigate fluctuations with ensembling.
format Preprint
id arxiv_https___arxiv_org_abs_2501_18419
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Optimizers for Stabilizing Likelihood-free Inference
De Luca, G. Bruno
Nachman, Benjamin
Silverstein, Eva
Zheng, Henry
High Energy Physics - Phenomenology
High Energy Physics - Theory
A growing number of applications in particle physics and beyond use neural networks as unbinned likelihood ratio estimators applied to real or simulated data. Precision requirements on the inference tasks demand a high-level of stability from these networks, which are affected by the stochastic nature of training. We show how physics concepts can be used to stabilize network training through a physics-inspired optimizer. In particular, the Energy Conserving Descent (ECD) optimization framework uses classical Hamiltonian dynamics on the space of network parameters to reduce the dependence on the initial conditions while also stabilizing the result near the minimum of the loss function. We develop a version of this optimizer known as $ECD_{q=1}$, which has few free hyperparameters with limited ranges guided by physical reasoning. We apply $ECD_{q=1}$ to representative likelihood-ratio estimation tasks in particle physics and find that it out-performs the widely-used Adam optimizer. We expect that ECD will be a useful tool for wide array of data-limited problems, where it is computationally expensive to exhaustively optimize hyperparameters and mitigate fluctuations with ensembling.
title Optimizers for Stabilizing Likelihood-free Inference
topic High Energy Physics - Phenomenology
High Energy Physics - Theory
url https://arxiv.org/abs/2501.18419