Saved in:
Bibliographic Details
Main Authors: Manzhos, Sergei, Ihara, Manabu
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2509.08457
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915693760348160
author Manzhos, Sergei
Ihara, Manabu
author_facet Manzhos, Sergei
Ihara, Manabu
contents Recently, a Gaussian Process Regression - neural network (GPRNN) hybrid machine learning method was proposed, which is based on additive-kernel GPR in redundant coordinates constructed by rules [J. Phys. Chem. A 127 (2023) 7823]. The method combined the expressive power of an NN with the robustness of linear regression, in particular, with respect to overfitting when the number of neurons is increased beyond optimal. We introduce opt-GPRNN, in which the redundant coordinates of GPRNN are optimized with a Monte Carlo algorithm and show that when combined with optimization of redundant coordinates, GPRNN attains the lowest test set error with much fewer terms / neurons and retains the advantage of avoiding overfitting when the number of neurons is increased beyond optimal value. The method, opt-GPRNN possesses an expressive power closer to that of a multilayer NN and could obviate the need for deep NNs in some applications. With optimized redundant coordinates, a dimensionality reduction regime is also possible. Examples of application to machine learning an interatomic potential and materials informatics are given.
format Preprint
id arxiv_https___arxiv_org_abs_2509_08457
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Gaussian Process Regression -- Neural Network Hybrid with Optimized Redundant Coordinates
Manzhos, Sergei
Ihara, Manabu
Machine Learning
Recently, a Gaussian Process Regression - neural network (GPRNN) hybrid machine learning method was proposed, which is based on additive-kernel GPR in redundant coordinates constructed by rules [J. Phys. Chem. A 127 (2023) 7823]. The method combined the expressive power of an NN with the robustness of linear regression, in particular, with respect to overfitting when the number of neurons is increased beyond optimal. We introduce opt-GPRNN, in which the redundant coordinates of GPRNN are optimized with a Monte Carlo algorithm and show that when combined with optimization of redundant coordinates, GPRNN attains the lowest test set error with much fewer terms / neurons and retains the advantage of avoiding overfitting when the number of neurons is increased beyond optimal value. The method, opt-GPRNN possesses an expressive power closer to that of a multilayer NN and could obviate the need for deep NNs in some applications. With optimized redundant coordinates, a dimensionality reduction regime is also possible. Examples of application to machine learning an interatomic potential and materials informatics are given.
title Gaussian Process Regression -- Neural Network Hybrid with Optimized Redundant Coordinates
topic Machine Learning
url https://arxiv.org/abs/2509.08457