Affichage MARC: :: Library Catalog

Enregistré dans:

Détails bibliographiques
Auteurs principaux:	Tsitsvero, Mikhail, Jin, Mingoo, Lyalin, Andrey
Format:	Preprint
Publié:	2022
Sujets:	Chemical Physics Machine Learning Computational Physics 60G15, 60-08, 68-04, 68T99, 92E99
Accès en ligne:	https://arxiv.org/abs/2207.07654
Tags:	Ajouter un tag Pas de tags, Soyez le premier à ajouter un tag!

_version_	1866911779789996032
author	Tsitsvero, Mikhail Jin, Mingoo Lyalin, Andrey
author_facet	Tsitsvero, Mikhail Jin, Mingoo Lyalin, Andrey
contents	Uncertainty control and scalability to large datasets are the two main issues for the deployment of Gaussian process (GP) models within the autonomous machine learning-based prediction pipelines in material science and chemistry. One way to address both of these issues is by introducing the latent inducing point variables and choosing the right approximation for the marginal log-likelihood objective. Here, we empirically show that variational learning of the inducing points in a molecular descriptor space improves the prediction of energies and atomic forces on two molecular dynamics datasets. First, we show that variational GPs can learn to represent the configurations of the molecules of different types that were not present within the initialization set of configurations. We provide a comparison of alternative log-likelihood training objectives and variational distributions. Among several evaluated approximate marginal log-likelihood objectives, we show that predictive log-likelihood provides excellent uncertainty estimates at the slight expense of predictive quality. Furthermore, we extend our study to a large molecular crystal system, showing that variational GP models perform well for predicting atomic forces by efficiently learning a sparse representation of the dataset.
format	Preprint
id	arxiv_https___arxiv_org_abs_2207_07654
institution	arXiv
publishDate	2022
record_format	arxiv
spellingShingle	Learning inducing points and uncertainty on molecular data by scalable variational Gaussian processes Tsitsvero, Mikhail Jin, Mingoo Lyalin, Andrey Chemical Physics Machine Learning Computational Physics 60G15, 60-08, 68-04, 68T99, 92E99 Uncertainty control and scalability to large datasets are the two main issues for the deployment of Gaussian process (GP) models within the autonomous machine learning-based prediction pipelines in material science and chemistry. One way to address both of these issues is by introducing the latent inducing point variables and choosing the right approximation for the marginal log-likelihood objective. Here, we empirically show that variational learning of the inducing points in a molecular descriptor space improves the prediction of energies and atomic forces on two molecular dynamics datasets. First, we show that variational GPs can learn to represent the configurations of the molecules of different types that were not present within the initialization set of configurations. We provide a comparison of alternative log-likelihood training objectives and variational distributions. Among several evaluated approximate marginal log-likelihood objectives, we show that predictive log-likelihood provides excellent uncertainty estimates at the slight expense of predictive quality. Furthermore, we extend our study to a large molecular crystal system, showing that variational GP models perform well for predicting atomic forces by efficiently learning a sparse representation of the dataset.
title	Learning inducing points and uncertainty on molecular data by scalable variational Gaussian processes
topic	Chemical Physics Machine Learning Computational Physics 60G15, 60-08, 68-04, 68T99, 92E99
url	https://arxiv.org/abs/2207.07654

Documents similaires