Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Madhavan, Varun, Sebastian, Amal S, Ramsundar, Bharath, Viswanathan, Venkatasubramanian
Format: Preprint
Veröffentlicht: 2024
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2407.06209
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866909247136071680
author Madhavan, Varun
Sebastian, Amal S
Ramsundar, Bharath
Viswanathan, Venkatasubramanian
author_facet Madhavan, Varun
Sebastian, Amal S
Ramsundar, Bharath
Viswanathan, Venkatasubramanian
contents In this work, we describe a novel approach to building a neural PDE solver leveraging recent advances in transformer based neural network architectures. Our model can provide solutions for different values of PDE parameters without any need for retraining the network. The training is carried out in a self-supervised manner, similar to pretraining approaches applied in language and vision tasks. We hypothesize that the model is in effect learning a family of operators (for multiple parameters) mapping the initial condition to the solution of the PDE at any future time step t. We compare this approach with the Fourier Neural Operator (FNO), and demonstrate that it can generalize over the space of PDE parameters, despite having a higher prediction error for individual parameter values compared to the FNO. We show that performance on a specific parameter can be improved by finetuning the model with very small amounts of data. We also demonstrate that the model scales with data as well as model size.
format Preprint
id arxiv_https___arxiv_org_abs_2407_06209
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Self-supervised Pretraining for Partial Differential Equations
Madhavan, Varun
Sebastian, Amal S
Ramsundar, Bharath
Viswanathan, Venkatasubramanian
Machine Learning
In this work, we describe a novel approach to building a neural PDE solver leveraging recent advances in transformer based neural network architectures. Our model can provide solutions for different values of PDE parameters without any need for retraining the network. The training is carried out in a self-supervised manner, similar to pretraining approaches applied in language and vision tasks. We hypothesize that the model is in effect learning a family of operators (for multiple parameters) mapping the initial condition to the solution of the PDE at any future time step t. We compare this approach with the Fourier Neural Operator (FNO), and demonstrate that it can generalize over the space of PDE parameters, despite having a higher prediction error for individual parameter values compared to the FNO. We show that performance on a specific parameter can be improved by finetuning the model with very small amounts of data. We also demonstrate that the model scales with data as well as model size.
title Self-supervised Pretraining for Partial Differential Equations
topic Machine Learning
url https://arxiv.org/abs/2407.06209