Saved in:
Bibliographic Details
Main Authors: Totounferoush, Amin, Kotchourko, Serge, Mahoney, Michael W., Staab, Steffen
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2503.19081
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910892177752064
author Totounferoush, Amin
Kotchourko, Serge
Mahoney, Michael W.
Staab, Steffen
author_facet Totounferoush, Amin
Kotchourko, Serge
Mahoney, Michael W.
Staab, Steffen
contents Partial differential equations (PDEs) govern a wide range of physical systems, but solving them efficiently remains a major challenge. The idea of a scientific foundation model (SciFM) is emerging as a promising tool for learning transferable representations across diverse domains. However, SciFMs require large amounts of solution data, which may be scarce or computationally expensive to generate. To maximize generalization while reducing data dependence, we propose incorporating PDE residuals into pre-training either as the sole learning signal or in combination with data loss to compensate for limited or infeasible training data. We evaluate this constraint-aware pre-training across three key benchmarks: (i) generalization to new physics, where material properties, e.g., the diffusion coefficient, is shifted with respect to the training distribution; (ii) generalization to entirely new PDEs, requiring adaptation to different operators; and (iii) robustness against noisy fine-tuning data, ensuring stability in real-world applications. Our results show that pre-training with PDE constraints significantly enhances generalization, outperforming models trained solely on solution data across all benchmarks. These findings prove the effectiveness of our proposed constraint-aware pre-training as a crucial component for SciFMs, providing a scalable approach to data-efficient, generalizable PDE solvers.
format Preprint
id arxiv_https___arxiv_org_abs_2503_19081
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Paving the way for scientific foundation models: enhancing generalization and robustness in PDEs with constraint-aware pre-training
Totounferoush, Amin
Kotchourko, Serge
Mahoney, Michael W.
Staab, Steffen
Machine Learning
Partial differential equations (PDEs) govern a wide range of physical systems, but solving them efficiently remains a major challenge. The idea of a scientific foundation model (SciFM) is emerging as a promising tool for learning transferable representations across diverse domains. However, SciFMs require large amounts of solution data, which may be scarce or computationally expensive to generate. To maximize generalization while reducing data dependence, we propose incorporating PDE residuals into pre-training either as the sole learning signal or in combination with data loss to compensate for limited or infeasible training data. We evaluate this constraint-aware pre-training across three key benchmarks: (i) generalization to new physics, where material properties, e.g., the diffusion coefficient, is shifted with respect to the training distribution; (ii) generalization to entirely new PDEs, requiring adaptation to different operators; and (iii) robustness against noisy fine-tuning data, ensuring stability in real-world applications. Our results show that pre-training with PDE constraints significantly enhances generalization, outperforming models trained solely on solution data across all benchmarks. These findings prove the effectiveness of our proposed constraint-aware pre-training as a crucial component for SciFMs, providing a scalable approach to data-efficient, generalizable PDE solvers.
title Paving the way for scientific foundation models: enhancing generalization and robustness in PDEs with constraint-aware pre-training
topic Machine Learning
url https://arxiv.org/abs/2503.19081