Saved in:
Bibliographic Details
Main Authors: Gallusser, Florian, Hentschel, Simon, Krause, Anna, Hotho, Andreas
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2505.02506
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908350405410816
author Gallusser, Florian
Hentschel, Simon
Krause, Anna
Hotho, Andreas
author_facet Gallusser, Florian
Hentschel, Simon
Krause, Anna
Hotho, Andreas
contents Deep Learning models have achieved state-of-the-art performance in medium-range weather prediction but often fail to maintain physically consistent rollouts beyond 14 days. In contrast, a few atmospheric models demonstrate stability over decades, though the key design choices enabling this remain unclear. This study quantitatively compares the long-term stability of three prominent DL-MWP architectures - FourCastNet, SFNO, and ClimaX - trained on ERA5 reanalysis data at 5.625° resolution. We systematically assess the impact of autoregressive training steps, model capacity, and choice of prognostic variables, identifying configurations that enable stable 10-year rollouts while preserving the statistical properties of the reference dataset. Notably, rollouts with SFNO exhibit the greatest robustness to hyperparameter choices, yet all models can experience instability depending on the random seed and the set of prognostic variables
format Preprint
id arxiv_https___arxiv_org_abs_2505_02506
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Exploring Design Choices for Autoregressive Deep Learning Climate Models
Gallusser, Florian
Hentschel, Simon
Krause, Anna
Hotho, Andreas
Machine Learning
Deep Learning models have achieved state-of-the-art performance in medium-range weather prediction but often fail to maintain physically consistent rollouts beyond 14 days. In contrast, a few atmospheric models demonstrate stability over decades, though the key design choices enabling this remain unclear. This study quantitatively compares the long-term stability of three prominent DL-MWP architectures - FourCastNet, SFNO, and ClimaX - trained on ERA5 reanalysis data at 5.625° resolution. We systematically assess the impact of autoregressive training steps, model capacity, and choice of prognostic variables, identifying configurations that enable stable 10-year rollouts while preserving the statistical properties of the reference dataset. Notably, rollouts with SFNO exhibit the greatest robustness to hyperparameter choices, yet all models can experience instability depending on the random seed and the set of prognostic variables
title Exploring Design Choices for Autoregressive Deep Learning Climate Models
topic Machine Learning
url https://arxiv.org/abs/2505.02506