Saved in:
Bibliographic Details
Main Authors: Li, Shixuan, Yang, Wei, Zhang, Peiyu, Xiao, Xiongye, Cao, Defu, Qin, Yuehan, Zhang, Xiaole, Zhao, Yue, Bogdan, Paul
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.11059
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915154211373056
author Li, Shixuan
Yang, Wei
Zhang, Peiyu
Xiao, Xiongye
Cao, Defu
Qin, Yuehan
Zhang, Xiaole
Zhao, Yue
Bogdan, Paul
author_facet Li, Shixuan
Yang, Wei
Zhang, Peiyu
Xiao, Xiongye
Cao, Defu
Qin, Yuehan
Zhang, Xiaole
Zhao, Yue
Bogdan, Paul
contents Weather forecasting is crucial for public safety, disaster prevention and mitigation, agricultural production, and energy management, with global relevance. Although deep learning has significantly advanced weather prediction, current methods face critical limitations: (i) they often struggle to capture both dynamic temporal dependencies and short-term abrupt changes, making extreme weather modeling difficult; (ii) they incur high computational costs due to extensive training and resource requirements; (iii) they have limited adaptability to multi-scale frequencies, leading to challenges when separating global trends from local fluctuations. To address these issues, we propose ClimateLLM, a foundation model for weather forecasting. It captures spatiotemporal dependencies via a cross-temporal and cross-spatial collaborative modeling framework that integrates Fourier-based frequency decomposition with Large Language Models (LLMs) to strengthen spatial and temporal modeling. Our framework uses a Mixture-of-Experts (MoE) mechanism that adaptively processes different frequency components, enabling efficient handling of both global signals and localized extreme events. In addition, we introduce a cross-temporal and cross-spatial dynamic prompting mechanism, allowing LLMs to incorporate meteorological patterns across multiple scales effectively. Extensive experiments on real-world datasets show that ClimateLLM outperforms state-of-the-art approaches in accuracy and efficiency, as a scalable solution for global weather forecasting.
format Preprint
id arxiv_https___arxiv_org_abs_2502_11059
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle ClimateLLM: Efficient Weather Forecasting via Frequency-Aware Large Language Models
Li, Shixuan
Yang, Wei
Zhang, Peiyu
Xiao, Xiongye
Cao, Defu
Qin, Yuehan
Zhang, Xiaole
Zhao, Yue
Bogdan, Paul
Machine Learning
Artificial Intelligence
Weather forecasting is crucial for public safety, disaster prevention and mitigation, agricultural production, and energy management, with global relevance. Although deep learning has significantly advanced weather prediction, current methods face critical limitations: (i) they often struggle to capture both dynamic temporal dependencies and short-term abrupt changes, making extreme weather modeling difficult; (ii) they incur high computational costs due to extensive training and resource requirements; (iii) they have limited adaptability to multi-scale frequencies, leading to challenges when separating global trends from local fluctuations. To address these issues, we propose ClimateLLM, a foundation model for weather forecasting. It captures spatiotemporal dependencies via a cross-temporal and cross-spatial collaborative modeling framework that integrates Fourier-based frequency decomposition with Large Language Models (LLMs) to strengthen spatial and temporal modeling. Our framework uses a Mixture-of-Experts (MoE) mechanism that adaptively processes different frequency components, enabling efficient handling of both global signals and localized extreme events. In addition, we introduce a cross-temporal and cross-spatial dynamic prompting mechanism, allowing LLMs to incorporate meteorological patterns across multiple scales effectively. Extensive experiments on real-world datasets show that ClimateLLM outperforms state-of-the-art approaches in accuracy and efficiency, as a scalable solution for global weather forecasting.
title ClimateLLM: Efficient Weather Forecasting via Frequency-Aware Large Language Models
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2502.11059