Saved in:
Bibliographic Details
Main Authors: Aljawarneh, Shadi, Lara, Juan A., Yassein, Muneer Bani
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.21176
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914111467552768
author Aljawarneh, Shadi
Lara, Juan A.
Yassein, Muneer Bani
author_facet Aljawarneh, Shadi
Lara, Juan A.
Yassein, Muneer Bani
contents The Meteorology is a field where huge amounts of data are generated, mainly collected by sensors at weather stations, where different variables can be measured. Those data have some particularities such as high volume and dimensionality, the frequent existence of missing values in some stations, and the high correlation between collected variables. In this regard, it is crucial to make use of Big Data and Data Mining techniques to deal with those data and extract useful knowledge from them that can be used, for instance, to predict weather phenomena. In this paper, we propose a visual big data system that is designed to deal with high amounts of weather-related data and lets the user analyze those data to perform predictive tasks over the considered variables (temperature and rainfall). The proposed system collects open data and loads them onto a local NoSQL database fusing them at different levels of temporal and spatial aggregation in order to perform a predictive analysis using univariate and multivariate approaches as well as forecasting based on training data from neighbor stations in cases with high rates of missing values. The system has been assessed in terms of usability and predictive performance, obtaining an overall normalized mean squared error value of 0.00013, and an overall directional symmetry value of nearly 0.84. Our system has been rated positively by a group of experts in the area (all aspects of the system except graphic desing were rated 3 or above in a 1-5 scale). The promising preliminary results obtained demonstrate the validity of our system and invite us to keep working on this area.
format Preprint
id arxiv_https___arxiv_org_abs_2510_21176
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle A visual big data system for the prediction of weather-related variables: Jordan-Spain case study
Aljawarneh, Shadi
Lara, Juan A.
Yassein, Muneer Bani
Machine Learning
The Meteorology is a field where huge amounts of data are generated, mainly collected by sensors at weather stations, where different variables can be measured. Those data have some particularities such as high volume and dimensionality, the frequent existence of missing values in some stations, and the high correlation between collected variables. In this regard, it is crucial to make use of Big Data and Data Mining techniques to deal with those data and extract useful knowledge from them that can be used, for instance, to predict weather phenomena. In this paper, we propose a visual big data system that is designed to deal with high amounts of weather-related data and lets the user analyze those data to perform predictive tasks over the considered variables (temperature and rainfall). The proposed system collects open data and loads them onto a local NoSQL database fusing them at different levels of temporal and spatial aggregation in order to perform a predictive analysis using univariate and multivariate approaches as well as forecasting based on training data from neighbor stations in cases with high rates of missing values. The system has been assessed in terms of usability and predictive performance, obtaining an overall normalized mean squared error value of 0.00013, and an overall directional symmetry value of nearly 0.84. Our system has been rated positively by a group of experts in the area (all aspects of the system except graphic desing were rated 3 or above in a 1-5 scale). The promising preliminary results obtained demonstrate the validity of our system and invite us to keep working on this area.
title A visual big data system for the prediction of weather-related variables: Jordan-Spain case study
topic Machine Learning
url https://arxiv.org/abs/2510.21176