שמור ב:
מידע ביבליוגרפי
Main Authors: Menges, Daniel, Rasheed, Adil
פורמט: Preprint
יצא לאור: 2024
נושאים:
גישה מקוונת:https://arxiv.org/abs/2403.19721
תגים: הוספת תג
אין תגיות, היה/י הראשונ/ה לתייג את הרשומה!
_version_ 1866910510767669248
author Menges, Daniel
Rasheed, Adil
author_facet Menges, Daniel
Rasheed, Adil
contents In the current data-intensive era, big data has become a significant asset for Artificial Intelligence (AI), serving as a foundation for developing data-driven models and providing insight into various unknown fields. This study navigates through the challenges of data uncertainties, storage limitations, and predictive data-driven modeling using big data. We utilize Robust Principal Component Analysis (RPCA) for effective noise reduction and outlier elimination, and Optimal Sensor Placement (OSP) for efficient data compression and storage. The proposed OSP technique enables data compression without substantial information loss while simultaneously reducing storage needs. While RPCA offers an enhanced alternative to traditional Principal Component Analysis (PCA) for high-dimensional data management, the scope of this work extends its utilization, focusing on robust, data-driven modeling applicable to huge data sets in real-time. For that purpose, Long Short-Term Memory (LSTM) networks, a type of recurrent neural network, are applied to model and predict data based on a low-dimensional subset obtained from OSP, leading to a crucial acceleration of the training phase. LSTMs are feasible for capturing long-term dependencies in time series data, making them particularly suited for predicting the future states of physical systems on historical data. All the presented algorithms are not only theorized but also simulated and validated using real thermal imaging data mapping a ship's engine.
format Preprint
id arxiv_https___arxiv_org_abs_2403_19721
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Computationally and Memory-Efficient Robust Predictive Analytics Using Big Data
Menges, Daniel
Rasheed, Adil
Machine Learning
Artificial Intelligence
Image and Video Processing
In the current data-intensive era, big data has become a significant asset for Artificial Intelligence (AI), serving as a foundation for developing data-driven models and providing insight into various unknown fields. This study navigates through the challenges of data uncertainties, storage limitations, and predictive data-driven modeling using big data. We utilize Robust Principal Component Analysis (RPCA) for effective noise reduction and outlier elimination, and Optimal Sensor Placement (OSP) for efficient data compression and storage. The proposed OSP technique enables data compression without substantial information loss while simultaneously reducing storage needs. While RPCA offers an enhanced alternative to traditional Principal Component Analysis (PCA) for high-dimensional data management, the scope of this work extends its utilization, focusing on robust, data-driven modeling applicable to huge data sets in real-time. For that purpose, Long Short-Term Memory (LSTM) networks, a type of recurrent neural network, are applied to model and predict data based on a low-dimensional subset obtained from OSP, leading to a crucial acceleration of the training phase. LSTMs are feasible for capturing long-term dependencies in time series data, making them particularly suited for predicting the future states of physical systems on historical data. All the presented algorithms are not only theorized but also simulated and validated using real thermal imaging data mapping a ship's engine.
title Computationally and Memory-Efficient Robust Predictive Analytics Using Big Data
topic Machine Learning
Artificial Intelligence
Image and Video Processing
url https://arxiv.org/abs/2403.19721