Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Williamson, Alexander, Dodson, Richard, Elahi, Pascal J., Rhee, Jonghwan, Gong, Qian, Meyer, Martin, Rozgonyi, Kristof, Wicenec, Andreas, Chen, Jieyang, Podhorszki, Norbert, Klasky, Scott, Mitchell, Daniel
Format: Preprint
Veröffentlicht: 2024
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2410.02285
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866909927324254208
author Williamson, Alexander
Dodson, Richard
Elahi, Pascal J.
Rhee, Jonghwan
Gong, Qian
Meyer, Martin
Rozgonyi, Kristof
Wicenec, Andreas
Chen, Jieyang
Podhorszki, Norbert
Klasky, Scott
Mitchell, Daniel
author_facet Williamson, Alexander
Dodson, Richard
Elahi, Pascal J.
Rhee, Jonghwan
Gong, Qian
Meyer, Martin
Rozgonyi, Kristof
Wicenec, Andreas
Chen, Jieyang
Podhorszki, Norbert
Klasky, Scott
Mitchell, Daniel
contents The next generation of radio astronomy telescopes are challenging existing data analysis paradigms, as they have an order of magnitude more antennas and larger bandwidth. Foremost amongst these are deep spectral line surveys, because these have the largest number of epochs and spectral channels per dataset. For example, the Deep Investigation of Neutral Gas Origins (DINGO) project on the Australian Square Kilometre Array Pathfinder (ASKAP) aims to observe over 3,000 hours spread over hundreds of observing sessions, covering two pointings and two frequency settings. The two primary problems encountered when processing this data are the need for storage and that processing is primarily I/O limited. To address these issues, we have implemented an deep imaging pipeline based on the storage of an intermediate data product in the software ASKAPSoft, that of the uv-gridded data, and have demonstrated lossy and lossless compression of this data on ASKAP, using MGARD and ADIOS2 libraries. We find data compression ratios from a factor of 7 (lossless) up to 20 (using lossy compression with an absolute error bound of $10^{-4}$), and processing is faster by a factor of 7 for lossless compression. We discuss the effectiveness of lossy MGARD compression and its adherence to the designated error bounds, the trade-off between these error bounds and the corresponding compression ratios, as well as the potential consequences of these I/O and storage improvements on the science quality of the data products.
format Preprint
id arxiv_https___arxiv_org_abs_2410_02285
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Deep Investigation of Neutral Gas Origins (DINGO): Options for the Processing and Storage of Radio Astronomy Data for robust Deep Spectral Line Imaging in the SKA-Era using uv-Grids
Williamson, Alexander
Dodson, Richard
Elahi, Pascal J.
Rhee, Jonghwan
Gong, Qian
Meyer, Martin
Rozgonyi, Kristof
Wicenec, Andreas
Chen, Jieyang
Podhorszki, Norbert
Klasky, Scott
Mitchell, Daniel
Instrumentation and Methods for Astrophysics
The next generation of radio astronomy telescopes are challenging existing data analysis paradigms, as they have an order of magnitude more antennas and larger bandwidth. Foremost amongst these are deep spectral line surveys, because these have the largest number of epochs and spectral channels per dataset. For example, the Deep Investigation of Neutral Gas Origins (DINGO) project on the Australian Square Kilometre Array Pathfinder (ASKAP) aims to observe over 3,000 hours spread over hundreds of observing sessions, covering two pointings and two frequency settings. The two primary problems encountered when processing this data are the need for storage and that processing is primarily I/O limited. To address these issues, we have implemented an deep imaging pipeline based on the storage of an intermediate data product in the software ASKAPSoft, that of the uv-gridded data, and have demonstrated lossy and lossless compression of this data on ASKAP, using MGARD and ADIOS2 libraries. We find data compression ratios from a factor of 7 (lossless) up to 20 (using lossy compression with an absolute error bound of $10^{-4}$), and processing is faster by a factor of 7 for lossless compression. We discuss the effectiveness of lossy MGARD compression and its adherence to the designated error bounds, the trade-off between these error bounds and the corresponding compression ratios, as well as the potential consequences of these I/O and storage improvements on the science quality of the data products.
title Deep Investigation of Neutral Gas Origins (DINGO): Options for the Processing and Storage of Radio Astronomy Data for robust Deep Spectral Line Imaging in the SKA-Era using uv-Grids
topic Instrumentation and Methods for Astrophysics
url https://arxiv.org/abs/2410.02285