Saved in:
Bibliographic Details
Main Authors: Hutchinson, Bradley D., Pilachowski, Catherine A., Johnson, Christian I.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2507.17882
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913987395846144
author Hutchinson, Bradley D.
Pilachowski, Catherine A.
Johnson, Christian I.
author_facet Hutchinson, Bradley D.
Pilachowski, Catherine A.
Johnson, Christian I.
contents Observational astronomy has undergone a significant transformation driven by large-scale surveys, such as the Panoramic Survey Telescope and Rapid Response System (Pan-STARRS) Survey, the Sloan Digital Sky Survey (SDSS), and the Gaia Mission. These programs yield large, complex datasets that pose significant challenges for conventional analysis methods, and as a result, many different machine learning techniques are being tested and deployed. We introduce a new approach to analyzing multiband photometry by using a long-short term memory autoencoder (LSTM-AE). This model provides input-dependent reweighting across passbands on a star-by-star basis, enabling it to encode patterns present in the stars' spectral energy distributions (SEDs) into a two-dimensional latent space. We showcase this by using Pan-STARRS grizy mean magnitudes, and we use globular clusters, labels from SIMBAD, Gaia DR3 parallaxes, and PanSTARRS images to aid our analysis and understanding of the latent space. For 3,112,259 stars in an annulus around the North Galactic Cap, 99.51% have their full SED shape reconstructed--that is the absolute difference between the observed and the model predicted magnitude in every band--within five hundredths of a magnitude. We show that the model likely denoises photometric data, potentially improving the quality of measurements. Lastly, we show that the detection of rare stellar types can be performed by analyzing poorly reconstructed photometry.
format Preprint
id arxiv_https___arxiv_org_abs_2507_17882
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle A Wavelength-Aware Unsupervised Learning Approach for Large, Multicolor, Photometric Surveys
Hutchinson, Bradley D.
Pilachowski, Catherine A.
Johnson, Christian I.
Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
Observational astronomy has undergone a significant transformation driven by large-scale surveys, such as the Panoramic Survey Telescope and Rapid Response System (Pan-STARRS) Survey, the Sloan Digital Sky Survey (SDSS), and the Gaia Mission. These programs yield large, complex datasets that pose significant challenges for conventional analysis methods, and as a result, many different machine learning techniques are being tested and deployed. We introduce a new approach to analyzing multiband photometry by using a long-short term memory autoencoder (LSTM-AE). This model provides input-dependent reweighting across passbands on a star-by-star basis, enabling it to encode patterns present in the stars' spectral energy distributions (SEDs) into a two-dimensional latent space. We showcase this by using Pan-STARRS grizy mean magnitudes, and we use globular clusters, labels from SIMBAD, Gaia DR3 parallaxes, and PanSTARRS images to aid our analysis and understanding of the latent space. For 3,112,259 stars in an annulus around the North Galactic Cap, 99.51% have their full SED shape reconstructed--that is the absolute difference between the observed and the model predicted magnitude in every band--within five hundredths of a magnitude. We show that the model likely denoises photometric data, potentially improving the quality of measurements. Lastly, we show that the detection of rare stellar types can be performed by analyzing poorly reconstructed photometry.
title A Wavelength-Aware Unsupervised Learning Approach for Large, Multicolor, Photometric Surveys
topic Instrumentation and Methods for Astrophysics
Solar and Stellar Astrophysics
url https://arxiv.org/abs/2507.17882