Saved in:
Bibliographic Details
Main Author: Campagne, Jean-Eric
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2503.23127
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909606928711680
author Campagne, Jean-Eric
author_facet Campagne, Jean-Eric
contents Generative models have recently revolutionized image generation tasks across diverse domains, including galaxy image synthesis. This study investigates the statistical learning and consistency of three generative models: light-weight-gan (a GAN-based model), Glow (a Normalizing Flow-based model), and a diffusion model based on a U-Net denoiser, all trained on non-overlapping subsets of the SDSS dataset of 64x64 grayscale images. While all models produce visually realistic images with well-preserved morphological variable distributions, we focus on their ability to learn and generalize the underlying data distribution. The diffusion model shows a transition from memorization to generalization as the dataset size increases, confirming previous findings. Smaller datasets lead to overfitting, while larger datasets enable novel sample generation, supported by the denoising process's theoretical basis. For the flow-based model, we propose an "inversion test" leveraging its bijective nature. Similarly, the GAN-based model achieves comparable morphological consistency but lacks bijectivity. We then introduce a "discriminator test", which shows successful learning for larger datasets but poorer confidence with smaller ones. Across all models, dataset sizes below O(100,000) pose challenges to learning. Along our experiments, the "two-models" framework enables robust evaluations, highlighting both the potential and limitations of these models. These findings provide valuable insights into statistical learning in generative modeling, with applications certainly extending beyond galaxy image generation.
format Preprint
id arxiv_https___arxiv_org_abs_2503_23127
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Galaxy Imaging with Generative Models: Insights from a Two-Models Framework
Campagne, Jean-Eric
Instrumentation and Methods for Astrophysics
Generative models have recently revolutionized image generation tasks across diverse domains, including galaxy image synthesis. This study investigates the statistical learning and consistency of three generative models: light-weight-gan (a GAN-based model), Glow (a Normalizing Flow-based model), and a diffusion model based on a U-Net denoiser, all trained on non-overlapping subsets of the SDSS dataset of 64x64 grayscale images. While all models produce visually realistic images with well-preserved morphological variable distributions, we focus on their ability to learn and generalize the underlying data distribution. The diffusion model shows a transition from memorization to generalization as the dataset size increases, confirming previous findings. Smaller datasets lead to overfitting, while larger datasets enable novel sample generation, supported by the denoising process's theoretical basis. For the flow-based model, we propose an "inversion test" leveraging its bijective nature. Similarly, the GAN-based model achieves comparable morphological consistency but lacks bijectivity. We then introduce a "discriminator test", which shows successful learning for larger datasets but poorer confidence with smaller ones. Across all models, dataset sizes below O(100,000) pose challenges to learning. Along our experiments, the "two-models" framework enables robust evaluations, highlighting both the potential and limitations of these models. These findings provide valuable insights into statistical learning in generative modeling, with applications certainly extending beyond galaxy image generation.
title Galaxy Imaging with Generative Models: Insights from a Two-Models Framework
topic Instrumentation and Methods for Astrophysics
url https://arxiv.org/abs/2503.23127