Saved in:
Bibliographic Details
Main Authors: Saha, Anushka, Gandrakota, Abhijith, Morozov, Alexandre V.
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.05926
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917259039997952
author Saha, Anushka
Gandrakota, Abhijith
Morozov, Alexandre V.
author_facet Saha, Anushka
Gandrakota, Abhijith
Morozov, Alexandre V.
contents Modeling count data is important in physics and other scientific disciplines, where measurements often involve discrete, non-negative quantities such as photon or neutrino detection events. Traditional parametric approaches can be trained to generate integer-count predictions but may struggle with capturing complex, non-linear dependencies often observed in the data. Gaussian process (GP) regression provides a robust non-parametric alternative to modeling continuous data; however, it cannot generate integer outputs. We propose the Poisson Log-Normal (PoLoN) process, a framework that employs GP to model Poisson log-rates. As in GP regression, our approach relies on the correlations between data points captured via GP kernel structure rather than explicit functional parameterizations. We demonstrate that the PoLoN predictive distribution is Poisson-LogNormal and provide an algorithm for optimizing kernel hyperparameters. Furthermore, we adapt the PoLoN approach to the problem of detecting weak localized signals superimposed on a smoothly varying background - a task of considerable interest in many areas of science and engineering. Our framework allows us to predict the strength, location and width of the detected signals. We evaluate PoLoN's performance using both synthetic and real-world datasets, including the open dataset from CERN which was used to detect the Higgs boson at the Large Hadron Collider. Our results indicate that the PoLoN process can be used as a non-parametric alternative for analyzing, predicting, and extracting signals from integer-valued data.
format Preprint
id arxiv_https___arxiv_org_abs_2602_05926
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Poisson Log-Normal Process for Count Data Prediction
Saha, Anushka
Gandrakota, Abhijith
Morozov, Alexandre V.
Data Analysis, Statistics and Probability
High Energy Physics - Experiment
60G15
Modeling count data is important in physics and other scientific disciplines, where measurements often involve discrete, non-negative quantities such as photon or neutrino detection events. Traditional parametric approaches can be trained to generate integer-count predictions but may struggle with capturing complex, non-linear dependencies often observed in the data. Gaussian process (GP) regression provides a robust non-parametric alternative to modeling continuous data; however, it cannot generate integer outputs. We propose the Poisson Log-Normal (PoLoN) process, a framework that employs GP to model Poisson log-rates. As in GP regression, our approach relies on the correlations between data points captured via GP kernel structure rather than explicit functional parameterizations. We demonstrate that the PoLoN predictive distribution is Poisson-LogNormal and provide an algorithm for optimizing kernel hyperparameters. Furthermore, we adapt the PoLoN approach to the problem of detecting weak localized signals superimposed on a smoothly varying background - a task of considerable interest in many areas of science and engineering. Our framework allows us to predict the strength, location and width of the detected signals. We evaluate PoLoN's performance using both synthetic and real-world datasets, including the open dataset from CERN which was used to detect the Higgs boson at the Large Hadron Collider. Our results indicate that the PoLoN process can be used as a non-parametric alternative for analyzing, predicting, and extracting signals from integer-valued data.
title Poisson Log-Normal Process for Count Data Prediction
topic Data Analysis, Statistics and Probability
High Energy Physics - Experiment
60G15
url https://arxiv.org/abs/2602.05926