Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Li, Xiaoli, Chen, Yang, Meng, Xiao-Li, van Dyk, David, Bonamente, Massimiliano, Kashyap, Vinay
Format:	Preprint
Published:	2025
Subjects:	Methodology Applications
Online Access:	https://arxiv.org/abs/2510.03466
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914072971182080
author	Li, Xiaoli Chen, Yang Meng, Xiao-Li van Dyk, David Bonamente, Massimiliano Kashyap, Vinay
author_facet	Li, Xiaoli Chen, Yang Meng, Xiao-Li van Dyk, David Bonamente, Massimiliano Kashyap, Vinay
contents	The C statistic is a widely used likelihood-ratio statistic for model fitting and goodness-of-fit assessments with Poisson data in high-energy physics and astrophysics. Although it enjoys convenient asymptotic properties, the statistic is routinely applied in cases where its nominal null distribution relies on unwarranted assumptions. Because researchers do not typically carry out robustness checks, their scientific findings are left vulnerable to misleading significance calculations. With an emphasis on low-count scenarios, we present a comprehensive study of the theoretical properties of C statistics and related goodness-of-fit algorithms. We focus on common ``plug-in'' algorithms where moments of C are obtained by assuming the true parameter equals its estimate. To correct such methods, we provide a suite of new principled user-friendly algorithms and well-calibrated p-values that are ready for immediate deployment in the (astro)physics data-analysis pipeline. Using both theoretical and numerical results, we show (a) standard $χ^2$-based goodness-of-fit assessments are invalid in low-count settings, (b) naive methods (e.g., vanilla bootstrap) result in biased null distributions, and (c) the corrected Z-test based on conditioning and high-order asymptotics gives the best precision with low computational cost. We illustrate our methods via a suite of simulations and applied astrophysical analyses. An open-source Python package is provided in a GitHub repository.
format	Preprint
id	arxiv_https___arxiv_org_abs_2510_03466
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Making high-order asymptotics practical: correcting goodness-of-fit test for astronomical count data Li, Xiaoli Chen, Yang Meng, Xiao-Li van Dyk, David Bonamente, Massimiliano Kashyap, Vinay Methodology Applications The C statistic is a widely used likelihood-ratio statistic for model fitting and goodness-of-fit assessments with Poisson data in high-energy physics and astrophysics. Although it enjoys convenient asymptotic properties, the statistic is routinely applied in cases where its nominal null distribution relies on unwarranted assumptions. Because researchers do not typically carry out robustness checks, their scientific findings are left vulnerable to misleading significance calculations. With an emphasis on low-count scenarios, we present a comprehensive study of the theoretical properties of C statistics and related goodness-of-fit algorithms. We focus on common ``plug-in'' algorithms where moments of C are obtained by assuming the true parameter equals its estimate. To correct such methods, we provide a suite of new principled user-friendly algorithms and well-calibrated p-values that are ready for immediate deployment in the (astro)physics data-analysis pipeline. Using both theoretical and numerical results, we show (a) standard $χ^2$-based goodness-of-fit assessments are invalid in low-count settings, (b) naive methods (e.g., vanilla bootstrap) result in biased null distributions, and (c) the corrected Z-test based on conditioning and high-order asymptotics gives the best precision with low computational cost. We illustrate our methods via a suite of simulations and applied astrophysical analyses. An open-source Python package is provided in a GitHub repository.
title	Making high-order asymptotics practical: correcting goodness-of-fit test for astronomical count data
topic	Methodology Applications
url	https://arxiv.org/abs/2510.03466

Similar Items