Saved in:
Bibliographic Details
Main Authors: Ćmiel, Bogdan, Ledwina, Teresa
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2503.01671
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910855668432896
author Ćmiel, Bogdan
Ledwina, Teresa
author_facet Ćmiel, Bogdan
Ledwina, Teresa
contents This paper reconsiders the problem of testing the equality of two unspecified continuous distributions. The framework, which we propose, allows for readable and insightful data visualisation and helps to understand and quantify how two groups of data differ. We consider a useful weighted rank empirical process on (0,1) and utilise a grid-based approach, based on diadic partitions of (0,1), to discretize the continuous process and construct local simultaneous acceptance regions. These regions help to identify statistically significant deviations from the null model. In addition, the form of the process and its dicretization lead to a highly interpretable visualisation of distributional differences. We also introduce a new two-sample test, explicitly related to the visualisation. Numerical studies show that the new test procedure performs very well. We illustrate the use and diagnostic capabilities of our approach by an application to a known set of neuroscience data.
format Preprint
id arxiv_https___arxiv_org_abs_2503_01671
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Diagnostic tools for exploring differences in distributional properties between two samples: nonparametric approach
Ćmiel, Bogdan
Ledwina, Teresa
Methodology
This paper reconsiders the problem of testing the equality of two unspecified continuous distributions. The framework, which we propose, allows for readable and insightful data visualisation and helps to understand and quantify how two groups of data differ. We consider a useful weighted rank empirical process on (0,1) and utilise a grid-based approach, based on diadic partitions of (0,1), to discretize the continuous process and construct local simultaneous acceptance regions. These regions help to identify statistically significant deviations from the null model. In addition, the form of the process and its dicretization lead to a highly interpretable visualisation of distributional differences. We also introduce a new two-sample test, explicitly related to the visualisation. Numerical studies show that the new test procedure performs very well. We illustrate the use and diagnostic capabilities of our approach by an application to a known set of neuroscience data.
title Diagnostic tools for exploring differences in distributional properties between two samples: nonparametric approach
topic Methodology
url https://arxiv.org/abs/2503.01671