Saved in:
Bibliographic Details
Main Authors: Chen, Yuchen, Lei, Jing
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.00164
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917580245041152
author Chen, Yuchen
Lei, Jing
author_facet Chen, Yuchen
Lei, Jing
contents In some high-dimensional and semiparametric inference problems involving two populations, the parameter of interest can be characterized by two-sample U-statistics involving some nuisance parameters. In this work we first extend the framework of one-step estimation with cross-fitting to two-sample U-statistics, showing that using an orthogonalized influence function can effectively remove the first order bias, resulting in asymptotically normal estimates of the parameter of interest. As an example, we apply this method and theory to the problem of testing two-sample conditional distributions, also known as strong ignorability. When combined with a conformal-based rank-sum test, we discover that the nuisance parameters can be divided into two categories, where in one category the nuisance estimation accuracy does not affect the testing validity, whereas in the other the nuisance estimation accuracy must satisfy the usual requirement for the test to be valid. We believe these findings provide further insights into and enhance the conformal inference toolbox.
format Preprint
id arxiv_https___arxiv_org_abs_2402_00164
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle De-Biased Two-Sample U-Statistics With Application To Conditional Distribution Testing
Chen, Yuchen
Lei, Jing
Methodology
In some high-dimensional and semiparametric inference problems involving two populations, the parameter of interest can be characterized by two-sample U-statistics involving some nuisance parameters. In this work we first extend the framework of one-step estimation with cross-fitting to two-sample U-statistics, showing that using an orthogonalized influence function can effectively remove the first order bias, resulting in asymptotically normal estimates of the parameter of interest. As an example, we apply this method and theory to the problem of testing two-sample conditional distributions, also known as strong ignorability. When combined with a conformal-based rank-sum test, we discover that the nuisance parameters can be divided into two categories, where in one category the nuisance estimation accuracy does not affect the testing validity, whereas in the other the nuisance estimation accuracy must satisfy the usual requirement for the test to be valid. We believe these findings provide further insights into and enhance the conformal inference toolbox.
title De-Biased Two-Sample U-Statistics With Application To Conditional Distribution Testing
topic Methodology
url https://arxiv.org/abs/2402.00164