Saved in:
Bibliographic Details
Main Author: Fukumoto, Kentaro
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2511.08184
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908644350623744
author Fukumoto, Kentaro
author_facet Fukumoto, Kentaro
contents When scholars suspect units are dependent on each other within clusters but independent of each other across clusters, they employ cluster-robust standard errors (CRSEs). Nevertheless, what to cluster over is sometimes unknown. For instance, in the case of cross-sectional survey samples, clusters may be households, municipalities, counties, or states. A few approaches have been proposed, although they are based on asymptotics. I propose a new method to address this issue that works in a finite sample: reclustering. That is, we randomly and repeatedly group fine clusters into new gross clusters and calculate a statistic such as CRSEs. Under the null hypothesis that fine clusters are independent of each other, how they are grouped into gross clusters should not matter for any cluster-sensitive statistic. Thus, if the statistic based on the original clustering is a significant outlier against the distributions of the statistics induced by reclustering, it is reasonable to reject the null hypothesis and employ gross clusters. I compare the performance of reclustering with that of a few previous tests using Monte Carlo simulation and application.
format Preprint
id arxiv_https___arxiv_org_abs_2511_08184
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Reclustering: A New Method to Test the Appropriate Level of Clustering
Fukumoto, Kentaro
Methodology
When scholars suspect units are dependent on each other within clusters but independent of each other across clusters, they employ cluster-robust standard errors (CRSEs). Nevertheless, what to cluster over is sometimes unknown. For instance, in the case of cross-sectional survey samples, clusters may be households, municipalities, counties, or states. A few approaches have been proposed, although they are based on asymptotics. I propose a new method to address this issue that works in a finite sample: reclustering. That is, we randomly and repeatedly group fine clusters into new gross clusters and calculate a statistic such as CRSEs. Under the null hypothesis that fine clusters are independent of each other, how they are grouped into gross clusters should not matter for any cluster-sensitive statistic. Thus, if the statistic based on the original clustering is a significant outlier against the distributions of the statistics induced by reclustering, it is reasonable to reject the null hypothesis and employ gross clusters. I compare the performance of reclustering with that of a few previous tests using Monte Carlo simulation and application.
title Reclustering: A New Method to Test the Appropriate Level of Clustering
topic Methodology
url https://arxiv.org/abs/2511.08184