Saved in:
Bibliographic Details
Main Authors: Li, Jiachun, Simchi-Levi, David, Zhao, Yunxiao
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.05552
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910694997229568
author Li, Jiachun
Simchi-Levi, David
Zhao, Yunxiao
author_facet Li, Jiachun
Simchi-Levi, David
Zhao, Yunxiao
contents Given n experiment subjects with potentially heterogeneous covariates and two possible treatments, namely active treatment and control, this paper addresses the fundamental question of determining the optimal accuracy in estimating the treatment effect. Furthermore, we propose an experimental design that approaches this optimal accuracy, giving a (non-asymptotic) answer to this fundamental yet still open question. The methodological contribution is listed as following. First, we establish an idealized optimal estimator with minimal variance as benchmark, and then demonstrate that adaptive experiment is necessary to achieve near-optimal estimation accuracy. Secondly, by incorporating the concept of doubly robust method into sequential experimental design, we frame the optimal estimation problem as an online bandit learning problem, bridging the two fields of statistical estimation and bandit learning. Using tools and ideas from both bandit algorithm design and adaptive statistical estimation, we propose a general low switching adaptive experiment framework, which could be a generic research paradigm for a wide range of adaptive experimental design. Through novel lower bound techniques for non-i.i.d. data, we demonstrate the optimality of our proposed experiment. Numerical result indicates that the estimation accuracy approaches optimal with as few as two or three policy updates.
format Preprint
id arxiv_https___arxiv_org_abs_2410_05552
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Optimal Adaptive Experimental Design for Estimating Treatment Effect
Li, Jiachun
Simchi-Levi, David
Zhao, Yunxiao
Machine Learning
Given n experiment subjects with potentially heterogeneous covariates and two possible treatments, namely active treatment and control, this paper addresses the fundamental question of determining the optimal accuracy in estimating the treatment effect. Furthermore, we propose an experimental design that approaches this optimal accuracy, giving a (non-asymptotic) answer to this fundamental yet still open question. The methodological contribution is listed as following. First, we establish an idealized optimal estimator with minimal variance as benchmark, and then demonstrate that adaptive experiment is necessary to achieve near-optimal estimation accuracy. Secondly, by incorporating the concept of doubly robust method into sequential experimental design, we frame the optimal estimation problem as an online bandit learning problem, bridging the two fields of statistical estimation and bandit learning. Using tools and ideas from both bandit algorithm design and adaptive statistical estimation, we propose a general low switching adaptive experiment framework, which could be a generic research paradigm for a wide range of adaptive experimental design. Through novel lower bound techniques for non-i.i.d. data, we demonstrate the optimality of our proposed experiment. Numerical result indicates that the estimation accuracy approaches optimal with as few as two or three policy updates.
title Optimal Adaptive Experimental Design for Estimating Treatment Effect
topic Machine Learning
url https://arxiv.org/abs/2410.05552