Saved in:
Bibliographic Details
Main Authors: Liu, Dan, He, Wenqing
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.04697
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915056739942400
author Liu, Dan
He, Wenqing
author_facet Liu, Dan
He, Wenqing
contents The study of precision medicine involves dynamic treatment regimes (DTRs), which are sequences of treatment decision rules recommended by taking patient-level information as input. The primary goal of the DTR study is to identify an optimal DTR, a sequence of treatment decision rules that leads to the best expected clinical outcome. Statistical methods have been developed in recent years to estimate an optimal DTR, including Q-learning, a regression-based method in the DTR literature. Although there are many studies concerning Q-learning, little attention has been given in the presence of noisy data, such as misclassified outcomes. In this paper, we investigate the effect of outcome misclassification on Q-learning and propose a correction method to accommodate the misclassification effect. Simulation studies are conducted to demonstrate the satisfactory performance of the proposed method. We illustrate the proposed method in two examples from the National Health and Nutrition Examination Survey Data I Epidemiologic Follow-up Study and the smoking cessation program.
format Preprint
id arxiv_https___arxiv_org_abs_2404_04697
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Q-learning in Dynamic Treatment Regimes with Misclassified Binary Outcome
Liu, Dan
He, Wenqing
Methodology
The study of precision medicine involves dynamic treatment regimes (DTRs), which are sequences of treatment decision rules recommended by taking patient-level information as input. The primary goal of the DTR study is to identify an optimal DTR, a sequence of treatment decision rules that leads to the best expected clinical outcome. Statistical methods have been developed in recent years to estimate an optimal DTR, including Q-learning, a regression-based method in the DTR literature. Although there are many studies concerning Q-learning, little attention has been given in the presence of noisy data, such as misclassified outcomes. In this paper, we investigate the effect of outcome misclassification on Q-learning and propose a correction method to accommodate the misclassification effect. Simulation studies are conducted to demonstrate the satisfactory performance of the proposed method. We illustrate the proposed method in two examples from the National Health and Nutrition Examination Survey Data I Epidemiologic Follow-up Study and the smoking cessation program.
title Q-learning in Dynamic Treatment Regimes with Misclassified Binary Outcome
topic Methodology
url https://arxiv.org/abs/2404.04697