Saved in:
Bibliographic Details
Main Authors: Oh, Eun Jeong, Ahn, Seungjun, Tham, Tristan, Qian, Min
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2503.16732
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908424800829440
author Oh, Eun Jeong
Ahn, Seungjun
Tham, Tristan
Qian, Min
author_facet Oh, Eun Jeong
Ahn, Seungjun
Tham, Tristan
Qian, Min
contents Accurate survival predicting models are essential for improving targeted cancer therapies and clinical care among cancer patients. In this article, we investigate and develop a method to improve predictions of survival in cancer by leveraging two-phase data with expert knowledge and prognostic index. Our work is motivated by two-phase data in nasopharyngeal cancer (NPC), where traditional covariates are readily available for all subjects, but the primary viral factor, Human Papillomavirus (HPV), is substantially missing. To address this challenge, we propose an expert guided method that incorporates prognostic index based on the observed covariates and clinical importance of key factors. The proposed method makes efficient use of available data, not simply discarding patients with unknown HPV status. We apply the proposed method and evaluate it against other existing approaches through a series of simulation studies and real data example of NPC patients. Under various settings, the proposed method consistently outperforms competing methods in terms of c-index, calibration slope, and integrated Brier score. By efficiently leveraging two-phase data, the model provides a more accurate and reliable predictive ability of survival models.
format Preprint
id arxiv_https___arxiv_org_abs_2503_16732
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Leveraging Two-Phase Data for Improved Prediction of Survival Outcomes with Application to Nasopharyngeal Cancer
Oh, Eun Jeong
Ahn, Seungjun
Tham, Tristan
Qian, Min
Methodology
Accurate survival predicting models are essential for improving targeted cancer therapies and clinical care among cancer patients. In this article, we investigate and develop a method to improve predictions of survival in cancer by leveraging two-phase data with expert knowledge and prognostic index. Our work is motivated by two-phase data in nasopharyngeal cancer (NPC), where traditional covariates are readily available for all subjects, but the primary viral factor, Human Papillomavirus (HPV), is substantially missing. To address this challenge, we propose an expert guided method that incorporates prognostic index based on the observed covariates and clinical importance of key factors. The proposed method makes efficient use of available data, not simply discarding patients with unknown HPV status. We apply the proposed method and evaluate it against other existing approaches through a series of simulation studies and real data example of NPC patients. Under various settings, the proposed method consistently outperforms competing methods in terms of c-index, calibration slope, and integrated Brier score. By efficiently leveraging two-phase data, the model provides a more accurate and reliable predictive ability of survival models.
title Leveraging Two-Phase Data for Improved Prediction of Survival Outcomes with Application to Nasopharyngeal Cancer
topic Methodology
url https://arxiv.org/abs/2503.16732