Saved in:
Bibliographic Details
Main Authors: Do, Tien Huu, Masquelier, Antoine, Lee, Nae Eoun, Crowther, Jonathan
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2507.23607
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912678946013184
author Do, Tien Huu
Masquelier, Antoine
Lee, Nae Eoun
Crowther, Jonathan
author_facet Do, Tien Huu
Masquelier, Antoine
Lee, Nae Eoun
Crowther, Jonathan
contents Clinical trials are a systematic endeavor to assess the safety and efficacy of new drugs or treatments. Conducting such trials typically demands significant financial investment and meticulous planning, highlighting the need for accurate predictions of trial outcomes. Accurately predicting patient enrollment, a key factor in trial success, is one of the primary challenges during the planning phase. In this work, we propose a novel deep learning-based method to address this critical challenge. Our method, implemented as a neural network model, leverages pre-trained language models (PLMs) to capture the complexities and nuances of clinical documents, transforming them into expressive representations. These representations are then combined with encoded tabular features via an attention mechanism. To account for uncertainties in enrollment prediction, we enhance the model with a probabilistic layer based on the Gamma distribution, which enables range estimation. We apply the proposed model to predict clinical trial duration, assuming site-level enrollment follows a Poisson-Gamma process. We carry out extensive experiments on real-world clinical trial data, and show that the proposed method can effectively predict the number of patients enrolled at a number of sites for a given clinical trial, outperforming established baseline models.
format Preprint
id arxiv_https___arxiv_org_abs_2507_23607
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Deep Learning-based Prediction of Clinical Trial Enrollment with Uncertainty Estimates
Do, Tien Huu
Masquelier, Antoine
Lee, Nae Eoun
Crowther, Jonathan
Machine Learning
Artificial Intelligence
Computation and Language
Clinical trials are a systematic endeavor to assess the safety and efficacy of new drugs or treatments. Conducting such trials typically demands significant financial investment and meticulous planning, highlighting the need for accurate predictions of trial outcomes. Accurately predicting patient enrollment, a key factor in trial success, is one of the primary challenges during the planning phase. In this work, we propose a novel deep learning-based method to address this critical challenge. Our method, implemented as a neural network model, leverages pre-trained language models (PLMs) to capture the complexities and nuances of clinical documents, transforming them into expressive representations. These representations are then combined with encoded tabular features via an attention mechanism. To account for uncertainties in enrollment prediction, we enhance the model with a probabilistic layer based on the Gamma distribution, which enables range estimation. We apply the proposed model to predict clinical trial duration, assuming site-level enrollment follows a Poisson-Gamma process. We carry out extensive experiments on real-world clinical trial data, and show that the proposed method can effectively predict the number of patients enrolled at a number of sites for a given clinical trial, outperforming established baseline models.
title Deep Learning-based Prediction of Clinical Trial Enrollment with Uncertainty Estimates
topic Machine Learning
Artificial Intelligence
Computation and Language
url https://arxiv.org/abs/2507.23607