Saved in:
Bibliographic Details
Main Authors: Nessari, Saman, Bozorgi-Amiri, Ali
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.19014
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911225919569920
author Nessari, Saman
Bozorgi-Amiri, Ali
author_facet Nessari, Saman
Bozorgi-Amiri, Ali
contents Current medical practice depends on standardized treatment frameworks and empirical methodologies that neglect individual patient variations, leading to suboptimal health outcomes. We develop a comprehensive system integrating Large Language Models (LLMs), Conditional Tabular Generative Adversarial Networks (CTGAN), T-learner counterfactual models, and contextual bandit approaches to provide customized, data-informed clinical recommendations. The approach utilizes LLMs to process unstructured medical narratives into structured datasets (93.2% accuracy), uses CTGANs to produce realistic synthetic patient data (55% accuracy via two-sample verification), deploys T-learners to forecast patient-specific treatment responses (84.3% accuracy), and integrates prior-informed contextual bandits to enhance online therapeutic selection by effectively balancing exploration of new possibilities with exploitation of existing knowledge. Testing on stage III colon cancer datasets revealed that our KernelUCB approach obtained 0.60-0.61 average reward scores across 5,000 rounds, exceeding other reference methods. This comprehensive system overcomes cold-start limitations in online learning environments, improves computational effectiveness, and constitutes notable progress toward individualized medicine adapted to specific patient characteristics.
format Preprint
id arxiv_https___arxiv_org_abs_2510_19014
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Prior-informed optimization of treatment recommendation via bandit algorithms trained on large language model-processed historical records
Nessari, Saman
Bozorgi-Amiri, Ali
Machine Learning
Artificial Intelligence
Current medical practice depends on standardized treatment frameworks and empirical methodologies that neglect individual patient variations, leading to suboptimal health outcomes. We develop a comprehensive system integrating Large Language Models (LLMs), Conditional Tabular Generative Adversarial Networks (CTGAN), T-learner counterfactual models, and contextual bandit approaches to provide customized, data-informed clinical recommendations. The approach utilizes LLMs to process unstructured medical narratives into structured datasets (93.2% accuracy), uses CTGANs to produce realistic synthetic patient data (55% accuracy via two-sample verification), deploys T-learners to forecast patient-specific treatment responses (84.3% accuracy), and integrates prior-informed contextual bandits to enhance online therapeutic selection by effectively balancing exploration of new possibilities with exploitation of existing knowledge. Testing on stage III colon cancer datasets revealed that our KernelUCB approach obtained 0.60-0.61 average reward scores across 5,000 rounds, exceeding other reference methods. This comprehensive system overcomes cold-start limitations in online learning environments, improves computational effectiveness, and constitutes notable progress toward individualized medicine adapted to specific patient characteristics.
title Prior-informed optimization of treatment recommendation via bandit algorithms trained on large language model-processed historical records
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2510.19014