Saved in:
Bibliographic Details
Main Authors: Liventsev, Vadim, Fritz, Tobias
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.17501
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911886671347712
author Liventsev, Vadim
Fritz, Tobias
author_facet Liventsev, Vadim
Fritz, Tobias
contents Reinforcement Learning in Healthcare is typically concerned with narrow self-contained tasks such as sepsis prediction or anesthesia control. However, previous research has demonstrated the potential of generalist models (the prime example being Large Language Models) to outperform task-specific approaches due to their capability for implicit transfer learning. To enable training of foundation models for Healthcare as well as leverage the capabilities of state of the art Transformer architectures, we propose the paradigm of Healthcare as Sequence Modeling, in which interaction between the patient and the healthcare provider is represented as an event stream and tasks like diagnosis and treatment selection are modeled as prediction of future events in the stream. To explore this paradigm experimentally we develop MIMIC-SEQ, a sequence modeling benchmark derived by translating heterogenous clinical records from MIMIC-IV dataset into a uniform event stream format, train a baseline model and explore its capabilities.
format Preprint
id arxiv_https___arxiv_org_abs_2402_17501
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Intensive Care as One Big Sequence Modeling Problem
Liventsev, Vadim
Fritz, Tobias
Machine Learning
Artificial Intelligence
Reinforcement Learning in Healthcare is typically concerned with narrow self-contained tasks such as sepsis prediction or anesthesia control. However, previous research has demonstrated the potential of generalist models (the prime example being Large Language Models) to outperform task-specific approaches due to their capability for implicit transfer learning. To enable training of foundation models for Healthcare as well as leverage the capabilities of state of the art Transformer architectures, we propose the paradigm of Healthcare as Sequence Modeling, in which interaction between the patient and the healthcare provider is represented as an event stream and tasks like diagnosis and treatment selection are modeled as prediction of future events in the stream. To explore this paradigm experimentally we develop MIMIC-SEQ, a sequence modeling benchmark derived by translating heterogenous clinical records from MIMIC-IV dataset into a uniform event stream format, train a baseline model and explore its capabilities.
title Intensive Care as One Big Sequence Modeling Problem
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2402.17501