Saved in:
Bibliographic Details
Main Authors: Pungitore, Sarah, Yadav, Shashank, Subbian, Vignesh
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2503.19265
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913925706022912
author Pungitore, Sarah
Yadav, Shashank
Subbian, Vignesh
author_facet Pungitore, Sarah
Yadav, Shashank
Subbian, Vignesh
contents Computational phenotyping is essential for biomedical research but often requires significant time and resources, especially since traditional methods typically involve extensive manual data review. While machine learning and natural language processing advancements have helped, further improvements are needed. Few studies have explored using Large Language Models (LLMs) for these tasks despite known advantages of LLMs for text-based tasks. To facilitate further research in this area, we developed an evaluation framework, Evaluation of PHEnotyping for Observational Health Data (PHEONA), that outlines context-specific considerations. We applied and demonstrated PHEONA on concept classification, a specific task within a broader phenotyping process for Acute Respiratory Failure (ARF) respiratory support therapies. From the sample concepts tested, we achieved high classification accuracy, suggesting the potential for LLM-based methods to improve computational phenotyping processes.
format Preprint
id arxiv_https___arxiv_org_abs_2503_19265
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle PHEONA: An Evaluation Framework for Large Language Model-based Approaches to Computational Phenotyping
Pungitore, Sarah
Yadav, Shashank
Subbian, Vignesh
Computation and Language
Computational phenotyping is essential for biomedical research but often requires significant time and resources, especially since traditional methods typically involve extensive manual data review. While machine learning and natural language processing advancements have helped, further improvements are needed. Few studies have explored using Large Language Models (LLMs) for these tasks despite known advantages of LLMs for text-based tasks. To facilitate further research in this area, we developed an evaluation framework, Evaluation of PHEnotyping for Observational Health Data (PHEONA), that outlines context-specific considerations. We applied and demonstrated PHEONA on concept classification, a specific task within a broader phenotyping process for Acute Respiratory Failure (ARF) respiratory support therapies. From the sample concepts tested, we achieved high classification accuracy, suggesting the potential for LLM-based methods to improve computational phenotyping processes.
title PHEONA: An Evaluation Framework for Large Language Model-based Approaches to Computational Phenotyping
topic Computation and Language
url https://arxiv.org/abs/2503.19265