Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Kwon, Sunjae, Wang, Xun, Liu, Weisong, Druhl, Emily, Sung, Minhee L., Reisman, Joel I., Li, Wenjun, Kerns, Robert D., Becker, William, Yu, Hong
Format: Preprint
Veröffentlicht: 2023
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2307.02591
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866911810089648128
author Kwon, Sunjae
Wang, Xun
Liu, Weisong
Druhl, Emily
Sung, Minhee L.
Reisman, Joel I.
Li, Wenjun
Kerns, Robert D.
Becker, William
Yu, Hong
author_facet Kwon, Sunjae
Wang, Xun
Liu, Weisong
Druhl, Emily
Sung, Minhee L.
Reisman, Joel I.
Li, Wenjun
Kerns, Robert D.
Becker, William
Yu, Hong
contents Opioid related aberrant behaviors (ORABs) present novel risk factors for opioid overdose. This paper introduces a novel biomedical natural language processing benchmark dataset named ODD, for ORAB Detection Dataset. ODD is an expert-annotated dataset designed to identify ORABs from patients' EHR notes and classify them into nine categories; 1) Confirmed Aberrant Behavior, 2) Suggested Aberrant Behavior, 3) Opioids, 4) Indication, 5) Diagnosed opioid dependency, 6) Benzodiazepines, 7) Medication Changes, 8) Central Nervous System-related, and 9) Social Determinants of Health. We explored two state-of-the-art natural language processing models (fine-tuning and prompt-tuning approaches) to identify ORAB. Experimental results show that the prompt-tuning models outperformed the fine-tuning models in most categories and the gains were especially higher among uncommon categories (Suggested Aberrant Behavior, Confirmed Aberrant Behaviors, Diagnosed Opioid Dependence, and Medication Change). Although the best model achieved the highest 88.17% on macro average area under precision recall curve, uncommon classes still have a large room for performance improvement. ODD is publicly available.
format Preprint
id arxiv_https___arxiv_org_abs_2307_02591
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle ODD: A Benchmark Dataset for the Natural Language Processing based Opioid Related Aberrant Behavior Detection
Kwon, Sunjae
Wang, Xun
Liu, Weisong
Druhl, Emily
Sung, Minhee L.
Reisman, Joel I.
Li, Wenjun
Kerns, Robert D.
Becker, William
Yu, Hong
Computation and Language
Artificial Intelligence
Opioid related aberrant behaviors (ORABs) present novel risk factors for opioid overdose. This paper introduces a novel biomedical natural language processing benchmark dataset named ODD, for ORAB Detection Dataset. ODD is an expert-annotated dataset designed to identify ORABs from patients' EHR notes and classify them into nine categories; 1) Confirmed Aberrant Behavior, 2) Suggested Aberrant Behavior, 3) Opioids, 4) Indication, 5) Diagnosed opioid dependency, 6) Benzodiazepines, 7) Medication Changes, 8) Central Nervous System-related, and 9) Social Determinants of Health. We explored two state-of-the-art natural language processing models (fine-tuning and prompt-tuning approaches) to identify ORAB. Experimental results show that the prompt-tuning models outperformed the fine-tuning models in most categories and the gains were especially higher among uncommon categories (Suggested Aberrant Behavior, Confirmed Aberrant Behaviors, Diagnosed Opioid Dependence, and Medication Change). Although the best model achieved the highest 88.17% on macro average area under precision recall curve, uncommon classes still have a large room for performance improvement. ODD is publicly available.
title ODD: A Benchmark Dataset for the Natural Language Processing based Opioid Related Aberrant Behavior Detection
topic Computation and Language
Artificial Intelligence
url https://arxiv.org/abs/2307.02591