Saved in:
Bibliographic Details
Main Authors: Kim, Jiyeong, Ma, Stephen P., Chen, Michael L., Galatzer-Levy, Isaac R., Torous, John, van Roessel, Peter J., Sharp, Christopher, Pfeffer, Michael A., Rodriguez, Carolyn I., Linos, Eleni, Chen, Jonathan H.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2503.11384
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915198298750976
author Kim, Jiyeong
Ma, Stephen P.
Chen, Michael L.
Galatzer-Levy, Isaac R.
Torous, John
van Roessel, Peter J.
Sharp, Christopher
Pfeffer, Michael A.
Rodriguez, Carolyn I.
Linos, Eleni
Chen, Jonathan H.
author_facet Kim, Jiyeong
Ma, Stephen P.
Chen, Michael L.
Galatzer-Levy, Isaac R.
Torous, John
van Roessel, Peter J.
Sharp, Christopher
Pfeffer, Michael A.
Rodriguez, Carolyn I.
Linos, Eleni
Chen, Jonathan H.
contents Patients with diabetes are at increased risk of comorbid depression or anxiety, complicating their management. This study evaluated the performance of large language models (LLMs) in detecting these symptoms from secure patient messages. We applied multiple approaches, including engineered prompts, systemic persona, temperature adjustments, and zero-shot and few-shot learning, to identify the best-performing model and enhance performance. Three out of five LLMs demonstrated excellent performance (over 90% of F-1 and accuracy), with Llama 3.1 405B achieving 93% in both F-1 and accuracy using a zero-shot approach. While LLMs showed promise in binary classification and handling complex metrics like Patient Health Questionnaire-4, inconsistencies in challenging cases warrant further real-life assessment. The findings highlight the potential of LLMs to assist in timely screening and referrals, providing valuable empirical knowledge for real-world triage systems that could improve mental health care for patients with chronic diseases.
format Preprint
id arxiv_https___arxiv_org_abs_2503_11384
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Optimizing Large Language Models for Detecting Symptoms of Comorbid Depression or Anxiety in Chronic Diseases: Insights from Patient Messages
Kim, Jiyeong
Ma, Stephen P.
Chen, Michael L.
Galatzer-Levy, Isaac R.
Torous, John
van Roessel, Peter J.
Sharp, Christopher
Pfeffer, Michael A.
Rodriguez, Carolyn I.
Linos, Eleni
Chen, Jonathan H.
Artificial Intelligence
Computation and Language
Patients with diabetes are at increased risk of comorbid depression or anxiety, complicating their management. This study evaluated the performance of large language models (LLMs) in detecting these symptoms from secure patient messages. We applied multiple approaches, including engineered prompts, systemic persona, temperature adjustments, and zero-shot and few-shot learning, to identify the best-performing model and enhance performance. Three out of five LLMs demonstrated excellent performance (over 90% of F-1 and accuracy), with Llama 3.1 405B achieving 93% in both F-1 and accuracy using a zero-shot approach. While LLMs showed promise in binary classification and handling complex metrics like Patient Health Questionnaire-4, inconsistencies in challenging cases warrant further real-life assessment. The findings highlight the potential of LLMs to assist in timely screening and referrals, providing valuable empirical knowledge for real-world triage systems that could improve mental health care for patients with chronic diseases.
title Optimizing Large Language Models for Detecting Symptoms of Comorbid Depression or Anxiety in Chronic Diseases: Insights from Patient Messages
topic Artificial Intelligence
Computation and Language
url https://arxiv.org/abs/2503.11384