Saved in:
Bibliographic Details
Main Authors: Yang, Haowei, Shen, Ziyu, Shao, Junli, Men, Luyao, Han, Xinyue, Dong, Jing
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2507.11052
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912484166729728
author Yang, Haowei
Shen, Ziyu
Shao, Junli
Men, Luyao
Han, Xinyue
Dong, Jing
author_facet Yang, Haowei
Shen, Ziyu
Shao, Junli
Men, Luyao
Han, Xinyue
Dong, Jing
contents Timely identification and accurate risk stratification of cardiovascular disease (CVD) remain essential for reducing global mortality. While existing prediction models primarily leverage structured data, unstructured clinical notes contain valuable early indicators. This study introduces a novel LLM-augmented clinical NLP pipeline that employs domain-adapted large language models for symptom extraction, contextual reasoning, and correlation from free-text reports. Our approach integrates cardiovascular-specific fine-tuning, prompt-based inference, and entity-aware reasoning. Evaluations on MIMIC-III and CARDIO-NLP datasets demonstrate improved performance in precision, recall, F1-score, and AUROC, with high clinical relevance (kappa = 0.82) assessed by cardiologists. Challenges such as contextual hallucination, which occurs when plausible information contracts with provided source, and temporal ambiguity, which is related with models struggling with chronological ordering of events are addressed using prompt engineering and hybrid rule-based verification. This work underscores the potential of LLMs in clinical decision support systems (CDSS), advancing early warning systems and enhancing the translation of patient narratives into actionable risk assessments.
format Preprint
id arxiv_https___arxiv_org_abs_2507_11052
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle LLM-Augmented Symptom Analysis for Cardiovascular Disease Risk Prediction: A Clinical NLP
Yang, Haowei
Shen, Ziyu
Shao, Junli
Men, Luyao
Han, Xinyue
Dong, Jing
Computation and Language
Artificial Intelligence
Timely identification and accurate risk stratification of cardiovascular disease (CVD) remain essential for reducing global mortality. While existing prediction models primarily leverage structured data, unstructured clinical notes contain valuable early indicators. This study introduces a novel LLM-augmented clinical NLP pipeline that employs domain-adapted large language models for symptom extraction, contextual reasoning, and correlation from free-text reports. Our approach integrates cardiovascular-specific fine-tuning, prompt-based inference, and entity-aware reasoning. Evaluations on MIMIC-III and CARDIO-NLP datasets demonstrate improved performance in precision, recall, F1-score, and AUROC, with high clinical relevance (kappa = 0.82) assessed by cardiologists. Challenges such as contextual hallucination, which occurs when plausible information contracts with provided source, and temporal ambiguity, which is related with models struggling with chronological ordering of events are addressed using prompt engineering and hybrid rule-based verification. This work underscores the potential of LLMs in clinical decision support systems (CDSS), advancing early warning systems and enhancing the translation of patient narratives into actionable risk assessments.
title LLM-Augmented Symptom Analysis for Cardiovascular Disease Risk Prediction: A Clinical NLP
topic Computation and Language
Artificial Intelligence
url https://arxiv.org/abs/2507.11052