Saved in:
Bibliographic Details
Main Authors: Lansiaux, Edouard, Azzouz, Ramy, Chazard, Emmanuel, Vromant, Amélie, Wiel, Eric
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2507.01080
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918351900508160
author Lansiaux, Edouard
Azzouz, Ramy
Chazard, Emmanuel
Vromant, Amélie
Wiel, Eric
author_facet Lansiaux, Edouard
Azzouz, Ramy
Chazard, Emmanuel
Vromant, Amélie
Wiel, Eric
contents Emergency departments struggle with persistent triage errors, especially undertriage and overtriage, which are aggravated by growing patient volumes and staff shortages. This study evaluated three AI models [TRIAGEMASTER (NLP), URGENTIAPARSE (LLM), and EMERGINET (JEPA)] against the FRENCH triage scale and nurse practice, using seven months of adult triage data from Roger Salengro Hospital in Lille, France. Among the models, the LLM-based URGENTIAPARSE consistently outperformed both AI alternatives and nurse triage, achieving the highest accuracy (F1-score 0.900, AUC-ROC 0.879) and superior performance in predicting hospitalization needs (GEMSA). Its robustness across structured data and raw transcripts highlighted the advantage of LLM architectures in abstracting patient information. Overall, the findings suggest that integrating LLM-based AI into emergency department workflows could significantly enhance patient safety and operational efficiency, though successful adoption will depend on addressing limitations and ensuring ethical transparency.
format Preprint
id arxiv_https___arxiv_org_abs_2507_01080
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Development and Comparative Evaluation of Three Artificial Intelligence Models (NLP, LLM, JEPA) for Predicting Triage in Emergency Departments: A 7-Month Retrospective Proof-of-Concept
Lansiaux, Edouard
Azzouz, Ramy
Chazard, Emmanuel
Vromant, Amélie
Wiel, Eric
Machine Learning
Performance
Emergency departments struggle with persistent triage errors, especially undertriage and overtriage, which are aggravated by growing patient volumes and staff shortages. This study evaluated three AI models [TRIAGEMASTER (NLP), URGENTIAPARSE (LLM), and EMERGINET (JEPA)] against the FRENCH triage scale and nurse practice, using seven months of adult triage data from Roger Salengro Hospital in Lille, France. Among the models, the LLM-based URGENTIAPARSE consistently outperformed both AI alternatives and nurse triage, achieving the highest accuracy (F1-score 0.900, AUC-ROC 0.879) and superior performance in predicting hospitalization needs (GEMSA). Its robustness across structured data and raw transcripts highlighted the advantage of LLM architectures in abstracting patient information. Overall, the findings suggest that integrating LLM-based AI into emergency department workflows could significantly enhance patient safety and operational efficiency, though successful adoption will depend on addressing limitations and ensuring ethical transparency.
title Development and Comparative Evaluation of Three Artificial Intelligence Models (NLP, LLM, JEPA) for Predicting Triage in Emergency Departments: A 7-Month Retrospective Proof-of-Concept
topic Machine Learning
Performance
url https://arxiv.org/abs/2507.01080