MARC21: :: Library Catalog

Salvato in:

Dettagli Bibliografici
Autori principali:	Edin, Joakim, Balaganeshan, Sedrah Butt, Kristensen, Annike Kjølby, Maaløe, Lars, Louloudis, Ioannis, Brunak, Søren
Natura:	Preprint
Pubblicazione:	2026
Soggetti:	Machine Learning 62P10 J.3
Accesso online:	https://arxiv.org/abs/2603.00221
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

_version_	1866911483023065088
author	Edin, Joakim Balaganeshan, Sedrah Butt Kristensen, Annike Kjølby Maaløe, Lars Louloudis, Ioannis Brunak, Søren
author_facet	Edin, Joakim Balaganeshan, Sedrah Butt Kristensen, Annike Kjølby Maaløe, Lars Louloudis, Ioannis Brunak, Søren
contents	Medical coding translates clinical documentation into standardized codes for billing, research, and public health, but manual coding is time-consuming and error-prone. Existing automation efforts rely on small datasets that poorly represent real-world patient heterogeneity. We trained a language model on 5.8 million electronic health records from 1.8 million patients across nearly all specialties in Eastern Denmark (2006--2016) to predict ICD-10 codes from clinical notes, medications, and laboratory results. Evaluated on 270,000 held-out patients, the model achieved a micro F1 of 71.8% and a top-10 recall of 95.5%. Performance varied by specialty (F1: 53--91%), with higher scores in specialties with well-defined diagnostic criteria. Codes appearing predominantly as secondary diagnoses had markedly lower F1 scores. For three such codes (suicide-related behaviors, weight disorders, and hypertension), the model identified thousands of uncoded cases, of which 76-86% were confirmed valid upon manual review, suggesting systematic under-coding rather than model error. These findings suggest under-coding of secondary diagnoses in Eastern Denmark during this period, with potential implications for epidemiological research, public health surveillance, and understanding of multimorbidity. Similar time constraints and reimbursement structures in other healthcare systems suggest this may not be isolated to this dataset. The model can automate coding for approximately 50% of cases and provide accurate suggestions for most others, and may offer a practical solution to help capture missed secondary conditions.
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_00221
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	A medical coding language model trained on clinical narratives from a population-wide cohort of 1.8 million patients Edin, Joakim Balaganeshan, Sedrah Butt Kristensen, Annike Kjølby Maaløe, Lars Louloudis, Ioannis Brunak, Søren Machine Learning 62P10 J.3 Medical coding translates clinical documentation into standardized codes for billing, research, and public health, but manual coding is time-consuming and error-prone. Existing automation efforts rely on small datasets that poorly represent real-world patient heterogeneity. We trained a language model on 5.8 million electronic health records from 1.8 million patients across nearly all specialties in Eastern Denmark (2006--2016) to predict ICD-10 codes from clinical notes, medications, and laboratory results. Evaluated on 270,000 held-out patients, the model achieved a micro F1 of 71.8% and a top-10 recall of 95.5%. Performance varied by specialty (F1: 53--91%), with higher scores in specialties with well-defined diagnostic criteria. Codes appearing predominantly as secondary diagnoses had markedly lower F1 scores. For three such codes (suicide-related behaviors, weight disorders, and hypertension), the model identified thousands of uncoded cases, of which 76-86% were confirmed valid upon manual review, suggesting systematic under-coding rather than model error. These findings suggest under-coding of secondary diagnoses in Eastern Denmark during this period, with potential implications for epidemiological research, public health surveillance, and understanding of multimorbidity. Similar time constraints and reimbursement structures in other healthcare systems suggest this may not be isolated to this dataset. The model can automate coding for approximately 50% of cases and provide accurate suggestions for most others, and may offer a practical solution to help capture missed secondary conditions.
title	A medical coding language model trained on clinical narratives from a population-wide cohort of 1.8 million patients
topic	Machine Learning 62P10 J.3
url	https://arxiv.org/abs/2603.00221

Documenti analoghi