Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Li, Deyi, Yao, Zijun, Xu, Qi, Liang, Muxuan, Li, Lingyao, Xu, Zijian, Liu, Mei
Format: Preprint
Veröffentlicht: 2026
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2603.10180
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866912960049315840
author Li, Deyi
Yao, Zijun
Xu, Qi
Liang, Muxuan
Li, Lingyao
Xu, Zijian
Liu, Mei
author_facet Li, Deyi
Yao, Zijun
Xu, Qi
Liang, Muxuan
Li, Lingyao
Xu, Zijian
Liu, Mei
contents The growing adoption of electronic health record (EHR) systems has provided unprecedented opportunities for predictive modeling to guide clinical decision making. Structured EHRs contain longitudinal observations of patients across hospital visits, where each visit is represented by a set of medical codes. While sequence-based, graph-based, and graph-enhanced sequence approaches have been developed to capture rich code interactions over time or within the same visits, they often overlook the inherent heterogeneous roles of medical codes arising from distinct clinical characteristics and contexts. To this end, in this study we propose the Disease Trajectory-aware Transformer for EHR (DT-BEHRT), a graph-enhanced sequential architecture that disentangles disease trajectories by explicitly modeling diagnosis-centric interactions within organ systems and capturing asynchronous progression patterns. To further enhance the representation robustness, we design a tailored pre-training methodology that combines trajectory-level code masking with ontology-informed ancestor prediction, promoting semantic alignment across multiple modeling modules. Extensive experiments on multiple benchmark datasets demonstrate that DT-BEHRT achieves strong predictive performance and provides interpretable patient representations that align with clinicians' disease-centered reasoning. The source code is publicly accessible at https://github.com/GatorAIM/DT-BEHRT.git.
format Preprint
id arxiv_https___arxiv_org_abs_2603_10180
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle DT-BEHRT: Disease Trajectory-aware Transformer for Interpretable Patient Representation Learning
Li, Deyi
Yao, Zijun
Xu, Qi
Liang, Muxuan
Li, Lingyao
Xu, Zijian
Liu, Mei
Machine Learning
The growing adoption of electronic health record (EHR) systems has provided unprecedented opportunities for predictive modeling to guide clinical decision making. Structured EHRs contain longitudinal observations of patients across hospital visits, where each visit is represented by a set of medical codes. While sequence-based, graph-based, and graph-enhanced sequence approaches have been developed to capture rich code interactions over time or within the same visits, they often overlook the inherent heterogeneous roles of medical codes arising from distinct clinical characteristics and contexts. To this end, in this study we propose the Disease Trajectory-aware Transformer for EHR (DT-BEHRT), a graph-enhanced sequential architecture that disentangles disease trajectories by explicitly modeling diagnosis-centric interactions within organ systems and capturing asynchronous progression patterns. To further enhance the representation robustness, we design a tailored pre-training methodology that combines trajectory-level code masking with ontology-informed ancestor prediction, promoting semantic alignment across multiple modeling modules. Extensive experiments on multiple benchmark datasets demonstrate that DT-BEHRT achieves strong predictive performance and provides interpretable patient representations that align with clinicians' disease-centered reasoning. The source code is publicly accessible at https://github.com/GatorAIM/DT-BEHRT.git.
title DT-BEHRT: Disease Trajectory-aware Transformer for Interpretable Patient Representation Learning
topic Machine Learning
url https://arxiv.org/abs/2603.10180