Bibliographic Details
Main Authors: Jin, Jiarui; Wang, Haoyu; Wu, Xingliang; Fang, Xiaocheng; Lan, Xiang; Wang, Zihan; Zhang, Deyun; Liu, Bo; Zhang, Yingying; Wu, Xian; Li, Hongyan; Hong, Shenda
Format: Preprint
Published: 2026
Online Access: https://arxiv.org/abs/2602.04279
Abstract:
  • Electrocardiography (ECG) serves as an indispensable diagnostic tool in clinical practice, yet existing multimodal large language models (MLLMs) remain unreliable for ECG interpretation, often producing plausible but clinically incorrect analyses. To address this, we propose ECG-R1, the first reasoning ECG MLLM designed for reliable ECG interpretation via three innovations. First, we construct the interpretation corpus using Protocol-Guided Instruction Data Generation, grounding interpretation in measurable ECG features and monograph-defined quantitative thresholds and diagnostic logic. Second, we present a modality-decoupled architecture with Interleaved Modality Dropout to improve robustness and cross-modal consistency when either the ECG signal or ECG image is missing. Third, we present Reinforcement Learning with ECG Diagnostic Evidence Rewards to strengthen evidence-grounded ECG interpretation. Additionally, we systematically evaluate the ECG interpretation capabilities of proprietary, open-source, and medical MLLMs, and provide the first quantitative evidence that severe hallucinations are widespread, suggesting that the public should not directly trust these outputs without independent verification. Code is available at https://github.com/PKUDigitalHealth/ECG-R1.