Saved in:
| Main Authors: | , , , , , , , , , , |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.11536 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1866910019976429568 |
|---|---|
| author | Huang, De-Xing Yu, Chaohui Zhou, Xiao-Hu Xiang, Tian-Yu Zhang, Qin-Yi Gui, Mei-Jiang Ma, Rui-Ze Wang, Chen-Yu Xiao, Nu-Fang Wang, Fan Hou, Zeng-Guang |
| author_facet | Huang, De-Xing Yu, Chaohui Zhou, Xiao-Hu Xiang, Tian-Yu Zhang, Qin-Yi Gui, Mei-Jiang Ma, Rui-Ze Wang, Chen-Yu Xiao, Nu-Fang Wang, Fan Hou, Zeng-Guang |
| contents | X-ray angiography is the gold standard imaging modality for cardiovascular diseases. However, current deep learning approaches for X-ray angiogram analysis are severely constrained by the scarcity of annotated data. While large-scale self-supervised learning (SSL) has emerged as a promising solution, its potential in this domain remains largely unexplored, primarily due to the lack of effective SSL frameworks and large-scale datasets. To bridge this gap, we introduce a vascular anatomy-aware masked image modeling (VasoMIM) framework that explicitly integrates domain-specific anatomical knowledge. Specifically, VasoMIM comprises two key designs: an anatomy-guided masking strategy and an anatomical consistency loss. The former strategically masks vessel-containing patches to compel the model to learn robust vascular semantics, while the latter preserves structural consistency of vessels between original and reconstructed images, enhancing the discriminability of the learned representations. In conjunction with VasoMIM, we curate XA-170K, the largest X-ray angiogram pre-training dataset to date. We validate VasoMIM on four downstream tasks across six datasets, where it demonstrates superior transferability and achieves state-of-the-art performance compared to existing methods. These findings highlight the significant potential of VasoMIM as a foundation model for advancing a wide range of X-ray angiogram analysis tasks. VasoMIM and XA-170K will be available at https://github.com/Dxhuang-CASIA/XA-SSL. |
| format | Preprint |
| id |
arxiv_https___arxiv_org_abs_2602_11536 |
| institution | arXiv |
| publishDate | 2026 |
| record_format | arxiv |
| spellingShingle | Vascular anatomy-aware self-supervised pre-training for X-ray angiogram analysis Huang, De-Xing Yu, Chaohui Zhou, Xiao-Hu Xiang, Tian-Yu Zhang, Qin-Yi Gui, Mei-Jiang Ma, Rui-Ze Wang, Chen-Yu Xiao, Nu-Fang Wang, Fan Hou, Zeng-Guang Computer Vision and Pattern Recognition X-ray angiography is the gold standard imaging modality for cardiovascular diseases. However, current deep learning approaches for X-ray angiogram analysis are severely constrained by the scarcity of annotated data. While large-scale self-supervised learning (SSL) has emerged as a promising solution, its potential in this domain remains largely unexplored, primarily due to the lack of effective SSL frameworks and large-scale datasets. To bridge this gap, we introduce a vascular anatomy-aware masked image modeling (VasoMIM) framework that explicitly integrates domain-specific anatomical knowledge. Specifically, VasoMIM comprises two key designs: an anatomy-guided masking strategy and an anatomical consistency loss. The former strategically masks vessel-containing patches to compel the model to learn robust vascular semantics, while the latter preserves structural consistency of vessels between original and reconstructed images, enhancing the discriminability of the learned representations. In conjunction with VasoMIM, we curate XA-170K, the largest X-ray angiogram pre-training dataset to date. We validate VasoMIM on four downstream tasks across six datasets, where it demonstrates superior transferability and achieves state-of-the-art performance compared to existing methods. These findings highlight the significant potential of VasoMIM as a foundation model for advancing a wide range of X-ray angiogram analysis tasks. VasoMIM and XA-170K will be available at https://github.com/Dxhuang-CASIA/XA-SSL. |
| title | Vascular anatomy-aware self-supervised pre-training for X-ray angiogram analysis |
| topic | Computer Vision and Pattern Recognition |
| url | https://arxiv.org/abs/2602.11536 |