Saved in:
| Main Authors: | , , , , , , |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.09333 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1866913133046530048 |
|---|---|
| author | Wu, Weiyi Diao, Xingjian Zhang, Chunhui Gao, Chongyang Xu, Xinwen Li, Siting Gui, Jiang |
| author_facet | Wu, Weiyi Diao, Xingjian Zhang, Chunhui Gao, Chongyang Xu, Xinwen Li, Siting Gui, Jiang |
| contents | Whole slide images (WSIs) pose fundamental computational challenges due to their gigapixel resolution and the sparse distribution of informative regions. Existing approaches often treat image patches independently or reshape them in ways that distort spatial context, thereby obscuring the hierarchical pyramid representations intrinsic to WSIs. We introduce Sparse Pyramid Attention Networks (SPAN), a hierarchical framework that preserves spatial relationships while allocating computation to informative regions. SPAN constructs multi-scale representations directly from single-scale inputs, enabling precise hierarchical modeling of WSI data. We demonstrate SPAN's versatility through two variants: SPAN-MIL for slide classification and SPAN-UNet for segmentation. Comprehensive evaluations across multiple public datasets show that SPAN effectively captures hierarchical structure and contextual relationships. Our results provide clear evidence that architectural inductive biases and hierarchical representations enhance both slide-level and patch-level performance. By addressing key computational challenges in WSI analysis, SPAN provides an effective framework for computational pathology and demonstrates important design principles for large-scale medical image analysis. |
| format | Preprint |
| id |
arxiv_https___arxiv_org_abs_2406_09333 |
| institution | arXiv |
| publishDate | 2024 |
| record_format | arxiv |
| spellingShingle | Learning Spatial-Preserving Hierarchical Representations for Digital Pathology Wu, Weiyi Diao, Xingjian Zhang, Chunhui Gao, Chongyang Xu, Xinwen Li, Siting Gui, Jiang Computer Vision and Pattern Recognition Whole slide images (WSIs) pose fundamental computational challenges due to their gigapixel resolution and the sparse distribution of informative regions. Existing approaches often treat image patches independently or reshape them in ways that distort spatial context, thereby obscuring the hierarchical pyramid representations intrinsic to WSIs. We introduce Sparse Pyramid Attention Networks (SPAN), a hierarchical framework that preserves spatial relationships while allocating computation to informative regions. SPAN constructs multi-scale representations directly from single-scale inputs, enabling precise hierarchical modeling of WSI data. We demonstrate SPAN's versatility through two variants: SPAN-MIL for slide classification and SPAN-UNet for segmentation. Comprehensive evaluations across multiple public datasets show that SPAN effectively captures hierarchical structure and contextual relationships. Our results provide clear evidence that architectural inductive biases and hierarchical representations enhance both slide-level and patch-level performance. By addressing key computational challenges in WSI analysis, SPAN provides an effective framework for computational pathology and demonstrates important design principles for large-scale medical image analysis. |
| title | Learning Spatial-Preserving Hierarchical Representations for Digital Pathology |
| topic | Computer Vision and Pattern Recognition |
| url | https://arxiv.org/abs/2406.09333 |