Saved in:
Bibliographic Details
Main Authors: Wu, Weiyi, Diao, Xingjian, Zhang, Chunhui, Gao, Chongyang, Xu, Xinwen, Li, Siting, Gui, Jiang
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2406.09333
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913133046530048
author Wu, Weiyi
Diao, Xingjian
Zhang, Chunhui
Gao, Chongyang
Xu, Xinwen
Li, Siting
Gui, Jiang
author_facet Wu, Weiyi
Diao, Xingjian
Zhang, Chunhui
Gao, Chongyang
Xu, Xinwen
Li, Siting
Gui, Jiang
contents Whole slide images (WSIs) pose fundamental computational challenges due to their gigapixel resolution and the sparse distribution of informative regions. Existing approaches often treat image patches independently or reshape them in ways that distort spatial context, thereby obscuring the hierarchical pyramid representations intrinsic to WSIs. We introduce Sparse Pyramid Attention Networks (SPAN), a hierarchical framework that preserves spatial relationships while allocating computation to informative regions. SPAN constructs multi-scale representations directly from single-scale inputs, enabling precise hierarchical modeling of WSI data. We demonstrate SPAN's versatility through two variants: SPAN-MIL for slide classification and SPAN-UNet for segmentation. Comprehensive evaluations across multiple public datasets show that SPAN effectively captures hierarchical structure and contextual relationships. Our results provide clear evidence that architectural inductive biases and hierarchical representations enhance both slide-level and patch-level performance. By addressing key computational challenges in WSI analysis, SPAN provides an effective framework for computational pathology and demonstrates important design principles for large-scale medical image analysis.
format Preprint
id arxiv_https___arxiv_org_abs_2406_09333
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Learning Spatial-Preserving Hierarchical Representations for Digital Pathology
Wu, Weiyi
Diao, Xingjian
Zhang, Chunhui
Gao, Chongyang
Xu, Xinwen
Li, Siting
Gui, Jiang
Computer Vision and Pattern Recognition
Whole slide images (WSIs) pose fundamental computational challenges due to their gigapixel resolution and the sparse distribution of informative regions. Existing approaches often treat image patches independently or reshape them in ways that distort spatial context, thereby obscuring the hierarchical pyramid representations intrinsic to WSIs. We introduce Sparse Pyramid Attention Networks (SPAN), a hierarchical framework that preserves spatial relationships while allocating computation to informative regions. SPAN constructs multi-scale representations directly from single-scale inputs, enabling precise hierarchical modeling of WSI data. We demonstrate SPAN's versatility through two variants: SPAN-MIL for slide classification and SPAN-UNet for segmentation. Comprehensive evaluations across multiple public datasets show that SPAN effectively captures hierarchical structure and contextual relationships. Our results provide clear evidence that architectural inductive biases and hierarchical representations enhance both slide-level and patch-level performance. By addressing key computational challenges in WSI analysis, SPAN provides an effective framework for computational pathology and demonstrates important design principles for large-scale medical image analysis.
title Learning Spatial-Preserving Hierarchical Representations for Digital Pathology
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2406.09333