Saved in:
Bibliographic Details
Main Authors: Wang, Chunshi, Zhao, Bin, Ding, Shuxue
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2507.20809
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916867308781568
author Wang, Chunshi
Zhao, Bin
Ding, Shuxue
author_facet Wang, Chunshi
Zhao, Bin
Ding, Shuxue
contents Building footprint extraction holds immense significance in remote sensing image analysis and has great value in urban planning, land use, environmental protection and disaster assessment. Despite the progress made by conventional and deep learning approaches in this field, they continue to encounter significant challenges. This paper introduces a novel plug-and-play attention module, Split Coordinate Attention (SCA), which ingeniously captures spatially remote interactions by employing two spatial range of pooling kernels, strategically encoding each channel along x and y planes, and separately performs a series of split operations for each feature group, thus enabling more efficient semantic feature extraction. By inserting into a 2D CNN to form an effective SCANet, our SCANet outperforms recent SOTA methods on the public Wuhan University (WHU) Building Dataset and Massachusetts Building Dataset in terms of various metrics. Particularly SCANet achieves the best IoU, 91.61% and 75.49% for the two datasets. Our code is available at https://github.com/AiEson/SCANet
format Preprint
id arxiv_https___arxiv_org_abs_2507_20809
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle SCANet: Split Coordinate Attention Network for Building Footprint Extraction
Wang, Chunshi
Zhao, Bin
Ding, Shuxue
Computer Vision and Pattern Recognition
Building footprint extraction holds immense significance in remote sensing image analysis and has great value in urban planning, land use, environmental protection and disaster assessment. Despite the progress made by conventional and deep learning approaches in this field, they continue to encounter significant challenges. This paper introduces a novel plug-and-play attention module, Split Coordinate Attention (SCA), which ingeniously captures spatially remote interactions by employing two spatial range of pooling kernels, strategically encoding each channel along x and y planes, and separately performs a series of split operations for each feature group, thus enabling more efficient semantic feature extraction. By inserting into a 2D CNN to form an effective SCANet, our SCANet outperforms recent SOTA methods on the public Wuhan University (WHU) Building Dataset and Massachusetts Building Dataset in terms of various metrics. Particularly SCANet achieves the best IoU, 91.61% and 75.49% for the two datasets. Our code is available at https://github.com/AiEson/SCANet
title SCANet: Split Coordinate Attention Network for Building Footprint Extraction
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2507.20809