Saved in:
Bibliographic Details
Main Authors: Kamran, Muhammad, Sheikholeslami, Mohammad Moein, Wichmann, Andreas, Sohn, Gunho
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2507.17038
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918102575349760
author Kamran, Muhammad
Sheikholeslami, Mohammad Moein
Wichmann, Andreas
Sohn, Gunho
author_facet Kamran, Muhammad
Sheikholeslami, Mohammad Moein
Wichmann, Andreas
Sohn, Gunho
contents In recent years, the number of remote satellites orbiting the Earth has grown significantly, streaming vast amounts of high-resolution visual data to support diverse applications across civil, public, and military domains. Among these applications, the generation and updating of spatial maps of the built environment have become critical due to the extensive coverage and detailed imagery provided by satellites. However, reconstructing spatial maps from satellite imagery is a complex computer vision task, requiring the creation of high-level object representations, such as primitives, to accurately capture the built environment. While the past decade has witnessed remarkable advancements in object detection and representation using visual data, primitives-based object representation remains a persistent challenge in computer vision. Consequently, high-quality spatial maps often rely on labor-intensive and manual processes. This paper introduces a novel deep learning methodology leveraging Graph Convolutional Networks (GCNs) to address these challenges in building footprint reconstruction. The proposed approach enhances performance by incorporating geometric regularity into building boundaries, integrating multi-scale and multi-resolution features, and embedding Attraction Field Maps into the network. These innovations provide a scalable and precise solution for automated building footprint extraction from a single satellite image, paving the way for impactful applications in urban planning, disaster management, and large-scale spatial analysis. Our model, Decoupled-PolyGCN, outperforms existing methods by 6% in AP and 10% in AR, demonstrating its ability to deliver accurate and regularized building footprints across diverse and challenging scenarios.
format Preprint
id arxiv_https___arxiv_org_abs_2507_17038
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Transformer Based Building Boundary Reconstruction using Attraction Field Maps
Kamran, Muhammad
Sheikholeslami, Mohammad Moein
Wichmann, Andreas
Sohn, Gunho
Computer Vision and Pattern Recognition
In recent years, the number of remote satellites orbiting the Earth has grown significantly, streaming vast amounts of high-resolution visual data to support diverse applications across civil, public, and military domains. Among these applications, the generation and updating of spatial maps of the built environment have become critical due to the extensive coverage and detailed imagery provided by satellites. However, reconstructing spatial maps from satellite imagery is a complex computer vision task, requiring the creation of high-level object representations, such as primitives, to accurately capture the built environment. While the past decade has witnessed remarkable advancements in object detection and representation using visual data, primitives-based object representation remains a persistent challenge in computer vision. Consequently, high-quality spatial maps often rely on labor-intensive and manual processes. This paper introduces a novel deep learning methodology leveraging Graph Convolutional Networks (GCNs) to address these challenges in building footprint reconstruction. The proposed approach enhances performance by incorporating geometric regularity into building boundaries, integrating multi-scale and multi-resolution features, and embedding Attraction Field Maps into the network. These innovations provide a scalable and precise solution for automated building footprint extraction from a single satellite image, paving the way for impactful applications in urban planning, disaster management, and large-scale spatial analysis. Our model, Decoupled-PolyGCN, outperforms existing methods by 6% in AP and 10% in AR, demonstrating its ability to deliver accurate and regularized building footprints across diverse and challenging scenarios.
title Transformer Based Building Boundary Reconstruction using Attraction Field Maps
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2507.17038