Saved in:
Bibliographic Details
Main Authors: Yu, Yinan, Gonzalez-Caceres, Alex, Scheidegger, Samuel, Somanath, Sanjay, Hollberg, Alexander
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2508.04406
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915431984398336
author Yu, Yinan
Gonzalez-Caceres, Alex
Scheidegger, Samuel
Somanath, Sanjay
Hollberg, Alexander
author_facet Yu, Yinan
Gonzalez-Caceres, Alex
Scheidegger, Samuel
Somanath, Sanjay
Hollberg, Alexander
contents Renovating existing buildings is essential for climate impact. Early-phase renovation planning requires simulations based on thermal 3D models at Level of Detail (LoD) 3, which include features like windows. However, scalable and accurate identification of such features remains a challenge. This paper presents the Scalable Image-to-3D Facade Parser (SI3FP), a pipeline that generates LoD3 thermal models by extracting geometries from images using both computer vision and deep learning. Unlike existing methods relying on segmentation and projection, SI3FP directly models geometric primitives in the orthographic image plane, providing a unified interface while reducing perspective distortions. SI3FP supports both sparse (e.g., Google Street View) and dense (e.g., hand-held camera) data sources. Tested on typical Swedish residential buildings, SI3FP achieved approximately 5% error in window-to-wall ratio estimates, demonstrating sufficient accuracy for early-stage renovation analysis. The pipeline facilitates large-scale energy renovation planning and has broader applications in urban development and planning.
format Preprint
id arxiv_https___arxiv_org_abs_2508_04406
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Deep Learning-based Scalable Image-to-3D Facade Parser for Generating Thermal 3D Building Models
Yu, Yinan
Gonzalez-Caceres, Alex
Scheidegger, Samuel
Somanath, Sanjay
Hollberg, Alexander
Computer Vision and Pattern Recognition
Artificial Intelligence
Renovating existing buildings is essential for climate impact. Early-phase renovation planning requires simulations based on thermal 3D models at Level of Detail (LoD) 3, which include features like windows. However, scalable and accurate identification of such features remains a challenge. This paper presents the Scalable Image-to-3D Facade Parser (SI3FP), a pipeline that generates LoD3 thermal models by extracting geometries from images using both computer vision and deep learning. Unlike existing methods relying on segmentation and projection, SI3FP directly models geometric primitives in the orthographic image plane, providing a unified interface while reducing perspective distortions. SI3FP supports both sparse (e.g., Google Street View) and dense (e.g., hand-held camera) data sources. Tested on typical Swedish residential buildings, SI3FP achieved approximately 5% error in window-to-wall ratio estimates, demonstrating sufficient accuracy for early-stage renovation analysis. The pipeline facilitates large-scale energy renovation planning and has broader applications in urban development and planning.
title Deep Learning-based Scalable Image-to-3D Facade Parser for Generating Thermal 3D Building Models
topic Computer Vision and Pattern Recognition
Artificial Intelligence
url https://arxiv.org/abs/2508.04406