Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Ndulue, Nimrod Millenium, Millan-Romera, Jose Andres, Giorgi, Matteo, Voos, Holger, Sanchez-Lopez, Jose Luis
Format: Preprint
Veröffentlicht: 2026
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2604.27821
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866913076912062464
author Ndulue, Nimrod Millenium
Millan-Romera, Jose Andres
Giorgi, Matteo
Voos, Holger
Sanchez-Lopez, Jose Luis
author_facet Ndulue, Nimrod Millenium
Millan-Romera, Jose Andres
Giorgi, Matteo
Voos, Holger
Sanchez-Lopez, Jose Luis
contents Accurate localization is a fundamental requirement for autonomous robots operating in indoor environments. Scene graphs encode the spatial structure of an environment as a hierarchy of semantic entities and their relationships, and can be constructed both online from robot sensor data and offline from architectural priors such as Building Information Models (BIM). Matching these two complementary representations enables drift correction in SLAM by grounding robot observations against a known structural prior. However, establishing reliable node-to-node correspondences between them remains an open challenge: existing combinatorial methods are prohibitively expensive at scale, and prior learned approaches address only flat graph matching, ignoring the multi-level semantic structure present in both representations. Here we present a learned, end-to-end differentiable pipeline that augments both graphs with semantically motivated edge types encoding intra- and inter- level relationships, explicitly exploiting this hierarchy to enable simultaneous matching from high-level room concepts down to low-level wall surfaces. Trained exclusively on floor plans, the proposed method outperforms the combinatorial baseline in F1 on real LiDAR environments while running an order of magnitude faster, demonstrating viable zero-shot generalization for BIM-assisted robot localization.
format Preprint
id arxiv_https___arxiv_org_abs_2604_27821
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Learning-Based Hierarchical Scene Graph Matching for Robot Localization Leveraging Prior Maps
Ndulue, Nimrod Millenium
Millan-Romera, Jose Andres
Giorgi, Matteo
Voos, Holger
Sanchez-Lopez, Jose Luis
Robotics
Accurate localization is a fundamental requirement for autonomous robots operating in indoor environments. Scene graphs encode the spatial structure of an environment as a hierarchy of semantic entities and their relationships, and can be constructed both online from robot sensor data and offline from architectural priors such as Building Information Models (BIM). Matching these two complementary representations enables drift correction in SLAM by grounding robot observations against a known structural prior. However, establishing reliable node-to-node correspondences between them remains an open challenge: existing combinatorial methods are prohibitively expensive at scale, and prior learned approaches address only flat graph matching, ignoring the multi-level semantic structure present in both representations. Here we present a learned, end-to-end differentiable pipeline that augments both graphs with semantically motivated edge types encoding intra- and inter- level relationships, explicitly exploiting this hierarchy to enable simultaneous matching from high-level room concepts down to low-level wall surfaces. Trained exclusively on floor plans, the proposed method outperforms the combinatorial baseline in F1 on real LiDAR environments while running an order of magnitude faster, demonstrating viable zero-shot generalization for BIM-assisted robot localization.
title Learning-Based Hierarchical Scene Graph Matching for Robot Localization Leveraging Prior Maps
topic Robotics
url https://arxiv.org/abs/2604.27821