Saved in:
Bibliographic Details
Main Authors: Jiao, Yang, Qiu, Yiding, Christensen, Henrik I.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2509.21602
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915514707607552
author Jiao, Yang
Qiu, Yiding
Christensen, Henrik I.
author_facet Jiao, Yang
Qiu, Yiding
Christensen, Henrik I.
contents Object-level Simultaneous Localization and Mapping (SLAM), which incorporates semantic information for high-level scene understanding, faces challenges of under-constrained optimization due to sparse observations. Prior work has introduced additional constraints using commonsense knowledge, but obtaining such priors has traditionally been labor-intensive and lacks generalizability across diverse object categories. We address this limitation by leveraging large language models (LLMs) to provide commonsense knowledge of object geometric attributes, specifically size and orientation, as prior factors in a graph-based SLAM framework. These priors are particularly beneficial during the initial phase when object observations are limited. We implement a complete pipeline integrating these priors, achieving robust data association on sparse object-level features and enabling real-time object SLAM. Our system, evaluated on the TUM RGB-D and 3RScan datasets, improves mapping accuracy by 36.8\% over the latest baseline. Additionally, we present real-world experiments in the supplementary video, demonstrating its real-time performance.
format Preprint
id arxiv_https___arxiv_org_abs_2509_21602
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Real-Time Indoor Object SLAM with LLM-Enhanced Priors
Jiao, Yang
Qiu, Yiding
Christensen, Henrik I.
Robotics
Object-level Simultaneous Localization and Mapping (SLAM), which incorporates semantic information for high-level scene understanding, faces challenges of under-constrained optimization due to sparse observations. Prior work has introduced additional constraints using commonsense knowledge, but obtaining such priors has traditionally been labor-intensive and lacks generalizability across diverse object categories. We address this limitation by leveraging large language models (LLMs) to provide commonsense knowledge of object geometric attributes, specifically size and orientation, as prior factors in a graph-based SLAM framework. These priors are particularly beneficial during the initial phase when object observations are limited. We implement a complete pipeline integrating these priors, achieving robust data association on sparse object-level features and enabling real-time object SLAM. Our system, evaluated on the TUM RGB-D and 3RScan datasets, improves mapping accuracy by 36.8\% over the latest baseline. Additionally, we present real-world experiments in the supplementary video, demonstrating its real-time performance.
title Real-Time Indoor Object SLAM with LLM-Enhanced Priors
topic Robotics
url https://arxiv.org/abs/2509.21602