Saved in:
Bibliographic Details
Main Authors: Cheng, Jintao, Li, Weibin, He, Zhijian, Wu, Jin, Vong, Chi Man, Zhang, Wei
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.00841
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Visual Place Recognition (VPR) demands representations robust to drastic environmental and viewpoint shifts. Existing aggregation paradigms either depend on extensive supervised training or rely on first-order pooling, often struggling to preserve structural correlations under extreme shifts or incurring high adaptation costs. In this work, we propose Riemannian Invariant Aggregation (RIA), a unified geometric framework that explicitly models second-order scene structure on the Symmetric Positive Definite (SPD) manifold. By treating perturbations as tractable congruence transformations, RIA leverages geometry-aware Riemannian mappings to project covariance descriptors into a linearized Euclidean space, effectively preserving invariant structural components while suppressing noise. Extensive evaluations demonstrate that RIA achieves zero-shot performance comparable to supervised methods, and establishes state-of-the-art accuracy with simple fine-tuning, particularly in unstructured environments. The source code will be released.