Saved in:
Bibliographic Details
Main Authors: Chen, Yang, Chen, Xieyuanli, Li, Junxiang, Tang, Jie, Wu, Tao
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.09377
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918380903071744
author Chen, Yang
Chen, Xieyuanli
Li, Junxiang
Tang, Jie
Wu, Tao
author_facet Chen, Yang
Chen, Xieyuanli
Li, Junxiang
Tang, Jie
Wu, Tao
contents Robust cross-view geo-localization (CVGL) remains challenging despite the surge in recent progress. Existing methods still rely on field-of-view (FoV)-specific training paradigms, where models are optimized under a fixed FoV but collapse when tested on unseen FoVs and unknown orientations. This limitation necessitates deploying multiple models to cover diverse variations. Although studies have explored dynamic FoV training by simply randomizing FoVs, they failed to achieve robustness across diverse conditions -- implicitly assuming all FoVs are equally difficult. To address this gap, we present SinGeo, a simple yet powerful framework that enables a single model to realize robust cross-view geo-localization without additional modules or explicit transformations. SinGeo employs a dual discriminative learning architecture that enhances intra-view discriminability within both ground and satellite branches, and is the first to introduce a curriculum learning strategy to achieve robust CVGL. Extensive evaluations on four benchmark datasets reveal that SinGeo sets state-of-the-art (SOTA) results under diverse conditions, and notably outperforms methods specifically trained for extreme FoVs. Beyond superior performance, SinGeo also exhibits cross-architecture transferability. Furthermore, we propose a consistency evaluation method to quantitatively assess model stability under varying views, providing an explainable perspective for understanding and advancing robustness in future CVGL research. Codes will be available upon acceptance.
format Preprint
id arxiv_https___arxiv_org_abs_2603_09377
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle SinGeo: Unlock Single Model's Potential for Robust Cross-View Geo-Localization
Chen, Yang
Chen, Xieyuanli
Li, Junxiang
Tang, Jie
Wu, Tao
Computer Vision and Pattern Recognition
Robust cross-view geo-localization (CVGL) remains challenging despite the surge in recent progress. Existing methods still rely on field-of-view (FoV)-specific training paradigms, where models are optimized under a fixed FoV but collapse when tested on unseen FoVs and unknown orientations. This limitation necessitates deploying multiple models to cover diverse variations. Although studies have explored dynamic FoV training by simply randomizing FoVs, they failed to achieve robustness across diverse conditions -- implicitly assuming all FoVs are equally difficult. To address this gap, we present SinGeo, a simple yet powerful framework that enables a single model to realize robust cross-view geo-localization without additional modules or explicit transformations. SinGeo employs a dual discriminative learning architecture that enhances intra-view discriminability within both ground and satellite branches, and is the first to introduce a curriculum learning strategy to achieve robust CVGL. Extensive evaluations on four benchmark datasets reveal that SinGeo sets state-of-the-art (SOTA) results under diverse conditions, and notably outperforms methods specifically trained for extreme FoVs. Beyond superior performance, SinGeo also exhibits cross-architecture transferability. Furthermore, we propose a consistency evaluation method to quantitatively assess model stability under varying views, providing an explainable perspective for understanding and advancing robustness in future CVGL research. Codes will be available upon acceptance.
title SinGeo: Unlock Single Model's Potential for Robust Cross-View Geo-Localization
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2603.09377