Saved in:
Bibliographic Details
Main Authors: Wen, Tao, Wang, Jiepeng, Chen, Yabo, Xu, Shugong, Zhang, Chi, Li, Xuelong
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2504.12103
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913796498391040
author Wen, Tao
Wang, Jiepeng
Chen, Yabo
Xu, Shugong
Zhang, Chi
Li, Xuelong
author_facet Wen, Tao
Wang, Jiepeng
Chen, Yabo
Xu, Shugong
Zhang, Chi
Li, Xuelong
contents Accurate and generalizable metric depth estimation is crucial for various computer vision applications but remains challenging due to the diverse depth scales encountered in indoor and outdoor environments. In this paper, we introduce Metric-Solver, a novel sliding anchor-based metric depth estimation method that dynamically adapts to varying scene scales. Our approach leverages an anchor-based representation, where a reference depth serves as an anchor to separate and normalize the scene depth into two components: scaled near-field depth and tapered far-field depth. The anchor acts as a normalization factor, enabling the near-field depth to be normalized within a consistent range while mapping far-field depth smoothly toward zero. Through this approach, any depth from zero to infinity in the scene can be represented within a unified representation, effectively eliminating the need to manually account for scene scale variations. More importantly, for the same scene, the anchor can slide along the depth axis, dynamically adjusting to different depth scales. A smaller anchor provides higher resolution in the near-field, improving depth precision for closer objects while a larger anchor improves depth estimation in far regions. This adaptability enables the model to handle depth predictions at varying distances and ensure strong generalization across datasets. Our design enables a unified and adaptive depth representation across diverse environments. Extensive experiments demonstrate that Metric-Solver outperforms existing methods in both accuracy and cross-dataset generalization.
format Preprint
id arxiv_https___arxiv_org_abs_2504_12103
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Metric-Solver: Sliding Anchored Metric Depth Estimation from a Single Image
Wen, Tao
Wang, Jiepeng
Chen, Yabo
Xu, Shugong
Zhang, Chi
Li, Xuelong
Computer Vision and Pattern Recognition
Accurate and generalizable metric depth estimation is crucial for various computer vision applications but remains challenging due to the diverse depth scales encountered in indoor and outdoor environments. In this paper, we introduce Metric-Solver, a novel sliding anchor-based metric depth estimation method that dynamically adapts to varying scene scales. Our approach leverages an anchor-based representation, where a reference depth serves as an anchor to separate and normalize the scene depth into two components: scaled near-field depth and tapered far-field depth. The anchor acts as a normalization factor, enabling the near-field depth to be normalized within a consistent range while mapping far-field depth smoothly toward zero. Through this approach, any depth from zero to infinity in the scene can be represented within a unified representation, effectively eliminating the need to manually account for scene scale variations. More importantly, for the same scene, the anchor can slide along the depth axis, dynamically adjusting to different depth scales. A smaller anchor provides higher resolution in the near-field, improving depth precision for closer objects while a larger anchor improves depth estimation in far regions. This adaptability enables the model to handle depth predictions at varying distances and ensure strong generalization across datasets. Our design enables a unified and adaptive depth representation across diverse environments. Extensive experiments demonstrate that Metric-Solver outperforms existing methods in both accuracy and cross-dataset generalization.
title Metric-Solver: Sliding Anchored Metric Depth Estimation from a Single Image
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2504.12103