Saved in:
Bibliographic Details
Main Authors: Wang, Haotian, Xiao, Aoran, Zhang, Xiaoqin, Yang, Meng, Lu, Shijian
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2507.07374
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916835743498240
author Wang, Haotian
Xiao, Aoran
Zhang, Xiaoqin
Yang, Meng
Lu, Shijian
author_facet Wang, Haotian
Xiao, Aoran
Zhang, Xiaoqin
Yang, Meng
Lu, Shijian
contents Generalizable depth completion enables the acquisition of dense metric depth maps for unseen environments, offering robust perception capabilities for various downstream tasks. However, training such models typically requires large-scale datasets with metric depth labels, which are often labor-intensive to collect. This paper presents PacGDC, a label-efficient technique that enhances data diversity with minimal annotation effort for generalizable depth completion. PacGDC builds on novel insights into inherent ambiguities and consistencies in object shapes and positions during 2D-to-3D projection, allowing the synthesis of numerous pseudo geometries for the same visual scene. This process greatly broadens available geometries by manipulating scene scales of the corresponding depth maps. To leverage this property, we propose a new data synthesis pipeline that uses multiple depth foundation models as scale manipulators. These models robustly provide pseudo depth labels with varied scene scales, affecting both local objects and global layouts, while ensuring projection consistency that supports generalization. To further diversify geometries, we incorporate interpolation and relocation strategies, as well as unlabeled images, extending the data coverage beyond the individual use of foundation models. Extensive experiments show that PacGDC achieves remarkable generalizability across multiple benchmarks, excelling in diverse scene semantics/scales and depth sparsity/patterns under both zero-shot and few-shot settings. Code: https://github.com/Wang-xjtu/PacGDC.
format Preprint
id arxiv_https___arxiv_org_abs_2507_07374
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle PacGDC: Label-Efficient Generalizable Depth Completion with Projection Ambiguity and Consistency
Wang, Haotian
Xiao, Aoran
Zhang, Xiaoqin
Yang, Meng
Lu, Shijian
Computer Vision and Pattern Recognition
Generalizable depth completion enables the acquisition of dense metric depth maps for unseen environments, offering robust perception capabilities for various downstream tasks. However, training such models typically requires large-scale datasets with metric depth labels, which are often labor-intensive to collect. This paper presents PacGDC, a label-efficient technique that enhances data diversity with minimal annotation effort for generalizable depth completion. PacGDC builds on novel insights into inherent ambiguities and consistencies in object shapes and positions during 2D-to-3D projection, allowing the synthesis of numerous pseudo geometries for the same visual scene. This process greatly broadens available geometries by manipulating scene scales of the corresponding depth maps. To leverage this property, we propose a new data synthesis pipeline that uses multiple depth foundation models as scale manipulators. These models robustly provide pseudo depth labels with varied scene scales, affecting both local objects and global layouts, while ensuring projection consistency that supports generalization. To further diversify geometries, we incorporate interpolation and relocation strategies, as well as unlabeled images, extending the data coverage beyond the individual use of foundation models. Extensive experiments show that PacGDC achieves remarkable generalizability across multiple benchmarks, excelling in diverse scene semantics/scales and depth sparsity/patterns under both zero-shot and few-shot settings. Code: https://github.com/Wang-xjtu/PacGDC.
title PacGDC: Label-Efficient Generalizable Depth Completion with Projection Ambiguity and Consistency
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2507.07374