Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wang, Haotian, Xiao, Aoran, Zhang, Xiaoqin, Yang, Meng, Lu, Shijian
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2507.07374
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916835743498240
author	Wang, Haotian Xiao, Aoran Zhang, Xiaoqin Yang, Meng Lu, Shijian
author_facet	Wang, Haotian Xiao, Aoran Zhang, Xiaoqin Yang, Meng Lu, Shijian
contents	Generalizable depth completion enables the acquisition of dense metric depth maps for unseen environments, offering robust perception capabilities for various downstream tasks. However, training such models typically requires large-scale datasets with metric depth labels, which are often labor-intensive to collect. This paper presents PacGDC, a label-efficient technique that enhances data diversity with minimal annotation effort for generalizable depth completion. PacGDC builds on novel insights into inherent ambiguities and consistencies in object shapes and positions during 2D-to-3D projection, allowing the synthesis of numerous pseudo geometries for the same visual scene. This process greatly broadens available geometries by manipulating scene scales of the corresponding depth maps. To leverage this property, we propose a new data synthesis pipeline that uses multiple depth foundation models as scale manipulators. These models robustly provide pseudo depth labels with varied scene scales, affecting both local objects and global layouts, while ensuring projection consistency that supports generalization. To further diversify geometries, we incorporate interpolation and relocation strategies, as well as unlabeled images, extending the data coverage beyond the individual use of foundation models. Extensive experiments show that PacGDC achieves remarkable generalizability across multiple benchmarks, excelling in diverse scene semantics/scales and depth sparsity/patterns under both zero-shot and few-shot settings. Code: https://github.com/Wang-xjtu/PacGDC.
format	Preprint
id	arxiv_https___arxiv_org_abs_2507_07374
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	PacGDC: Label-Efficient Generalizable Depth Completion with Projection Ambiguity and Consistency Wang, Haotian Xiao, Aoran Zhang, Xiaoqin Yang, Meng Lu, Shijian Computer Vision and Pattern Recognition Generalizable depth completion enables the acquisition of dense metric depth maps for unseen environments, offering robust perception capabilities for various downstream tasks. However, training such models typically requires large-scale datasets with metric depth labels, which are often labor-intensive to collect. This paper presents PacGDC, a label-efficient technique that enhances data diversity with minimal annotation effort for generalizable depth completion. PacGDC builds on novel insights into inherent ambiguities and consistencies in object shapes and positions during 2D-to-3D projection, allowing the synthesis of numerous pseudo geometries for the same visual scene. This process greatly broadens available geometries by manipulating scene scales of the corresponding depth maps. To leverage this property, we propose a new data synthesis pipeline that uses multiple depth foundation models as scale manipulators. These models robustly provide pseudo depth labels with varied scene scales, affecting both local objects and global layouts, while ensuring projection consistency that supports generalization. To further diversify geometries, we incorporate interpolation and relocation strategies, as well as unlabeled images, extending the data coverage beyond the individual use of foundation models. Extensive experiments show that PacGDC achieves remarkable generalizability across multiple benchmarks, excelling in diverse scene semantics/scales and depth sparsity/patterns under both zero-shot and few-shot settings. Code: https://github.com/Wang-xjtu/PacGDC.
title	PacGDC: Label-Efficient Generalizable Depth Completion with Projection Ambiguity and Consistency
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2507.07374

Similar Items