Saved in:
Bibliographic Details
Main Authors: Zhang, Yu-Wei, Han, Tongju, Gao, Lipeng, Wei, Mingqiang, Liu, Hui, Li, Changbao, Zhang, Caiming
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2508.19555
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • This paper presents MonoRelief V2, an end-to-end model designed for directly recovering 2.5D reliefs from single images under complex material and illumination variations. In contrast to its predecessor, MonoRelief V1 [1], which was solely trained on synthetic data, MonoRelief V2 incorporates real data to achieve improved robustness, accuracy and efficiency. To overcome the challenge of acquiring large-scale real-world dataset, we generate approximately 15,000 pseudo real images using a text-to-image generative model, and derive corresponding depth pseudo-labels through fusion of depth and normal predictions. Furthermore, we construct a small-scale real-world dataset (800 samples) via multi-view reconstruction and detail refinement. MonoRelief V2 is then progressively trained on the pseudo-real and real-world datasets. Comprehensive experiments demonstrate its state-of-the-art performance both in depth and normal predictions, highlighting its strong potential for a range of downstream applications. Code is at: https://github.com/glp1001/MonoreliefV2.