Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Wu, Zhenlong, Zheng, Zihan, Wang, Xuanxuan, Wang, Qianhe, Yang, Hua, Zhang, Xiaoyun, Hu, Qiang, Zhang, Wenjun
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2604.20784
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914499502538752
author	Wu, Zhenlong Zheng, Zihan Wang, Xuanxuan Wang, Qianhe Yang, Hua Zhang, Xiaoyun Hu, Qiang Zhang, Wenjun
author_facet	Wu, Zhenlong Zheng, Zihan Wang, Xuanxuan Wang, Qianhe Yang, Hua Zhang, Xiaoyun Hu, Qiang Zhang, Wenjun
contents	Reconstructing dynamic 3D scenes from sparse multi-view videos is highly ill-posed, often leading to geometric collapse, trajectory drift, and floating artifacts. Recent attempts introduce generative priors to hallucinate missing content, yet naive integration frequently causes structural drift and temporal inconsistency due to the mismatch between stochastic 2D generation and deterministic 3D geometry. In this paper, we propose GeoRect4D, a novel unified framework for sparse-view dynamic reconstruction that couples explicit 3D consistency with generative refinement via a closed-loop optimization process. Specifically, GeoRect4D introduces a degradation-aware feedback mechanism that incorporates a robust anchor-based dynamic 3DGS substrate with a single-step diffusion rectifier to hallucinate high-fidelity details. This rectifier utilizes a structural locking mechanism and spatiotemporal coordinated attention, effectively preserving physical plausibility while restoring missing content. Furthermore, we present a progressive optimization strategy that employs stochastic geometric purification to eliminate floaters and generative distillation to infuse texture details into the explicit representation. Extensive experiments demonstrate that GeoRect4D achieves state-of-the-art performance in reconstruction fidelity, perceptual quality, and spatiotemporal consistency across multiple datasets.
format	Preprint
id	arxiv_https___arxiv_org_abs_2604_20784
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	GeoRect4D: Geometry-Compatible Generative Rectification for Dynamic Sparse-View 3D Reconstruction Wu, Zhenlong Zheng, Zihan Wang, Xuanxuan Wang, Qianhe Yang, Hua Zhang, Xiaoyun Hu, Qiang Zhang, Wenjun Computer Vision and Pattern Recognition Reconstructing dynamic 3D scenes from sparse multi-view videos is highly ill-posed, often leading to geometric collapse, trajectory drift, and floating artifacts. Recent attempts introduce generative priors to hallucinate missing content, yet naive integration frequently causes structural drift and temporal inconsistency due to the mismatch between stochastic 2D generation and deterministic 3D geometry. In this paper, we propose GeoRect4D, a novel unified framework for sparse-view dynamic reconstruction that couples explicit 3D consistency with generative refinement via a closed-loop optimization process. Specifically, GeoRect4D introduces a degradation-aware feedback mechanism that incorporates a robust anchor-based dynamic 3DGS substrate with a single-step diffusion rectifier to hallucinate high-fidelity details. This rectifier utilizes a structural locking mechanism and spatiotemporal coordinated attention, effectively preserving physical plausibility while restoring missing content. Furthermore, we present a progressive optimization strategy that employs stochastic geometric purification to eliminate floaters and generative distillation to infuse texture details into the explicit representation. Extensive experiments demonstrate that GeoRect4D achieves state-of-the-art performance in reconstruction fidelity, perceptual quality, and spatiotemporal consistency across multiple datasets.
title	GeoRect4D: Geometry-Compatible Generative Rectification for Dynamic Sparse-View 3D Reconstruction
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2604.20784

Similar Items