Saved in:
Bibliographic Details
Main Authors: Wu, Zhenlong, Zheng, Zihan, Wang, Xuanxuan, Wang, Qianhe, Yang, Hua, Zhang, Xiaoyun, Hu, Qiang, Zhang, Wenjun
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2604.20784
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914499502538752
author Wu, Zhenlong
Zheng, Zihan
Wang, Xuanxuan
Wang, Qianhe
Yang, Hua
Zhang, Xiaoyun
Hu, Qiang
Zhang, Wenjun
author_facet Wu, Zhenlong
Zheng, Zihan
Wang, Xuanxuan
Wang, Qianhe
Yang, Hua
Zhang, Xiaoyun
Hu, Qiang
Zhang, Wenjun
contents Reconstructing dynamic 3D scenes from sparse multi-view videos is highly ill-posed, often leading to geometric collapse, trajectory drift, and floating artifacts. Recent attempts introduce generative priors to hallucinate missing content, yet naive integration frequently causes structural drift and temporal inconsistency due to the mismatch between stochastic 2D generation and deterministic 3D geometry. In this paper, we propose GeoRect4D, a novel unified framework for sparse-view dynamic reconstruction that couples explicit 3D consistency with generative refinement via a closed-loop optimization process. Specifically, GeoRect4D introduces a degradation-aware feedback mechanism that incorporates a robust anchor-based dynamic 3DGS substrate with a single-step diffusion rectifier to hallucinate high-fidelity details. This rectifier utilizes a structural locking mechanism and spatiotemporal coordinated attention, effectively preserving physical plausibility while restoring missing content. Furthermore, we present a progressive optimization strategy that employs stochastic geometric purification to eliminate floaters and generative distillation to infuse texture details into the explicit representation. Extensive experiments demonstrate that GeoRect4D achieves state-of-the-art performance in reconstruction fidelity, perceptual quality, and spatiotemporal consistency across multiple datasets.
format Preprint
id arxiv_https___arxiv_org_abs_2604_20784
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle GeoRect4D: Geometry-Compatible Generative Rectification for Dynamic Sparse-View 3D Reconstruction
Wu, Zhenlong
Zheng, Zihan
Wang, Xuanxuan
Wang, Qianhe
Yang, Hua
Zhang, Xiaoyun
Hu, Qiang
Zhang, Wenjun
Computer Vision and Pattern Recognition
Reconstructing dynamic 3D scenes from sparse multi-view videos is highly ill-posed, often leading to geometric collapse, trajectory drift, and floating artifacts. Recent attempts introduce generative priors to hallucinate missing content, yet naive integration frequently causes structural drift and temporal inconsistency due to the mismatch between stochastic 2D generation and deterministic 3D geometry. In this paper, we propose GeoRect4D, a novel unified framework for sparse-view dynamic reconstruction that couples explicit 3D consistency with generative refinement via a closed-loop optimization process. Specifically, GeoRect4D introduces a degradation-aware feedback mechanism that incorporates a robust anchor-based dynamic 3DGS substrate with a single-step diffusion rectifier to hallucinate high-fidelity details. This rectifier utilizes a structural locking mechanism and spatiotemporal coordinated attention, effectively preserving physical plausibility while restoring missing content. Furthermore, we present a progressive optimization strategy that employs stochastic geometric purification to eliminate floaters and generative distillation to infuse texture details into the explicit representation. Extensive experiments demonstrate that GeoRect4D achieves state-of-the-art performance in reconstruction fidelity, perceptual quality, and spatiotemporal consistency across multiple datasets.
title GeoRect4D: Geometry-Compatible Generative Rectification for Dynamic Sparse-View 3D Reconstruction
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2604.20784