Clár na nÁbhar: :: Library Catalog

Sábháilte in:

Sonraí bibleagrafaíochta
Príomhchruthaitheoirí:	Lu, Hongbo, Yao, Liang, He, Chenghao, Liu, Fan, Liao, Wenlong, He, Tao, Peng, Pai
Formáid:	Preprint
Foilsithe / Cruthaithe:	2026
Ábhair:	Computer Vision and Pattern Recognition
Rochtain ar líne:	https://arxiv.org/abs/2603.17382
Clibeanna:	Cuir clib leis Níl clibeanna ann, Bí ar an gcéad duine le clib a chur leis an taifead seo!

Clár na nÁbhar:

A fundamental bottleneck in Novel View Synthesis (NVS) for autonomous driving is the inherent supervision gap on novel trajectories: models are tasked with synthesizing unseen views during inference, yet lack ground truth images for these shifted poses during training. In this paper, we propose VisionNVS, a camera-only framework that fundamentally reformulates view synthesis from an ill-posed extrapolation problem into a self-supervised inpainting task. By introducing a ``Virtual-Shift'' strategy, we use monocular depth proxies to simulate occlusion patterns and map them onto the original view. This paradigm shift allows the use of raw, recorded images as pixel-perfect supervision, effectively eliminating the domain gap inherent in previous approaches. Furthermore, we address spatial consistency through a Pseudo-3D Seam Synthesis strategy, which integrates visual data from adjacent cameras during training to explicitly model real-world photometric discrepancies and calibration errors. Experiments demonstrate that VisionNVS achieves superior geometric fidelity and visual quality compared to LiDAR-dependent baselines, offering a robust solution for scalable driving simulation.

Míreanna comhchosúla