Bibliographic Details
Main Authors:	Ke, Bingxin; Zhou, Qunjie; Huang, Jiahui; Ren, Xuanchi; Shen, Tianchang; Schindler, Konrad; Leal-Taixé, Laura; Huang, Shengyu
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2602.14751

Similar Items

The NeRFect Match: Exploring NeRF Features for Visual Localization
by: Zhou, Qunjie, et al.
Published: (2024)

Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
by: Ke, Bingxin, et al.
Published: (2023)

MATCHA: Towards Matching Anything
by: Xue, Fei, et al.
Published: (2025)

ViPE: Video Pose Engine for 3D Geometric Perception
by: Huang, Jiahui, et al.
Published: (2025)

Video Depth without Video Models
by: Ke, Bingxin, et al.
Published: (2024)

Light3R-SfM: Towards Feed-forward Structure-from-Motion
by: Elflein, Sven, et al.
Published: (2025)

DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction
by: Seidenschwarz, Jenny, et al.
Published: (2024)

Déjà View: Looping Transformers for Multi-View 3D Reconstruction
by: Burzio, Alessandro, et al.
Published: (2026)

Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis
by: Ke, Bingxin, et al.
Published: (2025)

Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion
by: Viola, Massimiliano, et al.
Published: (2024)

Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models
by: Ren, Xuanchi, et al.
Published: (2025)

VGG-T$^3$: Offline Feed-Forward 3D Reconstruction at Scale
by: Elflein, Sven, et al.
Published: (2026)

SATTC: Structure-Aware Label-Free Test-Time Calibration for Cross-Subject EEG-to-Image Retrieval
by: Huang, Qunjie, et al.
Published: (2026)

StereoSpace: Depth-Free Synthesis of Stereo Geometry via End-to-End Diffusion in a Canonical Space
by: Behrens, Tjark, et al.
Published: (2025)

BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation
by: Zhang, Xiang, et al.
Published: (2024)

Test-Time Adaptation for Depth Completion
by: Park, Hyoungseob, et al.
Published: (2024)

GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
by: Ren, Xuanchi, et al.
Published: (2025)

Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments
by: Zhu, Liyuan, et al.
Published: (2023)

A Guide to Structureless Visual Localization
by: Panek, Vojtech, et al.
Published: (2025)

SPAMming Labels: Efficient Annotations for the Trackers of Tomorrow
by: Cetintas, Orcun, et al.
Published: (2024)

InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models
by: Lu, Yifan, et al.
Published: (2024)

Efficient Test-Time Optimization for Depth Completion via Low-Rank Decoder Adaptation
by: Seo, Minseok, et al.
Published: (2026)

Native Segmentation Vision Transformers
by: Brasó, Guillem, et al.
Published: (2025)

Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation
by: Bahmani, Sherwin, et al.
Published: (2025)

Towards Learning to Complete Anything in Lidar
by: Takmaz, Ayca, et al.
Published: (2025)

Soft Augmentation for Image Classification
by: Liu, Yang, et al.
Published: (2022)

Dynamic LiDAR Re-simulation using Compositional Neural Fields
by: Wu, Hanfeng, et al.
Published: (2023)

Zero-Shot 4D Lidar Panoptic Segmentation
by: Zhang, Yushan, et al.
Published: (2025)

SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation
by: Toker, Aysim, et al.
Published: (2024)

Need for Speed: Zero-Shot Depth Completion with Single-Step Diffusion
by: Gregorek, Jakub, et al.
Published: (2026)

Lyra 2.0: Explorable Generative 3D Worlds
by: Shen, Tianchang, et al.
Published: (2026)

MoRight: Motion Control Done Right
by: Liu, Shaowei, et al.
Published: (2026)

XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
by: Ren, Xuanchi, et al.
Published: (2023)

LoopSplat: Loop Closure by Registering 3D Gaussian Splats
by: Zhu, Liyuan, et al.
Published: (2024)

Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players
by: Liu, Fangfu, et al.
Published: (2026)

SeMoLi: What Moves Together Belongs Together
by: Seidenschwarz, Jenny, et al.
Published: (2024)

NOOUGAT: Towards Unified Online and Offline Multi-Object Tracking
by: Missaoui, Benjamin, et al.
Published: (2025)

A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking
by: Zhao, Zixiang, et al.
Published: (2025)

MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation
by: Yang, Linyan, et al.
Published: (2024)

DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control
by: Jia, Yuru, et al.
Published: (2023)