Saved in:
| Main Authors: | Ke, Bingxin, Zhou, Qunjie, Huang, Jiahui, Ren, Xuanchi, Shen, Tianchang, Schindler, Konrad, Leal-Taixé, Laura, Huang, Shengyu |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.14751 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
The NeRFect Match: Exploring NeRF Features for Visual Localization
by: Zhou, Qunjie, et al.
Published: (2024)
by: Zhou, Qunjie, et al.
Published: (2024)
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
by: Ke, Bingxin, et al.
Published: (2023)
by: Ke, Bingxin, et al.
Published: (2023)
MATCHA:Towards Matching Anything
by: Xue, Fei, et al.
Published: (2025)
by: Xue, Fei, et al.
Published: (2025)
ViPE: Video Pose Engine for 3D Geometric Perception
by: Huang, Jiahui, et al.
Published: (2025)
by: Huang, Jiahui, et al.
Published: (2025)
Video Depth without Video Models
by: Ke, Bingxin, et al.
Published: (2024)
by: Ke, Bingxin, et al.
Published: (2024)
Light3R-SfM: Towards Feed-forward Structure-from-Motion
by: Elflein, Sven, et al.
Published: (2025)
by: Elflein, Sven, et al.
Published: (2025)
DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction
by: Seidenschwarz, Jenny, et al.
Published: (2024)
by: Seidenschwarz, Jenny, et al.
Published: (2024)
Déjà View: Looping Transformers for Multi-View 3D Reconstruction
by: Burzio, Alessandro, et al.
Published: (2026)
by: Burzio, Alessandro, et al.
Published: (2026)
Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis
by: Ke, Bingxin, et al.
Published: (2025)
by: Ke, Bingxin, et al.
Published: (2025)
Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion
by: Viola, Massimiliano, et al.
Published: (2024)
by: Viola, Massimiliano, et al.
Published: (2024)
Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models
by: Ren, Xuanchi, et al.
Published: (2025)
by: Ren, Xuanchi, et al.
Published: (2025)
VGG-T$^3$: Offline Feed-Forward 3D Reconstruction at Scale
by: Elflein, Sven, et al.
Published: (2026)
by: Elflein, Sven, et al.
Published: (2026)
SATTC: Structure-Aware Label-Free Test-Time Calibration for Cross-Subject EEG-to-Image Retrieval
by: Huang, Qunjie, et al.
Published: (2026)
by: Huang, Qunjie, et al.
Published: (2026)
StereoSpace: Depth-Free Synthesis of Stereo Geometry via End-to-End Diffusion in a Canonical Space
by: Behrens, Tjark, et al.
Published: (2025)
by: Behrens, Tjark, et al.
Published: (2025)
BetterDepth: Plug-and-Play Diffusion Refiner for Zero-Shot Monocular Depth Estimation
by: Zhang, Xiang, et al.
Published: (2024)
by: Zhang, Xiang, et al.
Published: (2024)
Test-Time Adaptation for Depth Completion
by: Park, Hyoungseob, et al.
Published: (2024)
by: Park, Hyoungseob, et al.
Published: (2024)
GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
by: Ren, Xuanchi, et al.
Published: (2025)
by: Ren, Xuanchi, et al.
Published: (2025)
Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments
by: Zhu, Liyuan, et al.
Published: (2023)
by: Zhu, Liyuan, et al.
Published: (2023)
A Guide to Structureless Visual Localization
by: Panek, Vojtech, et al.
Published: (2025)
by: Panek, Vojtech, et al.
Published: (2025)
SPAMming Labels: Efficient Annotations for the Trackers of Tomorrow
by: Cetintas, Orcun, et al.
Published: (2024)
by: Cetintas, Orcun, et al.
Published: (2024)
InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models
by: Lu, Yifan, et al.
Published: (2024)
by: Lu, Yifan, et al.
Published: (2024)
Efficient Test-Time Optimization for Depth Completion via Low-Rank Decoder Adaptation
by: Seo, Minseok, et al.
Published: (2026)
by: Seo, Minseok, et al.
Published: (2026)
Native Segmentation Vision Transformers
by: Brasó, Guillem, et al.
Published: (2025)
by: Brasó, Guillem, et al.
Published: (2025)
Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation
by: Bahmani, Sherwin, et al.
Published: (2025)
by: Bahmani, Sherwin, et al.
Published: (2025)
Towards Learning to Complete Anything in Lidar
by: Takmaz, Ayca, et al.
Published: (2025)
by: Takmaz, Ayca, et al.
Published: (2025)
Soft Augmentation for Image Classification
by: Liu, Yang, et al.
Published: (2022)
by: Liu, Yang, et al.
Published: (2022)
Dynamic LiDAR Re-simulation using Compositional Neural Fields
by: Wu, Hanfeng, et al.
Published: (2023)
by: Wu, Hanfeng, et al.
Published: (2023)
Zero-Shot 4D Lidar Panoptic Segmentation
by: Zhang, Yushan, et al.
Published: (2025)
by: Zhang, Yushan, et al.
Published: (2025)
SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation
by: Toker, Aysim, et al.
Published: (2024)
by: Toker, Aysim, et al.
Published: (2024)
Need for Speed: Zero-Shot Depth Completion with Single-Step Diffusion
by: Gregorek, Jakub, et al.
Published: (2026)
by: Gregorek, Jakub, et al.
Published: (2026)
Lyra 2.0: Explorable Generative 3D Worlds
by: Shen, Tianchang, et al.
Published: (2026)
by: Shen, Tianchang, et al.
Published: (2026)
MoRight: Motion Control Done Right
by: Liu, Shaowei, et al.
Published: (2026)
by: Liu, Shaowei, et al.
Published: (2026)
XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
by: Ren, Xuanchi, et al.
Published: (2023)
by: Ren, Xuanchi, et al.
Published: (2023)
LoopSplat: Loop Closure by Registering 3D Gaussian Splats
by: Zhu, Liyuan, et al.
Published: (2024)
by: Zhu, Liyuan, et al.
Published: (2024)
Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players
by: Liu, Fangfu, et al.
Published: (2026)
by: Liu, Fangfu, et al.
Published: (2026)
SeMoLi: What Moves Together Belongs Together
by: Seidenschwarz, Jenny, et al.
Published: (2024)
by: Seidenschwarz, Jenny, et al.
Published: (2024)
NOOUGAT: Towards Unified Online and Offline Multi-Object Tracking
by: Missaoui, Benjamin, et al.
Published: (2025)
by: Missaoui, Benjamin, et al.
Published: (2025)
A Unified Solution to Video Fusion: From Multi-Frame Learning to Benchmarking
by: Zhao, Zixiang, et al.
Published: (2025)
by: Zhao, Zixiang, et al.
Published: (2025)
MICDrop: Masking Image and Depth Features via Complementary Dropout for Domain-Adaptive Semantic Segmentation
by: Yang, Linyan, et al.
Published: (2024)
by: Yang, Linyan, et al.
Published: (2024)
DGInStyle: Domain-Generalizable Semantic Segmentation with Image Diffusion Models and Stylized Semantic Control
by: Jia, Yuru, et al.
Published: (2023)
by: Jia, Yuru, et al.
Published: (2023)
Similar Items
-
The NeRFect Match: Exploring NeRF Features for Visual Localization
by: Zhou, Qunjie, et al.
Published: (2024) -
Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation
by: Ke, Bingxin, et al.
Published: (2023) -
MATCHA:Towards Matching Anything
by: Xue, Fei, et al.
Published: (2025) -
ViPE: Video Pose Engine for 3D Geometric Perception
by: Huang, Jiahui, et al.
Published: (2025) -
Video Depth without Video Models
by: Ke, Bingxin, et al.
Published: (2024)