:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Sarkar, Sayan Deb, Pautrat, Rémi, Miksik, Ondrej, Pollefeys, Marc, Armeni, Iro, Rad, Mahdi, Dusmanu, Mihai
Format:	Preprint
Published:	2026
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence Computation and Language
Online Access:	https://arxiv.org/abs/2602.13191
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

CrossOver: 3D Scene Cross-Modal Alignment
by: Sarkar, Sayan Deb, et al.
Published: (2025)

SGAligner++: Cross-Modal Language-Aided 3D Scene Graph Alignment
by: Singh, Binod, et al.
Published: (2025)

Space3D-Bench: Spatial 3D Question Answering Benchmark
by: Szymanska, Emilia, et al.
Published: (2024)

Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models
by: Qu, Kevin, et al.
Published: (2026)

GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer
by: Sarkar, Sayan Deb, et al.
Published: (2025)

UnLoc: Leveraging Depth Uncertainties for Floorplan Localization
by: Wüest, Matthias, et al.
Published: (2025)

HouseTour: A Virtual Real Estate A(I)gent
by: Çelen, Ata, et al.
Published: (2025)

Multiway Point Cloud Mosaicking with Diffusion and Global Optimization
by: Jin, Shengze, et al.
Published: (2024)

Volumetric Semantically Consistent 3D Panoptic Mapping
by: Miao, Yang, et al.
Published: (2023)

MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby References
by: Bösiger, Lukas, et al.
Published: (2024)

"Where am I?" Scene Retrieval with Language
by: Chen, Jiaqi, et al.
Published: (2024)

Relative Pose Estimation through Affine Corrections of Monocular Depth Priors
by: Yu, Yifan, et al.
Published: (2025)

WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments
by: Zheng, Jianhao, et al.
Published: (2025)

LightGlueStick: a Fast and Robust Glue for Joint Point-Line Matching
by: Ubingazhibov, Aidyn, et al.
Published: (2025)

CoPE: A Small Language Model for Steerable and Scalable Content Labeling
by: Chakrabarti, Samidh, et al.
Published: (2025)

CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs
by: Li, Haoran, et al.
Published: (2026)

Nothing Stands Still: A Spatiotemporal Benchmark on 3D Point Cloud Registration Under Large Geometric and Temporal Change
by: Sun, Tao, et al.
Published: (2023)

AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding
by: Qi, Haozhe, et al.
Published: (2026)

MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps
by: Zheng, Jianhao, et al.
Published: (2024)

GaussFusion: Improving 3D Reconstruction in the Wild with A Geometry-Informed Video Generator
by: Zhu, Liyuan, et al.
Published: (2026)

3D Neural Edge Reconstruction
by: Li, Lei, et al.
Published: (2024)

ReSpace: Text-Driven Autoregressive 3D Indoor Scene Synthesis and Editing
by: Bucher, Martin JJ., et al.
Published: (2025)

Multi Activity Sequence Alignment via Implicit Clustering
by: Kwon, Taein, et al.
Published: (2025)

Robust Incremental Structure-from-Motion with Hybrid Features
by: Liu, Shaohui, et al.
Published: (2024)

EgoGen: An Egocentric Synthetic Data Generator
by: Li, Gen, et al.
Published: (2024)

MR.NAVI: Mixed-Reality Navigation Assistant for the Visually Impaired
by: Pfitzer, Nicolas, et al.
Published: (2025)

CodecSight: Leveraging Video Codec Signals for Efficient Streaming VLM Inference
by: Zou, Yulin, et al.
Published: (2026)

WildPose: A Unified Framework for Robust Pose Estimation in the Wild
by: Zheng, Jianhao, et al.
Published: (2026)

Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments
by: Zhu, Liyuan, et al.
Published: (2023)

Facade Segmentation for Solar Photovoltaic Suitability
by: Duran, Ayca, et al.
Published: (2025)

CoPE: A Lightweight Complex Positional Encoding
by: Amballa, Avinash
Published: (2025)

CoLM-DSR: Leveraging Neural Codec Language Modeling for Multi-Modal Dysarthric Speech Reconstruction
by: Chen, Xueyuan, et al.
Published: (2024)

Rectified Point Flow: Generic Point Cloud Pose Estimation
by: Sun, Tao, et al.
Published: (2025)

SuperDec: 3D Scene Decomposition with Superquadric Primitives
by: Fedele, Elisabetta, et al.
Published: (2025)

ReScene4D: Temporally Consistent Semantic Instance Segmentation of Evolving Indoor 3D Scenes
by: Steiner, Emily, et al.
Published: (2026)

Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
by: Jiang, Zeren, et al.
Published: (2025)

Evaluating the Role of the CoPE in Relation to Established Laryngology PROMs
by: Camryn R. Marshall, et al.
Published: (2025)

LoopSplat: Loop Closure by Registering 3D Gaussian Splats
by: Zhu, Liyuan, et al.
Published: (2024)

ReStyle3D: Scene-Level Appearance Transfer with Semantic Correspondences
by: Zhu, Liyuan, et al.
Published: (2025)

Deep Sketch-Based 3D Modeling: A Survey
by: Tono, Alberto, et al.
Published: (2026)