Saved in:
| Main Authors: | Sarkar, Sayan Deb, Pautrat, Rémi, Miksik, Ondrej, Pollefeys, Marc, Armeni, Iro, Rad, Mahdi, Dusmanu, Mihai |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.13191 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
CrossOver: 3D Scene Cross-Modal Alignment
by: Sarkar, Sayan Deb, et al.
Published: (2025)
by: Sarkar, Sayan Deb, et al.
Published: (2025)
SGAligner++: Cross-Modal Language-Aided 3D Scene Graph Alignment
by: Singh, Binod, et al.
Published: (2025)
by: Singh, Binod, et al.
Published: (2025)
Space3D-Bench: Spatial 3D Question Answering Benchmark
by: Szymanska, Emilia, et al.
Published: (2024)
by: Szymanska, Emilia, et al.
Published: (2024)
Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models
by: Qu, Kevin, et al.
Published: (2026)
by: Qu, Kevin, et al.
Published: (2026)
GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer
by: Sarkar, Sayan Deb, et al.
Published: (2025)
by: Sarkar, Sayan Deb, et al.
Published: (2025)
UnLoc: Leveraging Depth Uncertainties for Floorplan Localization
by: Wüest, Matthias, et al.
Published: (2025)
by: Wüest, Matthias, et al.
Published: (2025)
HouseTour: A Virtual Real Estate A(I)gent
by: Çelen, Ata, et al.
Published: (2025)
by: Çelen, Ata, et al.
Published: (2025)
Multiway Point Cloud Mosaicking with Diffusion and Global Optimization
by: Jin, Shengze, et al.
Published: (2024)
by: Jin, Shengze, et al.
Published: (2024)
Volumetric Semantically Consistent 3D Panoptic Mapping
by: Miao, Yang, et al.
Published: (2023)
by: Miao, Yang, et al.
Published: (2023)
MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby References
by: Bösiger, Lukas, et al.
Published: (2024)
by: Bösiger, Lukas, et al.
Published: (2024)
"Where am I?" Scene Retrieval with Language
by: Chen, Jiaqi, et al.
Published: (2024)
by: Chen, Jiaqi, et al.
Published: (2024)
Relative Pose Estimation through Affine Corrections of Monocular Depth Priors
by: Yu, Yifan, et al.
Published: (2025)
by: Yu, Yifan, et al.
Published: (2025)
WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments
by: Zheng, Jianhao, et al.
Published: (2025)
by: Zheng, Jianhao, et al.
Published: (2025)
LightGlueStick: a Fast and Robust Glue for Joint Point-Line Matching
by: Ubingazhibov, Aidyn, et al.
Published: (2025)
by: Ubingazhibov, Aidyn, et al.
Published: (2025)
CoPE: A Small Language Model for Steerable and Scalable Content Labeling
by: Chakrabarti, Samidh, et al.
Published: (2025)
by: Chakrabarti, Samidh, et al.
Published: (2025)
CoPE: Clipped RoPE as A Scalable Free Lunch for Long Context LLMs
by: Li, Haoran, et al.
Published: (2026)
by: Li, Haoran, et al.
Published: (2026)
Nothing Stands Still: A Spatiotemporal Benchmark on 3D Point Cloud Registration Under Large Geometric and Temporal Change
by: Sun, Tao, et al.
Published: (2023)
by: Sun, Tao, et al.
Published: (2023)
AdaptToken: Entropy-based Adaptive Token Selection for MLLM Long Video Understanding
by: Qi, Haozhe, et al.
Published: (2026)
by: Qi, Haozhe, et al.
Published: (2026)
MAP-ADAPT: Real-Time Quality-Adaptive Semantic 3D Maps
by: Zheng, Jianhao, et al.
Published: (2024)
by: Zheng, Jianhao, et al.
Published: (2024)
GaussFusion: Improving 3D Reconstruction in the Wild with A Geometry-Informed Video Generator
by: Zhu, Liyuan, et al.
Published: (2026)
by: Zhu, Liyuan, et al.
Published: (2026)
3D Neural Edge Reconstruction
by: Li, Lei, et al.
Published: (2024)
by: Li, Lei, et al.
Published: (2024)
ReSpace: Text-Driven Autoregressive 3D Indoor Scene Synthesis and Editing
by: Bucher, Martin JJ., et al.
Published: (2025)
by: Bucher, Martin JJ., et al.
Published: (2025)
Multi Activity Sequence Alignment via Implicit Clustering
by: Kwon, Taein, et al.
Published: (2025)
by: Kwon, Taein, et al.
Published: (2025)
Robust Incremental Structure-from-Motion with Hybrid Features
by: Liu, Shaohui, et al.
Published: (2024)
by: Liu, Shaohui, et al.
Published: (2024)
EgoGen: An Egocentric Synthetic Data Generator
by: Li, Gen, et al.
Published: (2024)
by: Li, Gen, et al.
Published: (2024)
MR.NAVI: Mixed-Reality Navigation Assistant for the Visually Impaired
by: Pfitzer, Nicolas, et al.
Published: (2025)
by: Pfitzer, Nicolas, et al.
Published: (2025)
CodecSight: Leveraging Video Codec Signals for Efficient Streaming VLM Inference
by: Zou, Yulin, et al.
Published: (2026)
by: Zou, Yulin, et al.
Published: (2026)
WildPose: A Unified Framework for Robust Pose Estimation in the Wild
by: Zheng, Jianhao, et al.
Published: (2026)
by: Zheng, Jianhao, et al.
Published: (2026)
Living Scenes: Multi-object Relocalization and Reconstruction in Changing 3D Environments
by: Zhu, Liyuan, et al.
Published: (2023)
by: Zhu, Liyuan, et al.
Published: (2023)
Facade Segmentation for Solar Photovoltaic Suitability
by: Duran, Ayca, et al.
Published: (2025)
by: Duran, Ayca, et al.
Published: (2025)
CoPE: A Lightweight Complex Positional Encoding
by: Amballa, Avinash
Published: (2025)
by: Amballa, Avinash
Published: (2025)
CoLM-DSR: Leveraging Neural Codec Language Modeling for Multi-Modal Dysarthric Speech Reconstruction
by: Chen, Xueyuan, et al.
Published: (2024)
by: Chen, Xueyuan, et al.
Published: (2024)
Rectified Point Flow: Generic Point Cloud Pose Estimation
by: Sun, Tao, et al.
Published: (2025)
by: Sun, Tao, et al.
Published: (2025)
SuperDec: 3D Scene Decomposition with Superquadric Primitives
by: Fedele, Elisabetta, et al.
Published: (2025)
by: Fedele, Elisabetta, et al.
Published: (2025)
ReScene4D: Temporally Consistent Semantic Instance Segmentation of Evolving Indoor 3D Scenes
by: Steiner, Emily, et al.
Published: (2026)
by: Steiner, Emily, et al.
Published: (2026)
Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
by: Jiang, Zeren, et al.
Published: (2025)
by: Jiang, Zeren, et al.
Published: (2025)
Evaluating the Role of the CoPE in Relation to Established Laryngology PROMs
by: Camryn R. Marshall, et al.
Published: (2025)
by: Camryn R. Marshall, et al.
Published: (2025)
LoopSplat: Loop Closure by Registering 3D Gaussian Splats
by: Zhu, Liyuan, et al.
Published: (2024)
by: Zhu, Liyuan, et al.
Published: (2024)
ReStyle3D: Scene-Level Appearance Transfer with Semantic Correspondences
by: Zhu, Liyuan, et al.
Published: (2025)
by: Zhu, Liyuan, et al.
Published: (2025)
Deep Sketch-Based 3D Modeling: A Survey
by: Tono, Alberto, et al.
Published: (2026)
by: Tono, Alberto, et al.
Published: (2026)
Similar Items
-
CrossOver: 3D Scene Cross-Modal Alignment
by: Sarkar, Sayan Deb, et al.
Published: (2025) -
SGAligner++: Cross-Modal Language-Aided 3D Scene Graph Alignment
by: Singh, Binod, et al.
Published: (2025) -
Space3D-Bench: Spatial 3D Question Answering Benchmark
by: Szymanska, Emilia, et al.
Published: (2024) -
Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models
by: Qu, Kevin, et al.
Published: (2026) -
GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer
by: Sarkar, Sayan Deb, et al.
Published: (2025)