Library Catalog

Bibliographic Details
Main Authors:	Lee, Junwoon; Tian, Yulun
Format:	Preprint
Published:	2026
Subjects:	Robotics; Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2602.12314

Similar Items

LatentBKI: Open-Dictionary Continuous Mapping in Visual-Language Latent Spaces with Quantifiable Uncertainty
by: Wilson, Joey, et al.
Published: (2024)

Learning Scene-Level Signed Directional Distance Function with Ellipsoidal Priors and Neural Residuals
by: Dai, Zhirui, et al.
Published: (2025)

SimScale: Learning to Drive via Real-World Simulation at Scale
by: Tian, Haochen, et al.
Published: (2025)

VILP: Imitation Learning with Latent Video Planning
by: Xu, Zhengtong, et al.
Published: (2025)

Latent-WAM: Latent World Action Modeling for End-to-End Autonomous Driving
by: Wang, Linbo, et al.
Published: (2026)

AugMapNet: Improving Spatial Latent Structure via BEV Grid Augmentation for Enhanced Vectorized Online HD Map Construction
by: Monninger, Thomas, et al.
Published: (2025)

Latent Gaussian Splatting for 4D Panoptic Occupancy Tracking
by: Luz, Maximilian, et al.
Published: (2026)

Semantic Segmentation and Depth Estimation for Real-Time Lunar Surface Mapping Using 3D Gaussian Splatting
by: Vila, Guillem Casadesus, et al.
Published: (2026)

GS-LIVO: Real-Time LiDAR, Inertial, and Visual Multi-sensor Fused Odometry with Gaussian Mapping
by: Hong, Sheng, et al.
Published: (2025)

Lifelong Imitation Learning with Multimodal Latent Replay and Incremental Adjustment
by: Yu, Fanqi, et al.
Published: (2026)

OpenNavMap: Structure-Free Topometric Mapping via Large-Scale Collaborative Localization
by: Jiao, Jianhao, et al.
Published: (2026)

Latent Chain-of-Thought World Modeling for End-to-End Driving
by: Tan, Shuhan, et al.
Published: (2025)

Large-Scale Gaussian Splatting SLAM
by: Xin, Zhe, et al.
Published: (2025)

GraspLDP: Towards Generalizable Grasping Policy via Latent Diffusion
by: Xiang, Enda, et al.
Published: (2026)

Conditioning Latent-Space Clusters for Real-World Anomaly Classification
by: Bogdoll, Daniel, et al.
Published: (2023)

Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention
by: Gu, Xunjiang, et al.
Published: (2024)

ES-Gaussian: Gaussian Splatting Mapping via Error Space-Based Gaussian Completion
by: Chen, Lu, et al.
Published: (2024)

LaMP: Learning Vision-Language-Action Policies with 3D Scene Flow as Latent Motion Prior
by: Wang, Xinkai, et al.
Published: (2026)

Online Embedding Multi-Scale CLIP Features into 3D Maps
by: Taguchi, Shun, et al.
Published: (2024)

MapBERT: Bitwise Masked Modeling for Real-Time Semantic Mapping Generation
by: Deng, Yijie, et al.
Published: (2025)

Efficient Robotic Policy Learning via Latent Space Backward Planning
by: Liu, Dongxiu, et al.
Published: (2025)

ReefMapGS: Enabling Large-Scale Underwater Reconstruction by Closing the Loop Between Multimodal SLAM and Gaussian Splatting
by: Yang, Daniel, et al.
Published: (2026)

FastViDAR: Real-Time Omnidirectional Depth Estimation via Alternative Hierarchical Attention
by: Zhao, Hangtian, et al.
Published: (2025)

Latent Representations for Visual Proprioception in Inexpensive Robots
by: Sheikholeslami, Sahara, et al.
Published: (2025)

Latent Action Pretraining Through World Modeling
by: Tharwat, Bahey, et al.
Published: (2025)

CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning
by: Yang, Jiange, et al.
Published: (2025)

From Language to Locomotion: Retargeting-free Humanoid Control via Motion Latent Guidance
by: Li, Zhe, et al.
Published: (2025)

Real-Time ESFP: Estimating, Smoothing, Filtering, and Pose-Mapping
by: Cui, Qifei, et al.
Published: (2025)

MapGCLR: Geospatial Contrastive Learning of Representations for Online Vectorized HD Map Construction
by: Merkert, Jonas, et al.
Published: (2026)

LOPR: Latent Occupancy PRediction using Generative Models
by: Lange, Bernard, et al.
Published: (2022)

LaST-R1: Reinforcing Robotic Manipulation via Adaptive Physical Latent Reasoning
by: Chen, Hao, et al.
Published: (2026)

MapGS: Generalizable Pretraining and Data Augmentation for Online Mapping via Novel View Synthesis
by: Zhang, Hengyuan, et al.
Published: (2025)

Real-Time Metric-Semantic Mapping for Autonomous Navigation in Outdoor Environments
by: Jiao, Jianhao, et al.
Published: (2024)

Real-Time Polygonal Semantic Mapping for Humanoid Robot Stair Climbing
by: Bin, Teng, et al.
Published: (2024)

UniLACT: Depth-Aware RGB Latent Action Learning for Vision-Language-Action Models
by: Govind, Manish Kumar, et al.
Published: (2026)

PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning
by: Villar-Corrales, Angel, et al.
Published: (2025)

BridgeDepth: Bridging Monocular and Stereo Reasoning with Latent Alignment
by: Guan, Tongfan, et al.
Published: (2025)

FlowSSC: Universal Generative Monocular Semantic Scene Completion via One-Step Latent Diffusion
by: Xi, Zichen, et al.
Published: (2026)

EmbodiedSAM: Online Segment Any 3D Thing in Real Time
by: Xu, Xiuwei, et al.
Published: (2024)

CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM
by: Feng, Dapeng, et al.
Published: (2024)