Saved in:
| Main Authors: | Lee, Junwoon, Tian, Yulun |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.12314 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LatentBKI: Open-Dictionary Continuous Mapping in Visual-Language Latent Spaces with Quantifiable Uncertainty
by: Wilson, Joey, et al.
Published: (2024)
by: Wilson, Joey, et al.
Published: (2024)
Learning Scene-Level Signed Directional Distance Function with Ellipsoidal Priors and Neural Residuals
by: Dai, Zhirui, et al.
Published: (2025)
by: Dai, Zhirui, et al.
Published: (2025)
SimScale: Learning to Drive via Real-World Simulation at Scale
by: Tian, Haochen, et al.
Published: (2025)
by: Tian, Haochen, et al.
Published: (2025)
VILP: Imitation Learning with Latent Video Planning
by: Xu, Zhengtong, et al.
Published: (2025)
by: Xu, Zhengtong, et al.
Published: (2025)
Latent-WAM: Latent World Action Modeling for End-to-End Autonomous Driving
by: Wang, Linbo, et al.
Published: (2026)
by: Wang, Linbo, et al.
Published: (2026)
AugMapNet: Improving Spatial Latent Structure via BEV Grid Augmentation for Enhanced Vectorized Online HD Map Construction
by: Monninger, Thomas, et al.
Published: (2025)
by: Monninger, Thomas, et al.
Published: (2025)
Latent Gaussian Splatting for 4D Panoptic Occupancy Tracking
by: Luz, Maximilian, et al.
Published: (2026)
by: Luz, Maximilian, et al.
Published: (2026)
Semantic Segmentation and Depth Estimation for Real-Time Lunar Surface Mapping Using 3D Gaussian Splatting
by: Vila, Guillem Casadesus, et al.
Published: (2026)
by: Vila, Guillem Casadesus, et al.
Published: (2026)
GS-LIVO: Real-Time LiDAR, Inertial, and Visual Multi-sensor Fused Odometry with Gaussian Mapping
by: Hong, Sheng, et al.
Published: (2025)
by: Hong, Sheng, et al.
Published: (2025)
Lifelong Imitation Learning with Multimodal Latent Replay and Incremental Adjustment
by: Yu, Fanqi, et al.
Published: (2026)
by: Yu, Fanqi, et al.
Published: (2026)
OpenNavMap: Structure-Free Topometric Mapping via Large-Scale Collaborative Localization
by: Jiao, Jianhao, et al.
Published: (2026)
by: Jiao, Jianhao, et al.
Published: (2026)
Latent Chain-of-Thought World Modeling for End-to-End Driving
by: Tan, Shuhan, et al.
Published: (2025)
by: Tan, Shuhan, et al.
Published: (2025)
Large-Scale Gaussian Splatting SLAM
by: Xin, Zhe, et al.
Published: (2025)
by: Xin, Zhe, et al.
Published: (2025)
GraspLDP: Towards Generalizable Grasping Policy via Latent Diffusion
by: Xiang, Enda, et al.
Published: (2026)
by: Xiang, Enda, et al.
Published: (2026)
Conditioning Latent-Space Clusters for Real-World Anomaly Classification
by: Bogdoll, Daniel, et al.
Published: (2023)
by: Bogdoll, Daniel, et al.
Published: (2023)
Accelerating Online Mapping and Behavior Prediction via Direct BEV Feature Attention
by: Gu, Xunjiang, et al.
Published: (2024)
by: Gu, Xunjiang, et al.
Published: (2024)
ES-Gaussian: Gaussian Splatting Mapping via Error Space-Based Gaussian Completion
by: Chen, Lu, et al.
Published: (2024)
by: Chen, Lu, et al.
Published: (2024)
LaMP: Learning Vision-Language-Action Policies with 3D Scene Flow as Latent Motion Prior
by: Wang, Xinkai, et al.
Published: (2026)
by: Wang, Xinkai, et al.
Published: (2026)
Online Embedding Multi-Scale CLIP Features into 3D Maps
by: Taguchi, Shun, et al.
Published: (2024)
by: Taguchi, Shun, et al.
Published: (2024)
MapBERT: Bitwise Masked Modeling for Real-Time Semantic Mapping Generation
by: Deng, Yijie, et al.
Published: (2025)
by: Deng, Yijie, et al.
Published: (2025)
Efficient Robotic Policy Learning via Latent Space Backward Planning
by: Liu, Dongxiu, et al.
Published: (2025)
by: Liu, Dongxiu, et al.
Published: (2025)
ReefMapGS: Enabling Large-Scale Underwater Reconstruction by Closing the Loop Between Multimodal SLAM and Gaussian Splatting
by: Yang, Daniel, et al.
Published: (2026)
by: Yang, Daniel, et al.
Published: (2026)
FastViDAR: Real-Time Omnidirectional Depth Estimation via Alternative Hierarchical Attention
by: Zhao, Hangtian, et al.
Published: (2025)
by: Zhao, Hangtian, et al.
Published: (2025)
Latent Representations for Visual Proprioception in Inexpensive Robots
by: Sheikholeslami, Sahara, et al.
Published: (2025)
by: Sheikholeslami, Sahara, et al.
Published: (2025)
Latent Action Pretraining Through World Modeling
by: Tharwat, Bahey, et al.
Published: (2025)
by: Tharwat, Bahey, et al.
Published: (2025)
CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning
by: Yang, Jiange, et al.
Published: (2025)
by: Yang, Jiange, et al.
Published: (2025)
From Language to Locomotion: Retargeting-free Humanoid Control via Motion Latent Guidance
by: Li, Zhe, et al.
Published: (2025)
by: Li, Zhe, et al.
Published: (2025)
Real-Time ESFP: Estimating, Smoothing, Filtering, and Pose-Mapping
by: Cui, Qifei, et al.
Published: (2025)
by: Cui, Qifei, et al.
Published: (2025)
MapGCLR: Geospatial Contrastive Learning of Representations for Online Vectorized HD Map Construction
by: Merkert, Jonas, et al.
Published: (2026)
by: Merkert, Jonas, et al.
Published: (2026)
LOPR: Latent Occupancy PRediction using Generative Models
by: Lange, Bernard, et al.
Published: (2022)
by: Lange, Bernard, et al.
Published: (2022)
LaST-R1: Reinforcing Robotic Manipulation via Adaptive Physical Latent Reasoning
by: Chen, Hao, et al.
Published: (2026)
by: Chen, Hao, et al.
Published: (2026)
MapGS: Generalizable Pretraining and Data Augmentation for Online Mapping via Novel View Synthesis
by: Zhang, Hengyuan, et al.
Published: (2025)
by: Zhang, Hengyuan, et al.
Published: (2025)
Real-Time Metric-Semantic Mapping for Autonomous Navigation in Outdoor Environments
by: Jiao, Jianhao, et al.
Published: (2024)
by: Jiao, Jianhao, et al.
Published: (2024)
Real-Time Polygonal Semantic Mapping for Humanoid Robot Stair Climbing
by: Bin, Teng, et al.
Published: (2024)
by: Bin, Teng, et al.
Published: (2024)
UniLACT: Depth-Aware RGB Latent Action Learning for Vision-Language-Action Models
by: Govind, Manish Kumar, et al.
Published: (2026)
by: Govind, Manish Kumar, et al.
Published: (2026)
PlaySlot: Learning Inverse Latent Dynamics for Controllable Object-Centric Video Prediction and Planning
by: Villar-Corrales, Angel, et al.
Published: (2025)
by: Villar-Corrales, Angel, et al.
Published: (2025)
BridgeDepth: Bridging Monocular and Stereo Reasoning with Latent Alignment
by: Guan, Tongfan, et al.
Published: (2025)
by: Guan, Tongfan, et al.
Published: (2025)
FlowSSC: Universal Generative Monocular Semantic Scene Completion via One-Step Latent Diffusion
by: Xi, Zichen, et al.
Published: (2026)
by: Xi, Zichen, et al.
Published: (2026)
EmbodiedSAM: Online Segment Any 3D Thing in Real Time
by: Xu, Xiuwei, et al.
Published: (2024)
by: Xu, Xiuwei, et al.
Published: (2024)
CaRtGS: Computational Alignment for Real-Time Gaussian Splatting SLAM
by: Feng, Dapeng, et al.
Published: (2024)
by: Feng, Dapeng, et al.
Published: (2024)
Similar Items
-
LatentBKI: Open-Dictionary Continuous Mapping in Visual-Language Latent Spaces with Quantifiable Uncertainty
by: Wilson, Joey, et al.
Published: (2024) -
Learning Scene-Level Signed Directional Distance Function with Ellipsoidal Priors and Neural Residuals
by: Dai, Zhirui, et al.
Published: (2025) -
SimScale: Learning to Drive via Real-World Simulation at Scale
by: Tian, Haochen, et al.
Published: (2025) -
VILP: Imitation Learning with Latent Video Planning
by: Xu, Zhengtong, et al.
Published: (2025) -
Latent-WAM: Latent World Action Modeling for End-to-End Autonomous Driving
by: Wang, Linbo, et al.
Published: (2026)