| Main Authors: | Shi, Le, Shi, Yifei, Xu, Xin, Liu, Tenglong, Xi, Junhua, Chen, Chengyuan |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.10359 |
Similar Items
MapGS: Generalizable Pretraining and Data Augmentation for Online Mapping via Novel View Synthesis
by: Zhang, Hengyuan, et al.
Published: (2025)
View-Invariant Policy Learning via Zero-Shot Novel View Synthesis
by: Tian, Stephen, et al.
Published: (2024)
SN-LiDAR: Semantic Neural Fields for Novel Space-time View LiDAR Synthesis
by: Chen, Yi, et al.
Published: (2025)
Aug3D: Augmenting large scale outdoor datasets for Generalizable Novel View Synthesis
by: Rauniyar, Aditya, et al.
Published: (2025)
FoldNet: Learning Generalizable Closed-Loop Policy for Garment Folding via Keypoint-Driven Asset and Demonstration Synthesis
by: Chen, Yuxing, et al.
Published: (2025)
GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction
by: Chen, Xiao, et al.
Published: (2024)
Do You Know Where Your Camera Is? View-Invariant Policy Learning with Camera Conditioning
by: Jiang, Tianchong, et al.
Published: (2025)
SG-Reg: Generalizable and Efficient Scene Graph Registration
by: Liu, Chuhao, et al.
Published: (2025)
ESCAPE: Episodic Spatial Memory and Adaptive Execution Policy for Long-Horizon Mobile Manipulation
by: Qian, Jingjing, et al.
Published: (2026)
Efficient Training of Generalizable Visuomotor Policies via Control-Aware Augmentation
by: Zhao, Yinuo, et al.
Published: (2024)
Systematic Evaluation of Novel View Synthesis for Video Place Recognition
by: Mahmud, Muhammad Zawad, et al.
Published: (2026)
Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy
by: Garcia, Ricardo, et al.
Published: (2024)
MSI-NeRF: Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance Field
by: Yan, Dongyu, et al.
Published: (2024)
Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis
by: Qu, Zhongche, et al.
Published: (2024)
NVSim: Novel View Synthesis Simulator for Large Scale Indoor Navigation
by: Jeong, Mingyu, et al.
Published: (2025)
Collaborative Representation Learning for Alignment of Tactile, Language, and Vision Modalities
by: Zhou, Yiyun, et al.
Published: (2025)
GraspLDP: Towards Generalizable Grasping Policy via Latent Diffusion
by: Xiang, Enda, et al.
Published: (2026)
Recursive Visual Imagination and Adaptive Linguistic Grounding for Vision Language Navigation
by: Chen, Bolei, et al.
Published: (2025)
DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation
by: Shi, Haoxiang, et al.
Published: (2025)
3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
by: Ze, Yanjie, et al.
Published: (2024)
RoboScape-R: Unified Reward-Observation World Models for Generalizable Robotics Training via RL
by: Tang, Yinzhou, et al.
Published: (2025)
Universal Pose Pretraining for Generalizable Vision-Language-Action Policies
by: Lin, Haitao, et al.
Published: (2026)
Seeing to Act, Prompting to Specify: A Bayesian Factorization of Vision Language Action Policy
by: Xu, Kechun, et al.
Published: (2025)
VISO-Grasp: Vision-Language Informed Spatial Object-centric 6-DoF Active View Planning and Grasping in Clutter and Invisibility
by: Shi, Yitian, et al.
Published: (2025)
FetchBot: Learning Generalizable Object Fetching in Cluttered Scenes via Zero-Shot Sim2Real
by: Liu, Weiheng, et al.
Published: (2025)
Multi-View Video Diffusion Policy: A 3D Spatio-Temporal-Aware Video Action Model
by: Li, Peiyan, et al.
Published: (2026)
Generalizable Humanoid Manipulation with 3D Diffusion Policies
by: Ze, Yanjie, et al.
Published: (2024)
G-DexGrasp: Generalizable Dexterous Grasping Synthesis Via Part-Aware Prior Retrieval and Prior-Assisted Generation
by: Jian, Juntao, et al.
Published: (2025)
Towards Generalizable Robotic Manipulation in Dynamic Environments
by: Fang, Heng, et al.
Published: (2026)
Learning Generalizable 3D Manipulation With 10 Demonstrations
by: Ren, Yu, et al.
Published: (2024)
Language-Conditioned World Modeling for Visual Navigation
by: Dong, Yifei, et al.
Published: (2026)
Affordance-R1: Reinforcement Learning for Generalizable Affordance Reasoning in Multimodal Large Language Model
by: Wang, Hanqing, et al.
Published: (2025)
DK-SLAM: Monocular Visual SLAM with Deep Keypoint Learning, Tracking and Loop-Closing
by: Qu, Hao, et al.
Published: (2024)
ManiVID-3D: Generalizable View-Invariant Reinforcement Learning for Robotic Manipulation via Disentangled 3D Representations
by: Li, Zheng, et al.
Published: (2025)
GMT: Goal-Conditioned Multimodal Transformer for 6-DOF Object Trajectory Synthesis in 3D Scenes
by: Zeng, Huajian, et al.
Published: (2026)
Image Compression Using Novel View Synthesis Priors
by: Peng, Luyuan, et al.
Published: (2024)
WayveScenes101: A Dataset and Benchmark for Novel View Synthesis in Autonomous Driving
by: Zürn, Jannik, et al.
Published: (2024)
GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scenes
by: Chen, Xiao, et al.
Published: (2025)
Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications
by: Hillemann, Markus, et al.
Published: (2024)
Learning in ImaginationLand: Omnidirectional Policies through 3D Generative Models (OP-Gen)
by: Ren, Yifei, et al.
Published: (2025)