:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Shi, Le, Shi, Yifei, Xu, Xin, Liu, Tenglong, Xi, Junhua, Chen, Chengyuan
Format:	Preprint
Published:	2025
Subjects:	Robotics Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2505.10359
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

MapGS: Generalizable Pretraining and Data Augmentation for Online Mapping via Novel View Synthesis
by: Zhang, Hengyuan, et al.
Published: (2025)

View-Invariant Policy Learning via Zero-Shot Novel View Synthesis
by: Tian, Stephen, et al.
Published: (2024)

SN-LiDAR: Semantic Neural Fields for Novel Space-time View LiDAR Synthesis
by: Chen, Yi, et al.
Published: (2025)

Aug3D: Augmenting large scale outdoor datasets for Generalizable Novel View Synthesis
by: Rauniyar, Aditya, et al.
Published: (2025)

FoldNet: Learning Generalizable Closed-Loop Policy for Garment Folding via Keypoint-Driven Asset and Demonstration Synthesis
by: Chen, Yuxing, et al.
Published: (2025)

GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction
by: Chen, Xiao, et al.
Published: (2024)

Do You Know Where Your Camera Is? View-Invariant Policy Learning with Camera Conditioning
by: Jiang, Tianchong, et al.
Published: (2025)

SG-Reg: Generalizable and Efficient Scene Graph Registration
by: Liu, Chuhao, et al.
Published: (2025)

ESCAPE: Episodic Spatial Memory and Adaptive Execution Policy for Long-Horizon Mobile Manipulation
by: Qian, Jingjing, et al.
Published: (2026)

Efficient Training of Generalizable Visuomotor Policies via Control-Aware Augmentation
by: Zhao, Yinuo, et al.
Published: (2024)

Systematic Evaluation of Novel View Synthesis for Video Place Recognition
by: Mahmud, Muhammad Zawad, et al.
Published: (2026)

Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy
by: Garcia, Ricardo, et al.
Published: (2024)

MSI-NeRF: Linking Omni-Depth with View Synthesis through Multi-Sphere Image aided Generalizable Neural Radiance Field
by: Yan, Dongyu, et al.
Published: (2024)

Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis
by: Qu, Zhongche, et al.
Published: (2024)

NVSim: Novel View Synthesis Simulator for Large Scale Indoor Navigation
by: Jeong, Mingyu, et al.
Published: (2025)

Collaborative Representation Learning for Alignment of Tactile, Language, and Vision Modalities
by: Zhou, Yiyun, et al.
Published: (2025)

GraspLDP: Towards Generalizable Grasping Policy via Latent Diffusion
by: Xiang, Enda, et al.
Published: (2026)

Recursive Visual Imagination and Adaptive Linguistic Grounding for Vision Language Navigation
by: Chen, Bolei, et al.
Published: (2025)

DAgger Diffusion Navigation: DAgger Boosted Diffusion Policy for Vision-Language Navigation
by: Shi, Haoxiang, et al.
Published: (2025)

3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
by: Ze, Yanjie, et al.
Published: (2024)

RoboScape-R: Unified Reward-Observation World Models for Generalizable Robotics Training via RL
by: Tang, Yinzhou, et al.
Published: (2025)

Universal Pose Pretraining for Generalizable Vision-Language-Action Policies
by: Lin, Haitao, et al.
Published: (2026)

Seeing to Act, Prompting to Specify: A Bayesian Factorization of Vision Language Action Policy
by: Xu, Kechun, et al.
Published: (2025)

VISO-Grasp: Vision-Language Informed Spatial Object-centric 6-DoF Active View Planning and Grasping in Clutter and Invisibility
by: Shi, Yitian, et al.
Published: (2025)

FetchBot: Learning Generalizable Object Fetching in Cluttered Scenes via Zero-Shot Sim2Real
by: Liu, Weiheng, et al.
Published: (2025)

Multi-View Video Diffusion Policy: A 3D Spatio-Temporal-Aware Video Action Model
by: Li, Peiyan, et al.
Published: (2026)

Generalizable Humanoid Manipulation with 3D Diffusion Policies
by: Ze, Yanjie, et al.
Published: (2024)

G-DexGrasp: Generalizable Dexterous Grasping Synthesis Via Part-Aware Prior Retrieval and Prior-Assisted Generation
by: Jian, Juntao, et al.
Published: (2025)

Towards Generalizable Robotic Manipulation in Dynamic Environments
by: Fang, Heng, et al.
Published: (2026)

Learning Generalizable 3D Manipulation With 10 Demonstrations
by: Ren, Yu, et al.
Published: (2024)

Language-Conditioned World Modeling for Visual Navigation
by: Dong, Yifei, et al.
Published: (2026)

Affordance-R1: Reinforcement Learning for Generalizable Affordance Reasoning in Multimodal Large Language Model
by: Wang, Hanqing, et al.
Published: (2025)

DK-SLAM: Monocular Visual SLAM with Deep Keypoint Learning, Tracking and Loop-Closing
by: Qu, Hao, et al.
Published: (2024)

ManiVID-3D: Generalizable View-Invariant Reinforcement Learning for Robotic Manipulation via Disentangled 3D Representations
by: Li, Zheng, et al.
Published: (2025)

GMT: Goal-Conditioned Multimodal Transformer for 6-DOF Object Trajectory Synthesis in 3D Scenes
by: Zeng, Huajian, et al.
Published: (2026)

Image Compression Using Novel View Synthesis Priors
by: Peng, Luyuan, et al.
Published: (2024)

WayveScenes101: A Dataset and Benchmark for Novel View Synthesis in Autonomous Driving
by: Zürn, Jannik, et al.
Published: (2024)

GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scenes
by: Chen, Xiao, et al.
Published: (2025)

Novel View Synthesis with Neural Radiance Fields for Industrial Robot Applications
by: Hillemann, Markus, et al.
Published: (2024)

Learning in ImaginationLand: Omnidirectional Policies through 3D Generative Models (OP-Gen)
by: Ren, Yifei, et al.
Published: (2025)