:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yang, Jin, Wei, Ping, Chen, Yixin, Zheng, Nanning
Format:	Preprint
Published:	2026
Subjects:	Robotics
Online Access:	https://arxiv.org/abs/2603.04848
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

3D-MVP: 3D Multiview Pretraining for Robotic Manipulation
by: Qian, Shengyi, et al.
Published: (2024)

VideoVLA: Video Generators Can Be Generalizable Robot Manipulators
by: Shen, Yichao, et al.
Published: (2025)

PlaneHEC: Efficient Hand-Eye Calibration for Multi-view Robotic Arm via Any Point Cloud Plane Detection
by: Wang, Ye, et al.
Published: (2025)

Visual Robotic Manipulation with Depth-Aware Pretraining
by: Wang, Wanying, et al.
Published: (2024)

Optimal-Horizon Social Robot Navigation in Heterogeneous Crowds
by: Shi, Jiamin, et al.
Published: (2026)

Toward Visually Realistic Simulation: A Benchmark for Evaluating Robot Manipulation in Simulation
by: Zhu, Yixin, et al.
Published: (2026)

DexDiff: Towards Extrinsic Dexterity Manipulation of Ungraspable Objects in Unrestricted Environments
by: Ma, Chengzhong, et al.
Published: (2024)

Multiview Progress Prediction of Robot Activities
by: Zoppellari, Elena, et al.
Published: (2026)

FG-CLTP: Fine-Grained Contrastive Language Tactile Pretraining for Robotic Manipulation
by: Ma, Wenxuan, et al.
Published: (2026)

Mastering Robot Manipulation with Multimodal Prompts through Pretraining and Multi-task Fine-tuning
by: Li, Jiachen, et al.
Published: (2023)

iManip: Skill-Incremental Learning for Robotic Manipulation
by: Zheng, Zexin, et al.
Published: (2025)

EgoHumanoid: Unlocking In-the-Wild Loco-Manipulation with Robot-Free Egocentric Demonstration
by: Shi, Modi, et al.
Published: (2026)

RoboLight: A Dataset with Linearly Composable Illumination for Robotic Manipulation
by: Jin, Shutong, et al.
Published: (2026)

Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
by: Tian, Yang, et al.
Published: (2024)

Learning Physics from Pretrained Video Models: A Multimodal Continuous and Sequential World Interaction Models for Robotic Manipulation
by: Song, Zijian, et al.
Published: (2026)

Interpretable Robotic Manipulation from Language
by: Zheng, Boyuan, et al.
Published: (2024)

Integration of Robot and Scene Kinematics for Sequential Mobile Manipulation Planning
by: Jiao, Ziyuan, et al.
Published: (2025)

Skill-Aware Diffusion for Generalizable Robotic Manipulation
by: Huang, Aoshen, et al.
Published: (2026)

Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation
by: Bu, Qingwen, et al.
Published: (2024)

Towards a Unified Understanding of Robot Manipulation: A Comprehensive Survey
by: Bai, Shuanghao, et al.
Published: (2025)

Simultaneous Tactile-Visual Perception for Learning Multimodal Robot Manipulation
by: Li, Yuyang, et al.
Published: (2025)

3PoinTr: 3D Point Tracks for Robot Manipulation Pretraining from Casual Videos
by: Hung, Adam, et al.
Published: (2026)

Is Diversity All You Need for Scalable Robotic Manipulation?
by: Shi, Modi, et al.
Published: (2025)

Learning Action Manifold with Multi-view Latent Priors for Robotic Manipulation
by: Xiao, Junjin, et al.
Published: (2026)

Embodied Robot Manipulation in the Era of Foundation Models: Planning and Learning Perspectives
by: Bai, Shuanghao, et al.
Published: (2025)

Physically-based Lighting Generation for Robotic Manipulation
by: Jin, Shutong, et al.
Published: (2025)

Generative Visual Foresight Meets Task-Agnostic Pose Estimation in Robotic Table-Top Manipulation
by: Zhang, Chuye, et al.
Published: (2025)

Generative Artificial Intelligence in Robotic Manipulation: A Survey
by: Zhang, Kun, et al.
Published: (2025)

COMETH: Convex Optimization for Multiview Estimation and Tracking of Humans
by: Martini, Enrico, et al.
Published: (2025)

LongBench: Evaluating Robotic Manipulation Policies on Real-World Long-Horizon Tasks
by: Chen, Xueyao, et al.
Published: (2026)

Language-Grounded Decoupled Action Representation for Robotic Manipulation
by: Weng, Wuding, et al.
Published: (2026)

Synergizing Efficiency and Reliability for Continuous Mobile Manipulation
by: Wu, Chengkai, et al.
Published: (2026)

Language-Conditioned Open-Vocabulary Mobile Manipulation with Pretrained Models
by: Tan, Shen, et al.
Published: (2025)

Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
by: Li, Qixiu, et al.
Published: (2025)

VITaL Pretraining: Visuo-Tactile Pretraining for Tactile and Non-Tactile Manipulation Policies
by: George, Abraham, et al.
Published: (2024)

VLMPC: Vision-Language Model Predictive Control for Robotic Manipulation
by: Zhao, Wentao, et al.
Published: (2024)

Think before Go: Hierarchical Reasoning for Image-goal Navigation
by: Li, Pengna, et al.
Published: (2026)

GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
by: Lu, Guanxing, et al.
Published: (2025)

Transferring Foundation Models for Generalizable Robotic Manipulation
by: Yang, Jiange, et al.
Published: (2023)

HyperSim: A Holistic Sim-To-Real Framework For Robust Robotic Manipulation
by: Dong, Junyi, et al.
Published: (2026)