| Main Authors: | Yang, Jin, Wei, Ping, Chen, Yixin, Zheng, Nanning |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.04848 |
Similar Items
3D-MVP: 3D Multiview Pretraining for Robotic Manipulation
by: Qian, Shengyi, et al.
Published: (2024)
VideoVLA: Video Generators Can Be Generalizable Robot Manipulators
by: Shen, Yichao, et al.
Published: (2025)
PlaneHEC: Efficient Hand-Eye Calibration for Multi-view Robotic Arm via Any Point Cloud Plane Detection
by: Wang, Ye, et al.
Published: (2025)
Visual Robotic Manipulation with Depth-Aware Pretraining
by: Wang, Wanying, et al.
Published: (2024)
Optimal-Horizon Social Robot Navigation in Heterogeneous Crowds
by: Shi, Jiamin, et al.
Published: (2026)
Toward Visually Realistic Simulation: A Benchmark for Evaluating Robot Manipulation in Simulation
by: Zhu, Yixin, et al.
Published: (2026)
DexDiff: Towards Extrinsic Dexterity Manipulation of Ungraspable Objects in Unrestricted Environments
by: Ma, Chengzhong, et al.
Published: (2024)
Multiview Progress Prediction of Robot Activities
by: Zoppellari, Elena, et al.
Published: (2026)
FG-CLTP: Fine-Grained Contrastive Language Tactile Pretraining for Robotic Manipulation
by: Ma, Wenxuan, et al.
Published: (2026)
Mastering Robot Manipulation with Multimodal Prompts through Pretraining and Multi-task Fine-tuning
by: Li, Jiachen, et al.
Published: (2023)
iManip: Skill-Incremental Learning for Robotic Manipulation
by: Zheng, Zexin, et al.
Published: (2025)
EgoHumanoid: Unlocking In-the-Wild Loco-Manipulation with Robot-Free Egocentric Demonstration
by: Shi, Modi, et al.
Published: (2026)
RoboLight: A Dataset with Linearly Composable Illumination for Robotic Manipulation
by: Jin, Shutong, et al.
Published: (2026)
Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
by: Tian, Yang, et al.
Published: (2024)
Learning Physics from Pretrained Video Models: A Multimodal Continuous and Sequential World Interaction Models for Robotic Manipulation
by: Song, Zijian, et al.
Published: (2026)
Interpretable Robotic Manipulation from Language
by: Zheng, Boyuan, et al.
Published: (2024)
Integration of Robot and Scene Kinematics for Sequential Mobile Manipulation Planning
by: Jiao, Ziyuan, et al.
Published: (2025)
Skill-Aware Diffusion for Generalizable Robotic Manipulation
by: Huang, Aoshen, et al.
Published: (2026)
Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation
by: Bu, Qingwen, et al.
Published: (2024)
Towards a Unified Understanding of Robot Manipulation: A Comprehensive Survey
by: Bai, Shuanghao, et al.
Published: (2025)
Simultaneous Tactile-Visual Perception for Learning Multimodal Robot Manipulation
by: Li, Yuyang, et al.
Published: (2025)
3PoinTr: 3D Point Tracks for Robot Manipulation Pretraining from Casual Videos
by: Hung, Adam, et al.
Published: (2026)
Is Diversity All You Need for Scalable Robotic Manipulation?
by: Shi, Modi, et al.
Published: (2025)
Learning Action Manifold with Multi-view Latent Priors for Robotic Manipulation
by: Xiao, Junjin, et al.
Published: (2026)
Embodied Robot Manipulation in the Era of Foundation Models: Planning and Learning Perspectives
by: Bai, Shuanghao, et al.
Published: (2025)
Physically-based Lighting Generation for Robotic Manipulation
by: Jin, Shutong, et al.
Published: (2025)
Generative Visual Foresight Meets Task-Agnostic Pose Estimation in Robotic Table-Top Manipulation
by: Zhang, Chuye, et al.
Published: (2025)
Generative Artificial Intelligence in Robotic Manipulation: A Survey
by: Zhang, Kun, et al.
Published: (2025)
COMETH: Convex Optimization for Multiview Estimation and Tracking of Humans
by: Martini, Enrico, et al.
Published: (2025)
LongBench: Evaluating Robotic Manipulation Policies on Real-World Long-Horizon Tasks
by: Chen, Xueyao, et al.
Published: (2026)
Language-Grounded Decoupled Action Representation for Robotic Manipulation
by: Weng, Wuding, et al.
Published: (2026)
Synergizing Efficiency and Reliability for Continuous Mobile Manipulation
by: Wu, Chengkai, et al.
Published: (2026)
Language-Conditioned Open-Vocabulary Mobile Manipulation with Pretrained Models
by: Tan, Shen, et al.
Published: (2025)
Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
by: Li, Qixiu, et al.
Published: (2025)
VITaL Pretraining: Visuo-Tactile Pretraining for Tactile and Non-Tactile Manipulation Policies
by: George, Abraham, et al.
Published: (2024)
VLMPC: Vision-Language Model Predictive Control for Robotic Manipulation
by: Zhao, Wentao, et al.
Published: (2024)
Think before Go: Hierarchical Reasoning for Image-goal Navigation
by: Li, Pengna, et al.
Published: (2026)
GWM: Towards Scalable Gaussian World Models for Robotic Manipulation
by: Lu, Guanxing, et al.
Published: (2025)
Transferring Foundation Models for Generalizable Robotic Manipulation
by: Yang, Jiange, et al.
Published: (2023)
HyperSim: A Holistic Sim-To-Real Framework For Robust Robotic Manipulation
by: Dong, Junyi, et al.
Published: (2026)