| Main Authors: | Zeng, Xianchao, Zhou, Xinyu, Li, Youcheng, Shi, Jiayou, Li, Tianle, Chen, Liangming, Ren, Lei, Li, Yong-Lu |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.02787 |
Similar Items
Learning Actionable Manipulation Recovery via Counterfactual Failure Synthesis
by: Li, Dayou, et al.
Published: (2026)
Simultaneous Tactile-Visual Perception for Learning Multimodal Robot Manipulation
by: Li, Yuyang, et al.
Published: (2025)
LIDEA: Human-to-Robot Imitation Learning via Implicit Feature Distillation and Explicit Geometry Alignment
by: Xu, Yifu, et al.
Published: (2026)
Language-Guided Grasp Detection with Coarse-to-Fine Learning for Robotic Manipulation
by: Jiang, Zebin, et al.
Published: (2025)
ManipTrans: Efficient Dexterous Bimanual Manipulation Transfer via Residual Learning
by: Li, Kailin, et al.
Published: (2025)
Learning Manipulation by Predicting Interaction
by: Zeng, Jia, et al.
Published: (2024)
Learning Humanoid End-Effector Control for Open-Vocabulary Visual Loco-Manipulation
by: Dong, Runpei, et al.
Published: (2026)
Structural Action Transformer for 3D Dexterous Manipulation
by: Lei, Xiaohan, et al.
Published: (2026)
VLN-MME: Diagnosing MLLMs as Language-guided Visual Navigation agents
by: Zhao, Xunyi, et al.
Published: (2025)
ESCAPE: Episodic Spatial Memory and Adaptive Execution Policy for Long-Horizon Mobile Manipulation
by: Qian, Jingjing, et al.
Published: (2026)
Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations
by: Li, Puhao, et al.
Published: (2024)
RoboView-Bias: Benchmarking Visual Bias in Embodied Agents for Robotic Manipulation
by: Liu, Enguang, et al.
Published: (2025)
GeoPredict: Leveraging Predictive Kinematics and 3D Gaussian Geometry for Precise VLA Manipulation
by: Qian, Jingjing, et al.
Published: (2025)
Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance
by: Zeng, Jing, et al.
Published: (2024)
AIC MLLM: Autonomous Interactive Correction MLLM for Robust Robotic Manipulation
by: Xiong, Chuyan, et al.
Published: (2024)
ManiVID-3D: Generalizable View-Invariant Reinforcement Learning for Robotic Manipulation via Disentangled 3D Representations
by: Li, Zheng, et al.
Published: (2025)
YOCO: You Only Calibrate Once for Accurate Extrinsic Parameter in LiDAR-Camera Systems
by: Zeng, Tianle, et al.
Published: (2024)
Neuro-Symbolic Manipulation Understanding with Enriched Semantic Event Chains
by: Ziaeetabar, Fatemeh
Published: (2026)
Mash, Spread, Slice! Learning to Manipulate Object States via Visual Spatial Progress
by: Mandikal, Priyanka, et al.
Published: (2025)
TAPTRv2: Attention-based Position Update Improves Tracking Any Point
by: Li, Hongyang, et al.
Published: (2024)
TAPTR: Tracking Any Point with Transformers as Detection
by: Li, Hongyang, et al.
Published: (2024)
$χ_{0}$: Resource-Aware Robust Manipulation via Taming Distributional Inconsistencies
by: Yu, Checheng, et al.
Published: (2026)
DeformMaster: An Interactive Physics-Neural World Model for Deformable Objects from Videos
by: Li, Can, et al.
Published: (2026)
NaturalVLM: Leveraging Fine-grained Natural Language for Affordance-Guided Visual Manipulation
by: Xu, Ran, et al.
Published: (2024)
STRNet: Visual Navigation with Spatio-Temporal Representation through Dynamic Graph Aggregation
by: Ren, Hao, et al.
Published: (2026)
DynaRend: Learning 3D Dynamics via Masked Future Rendering for Robotic Manipulation
by: Tian, Jingyi, et al.
Published: (2025)
Physically Ground Commonsense Knowledge for Articulated Object Manipulation with Analytic Concepts
by: Wei, Jiude, et al.
Published: (2025)
Prior Does Matter: Visual Navigation via Denoising Diffusion Bridge Models
by: Ren, Hao, et al.
Published: (2025)
Think Proprioceptively: Embodied Visual Reasoning for VLA Manipulation
by: Wang, Fangyuan, et al.
Published: (2026)
TrajBooster: Boosting Humanoid Whole-Body Manipulation via Trajectory-Centric Learning
by: Liu, Jiacheng, et al.
Published: (2025)
Learning Generalizable 3D Manipulation With 10 Demonstrations
by: Ren, Yu, et al.
Published: (2024)
ActiveUMI: Robotic Manipulation with Active Perception from Robot-Free Human Demonstrations
by: Zeng, Qiyuan, et al.
Published: (2025)
DFM-VLA: Iterative Action Refinement for Robot Manipulation via Discrete Flow Matching
by: Chen, Jiayi, et al.
Published: (2026)
R2RGEN: Real-to-Real 3D Data Generation for Spatially Generalized Manipulation
by: Xu, Xiuwei, et al.
Published: (2025)
Dense Policy: Bidirectional Autoregressive Learning of Actions
by: Su, Yue, et al.
Published: (2025)
Click to Grasp: Zero-Shot Precise Manipulation via Visual Diffusion Descriptors
by: Tsagkas, Nikolaos, et al.
Published: (2024)
Scaling Cross-Environment Failure Reasoning Data for Vision-Language Robotic Manipulation
by: Pacaud, Paul, et al.
Published: (2025)
Self-Correcting VLA: Online Action Refinement via Sparse World Imagination
by: Liu, Chenyv, et al.
Published: (2026)
SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation
by: Zhang, Junjie, et al.
Published: (2024)
DSM: Constructing a Diverse Semantic Map for 3D Visual Grounding
by: Xie, Qinghongbing, et al.
Published: (2025)