Library Catalog

Bibliographic Details
Main Authors:	Li, Jinming; Zhu, Yichen; Xu, Zhiyuan; Gu, Jindong; Zhu, Minjie; Liu, Xin; Liu, Ning; Peng, Yaxin; Feng, Feifei; Tang, Jian
Format:	Preprint
Published:	2024
Subjects:	Robotics; Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2406.19693

Similar Items

Scaling Diffusion Policy in Transformer to 1 Billion Parameters for Robotic Manipulation
by: Zhu, Minjie, et al.
Published: (2024)

Object-Centric Instruction Augmentation for Robotic Manipulation
by: Wen, Junjie, et al.
Published: (2024)

Language-Conditioned Robotic Manipulation with Fast and Slow Thinking
by: Zhu, Minjie, et al.
Published: (2024)

TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation
by: Wen, Junjie, et al.
Published: (2024)

Visual Robotic Manipulation with Depth-Aware Pretraining
by: Wang, Wanying, et al.
Published: (2024)

ChatVLA: Unified Multimodal Understanding and Robot Control with Vision-Language-Action Model
by: Zhou, Zhongyi, et al.
Published: (2025)

Diffusion-VLA: Generalizable and Interpretable Robot Foundation Model via Self-Generated Reasoning
by: Wen, Junjie, et al.
Published: (2024)

CoA-VLA: Improving Vision-Language-Action Models via Visual-Textual Chain-of-Affordance
by: Li, Jinming, et al.
Published: (2024)

ObjectVLA: End-to-End Open-World Object Manipulation Without Demonstration
by: Zhu, Minjie, et al.
Published: (2025)

Discrete Policy: Learning Disentangled Action Space for Multi-Task Robotic Manipulation
by: Wu, Kun, et al.
Published: (2024)

Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language Models
by: Zhu, Minjie, et al.
Published: (2024)

Let Me Show You: Learning by Retrieving from Egocentric Video for Robotic Manipulation
by: Zhu, Yichen, et al.
Published: (2025)

DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control
by: Wen, Junjie, et al.
Published: (2025)

PointVLA: Injecting the 3D World into Vision-Language-Action Models
by: Li, Chengmeng, et al.
Published: (2025)

A Survey on Robotics with Foundation Models: toward Embodied AI
by: Xu, Zhiyuan, et al.
Published: (2024)

dVLA: Diffusion Vision-Language-Action Model with Multimodal Chain-of-Thought
by: Wen, Junjie, et al.
Published: (2025)

Learning from Imperfect Demonstrations with Self-Supervision for Robotic Manipulation
by: Wu, Kun, et al.
Published: (2024)

HACTS: a Human-As-Copilot Teleoperation System for Robot Learning
by: Xu, Zhiyuan, et al.
Published: (2025)

Learning Generalizable Robot Policy with Human Demonstration Video as a Prompt
by: Zhu, Xiang, et al.
Published: (2025)

HARP-VLA: Human-Robot Aligned Representation Learning for Vision-Language-Action Model
by: Zhu, Xiang, et al.
Published: (2026)

Retrieval-Augmented Embodied Agents
by: Zhu, Yichen, et al.
Published: (2024)

Load-Aware Locomotion Control for Humanoid Robots in Industrial Transportation Tasks
by: Fu, Lequn, et al.
Published: (2026)

Designing Robots to Support Parent-Child Connections: Opportunities Through Robot-Mediated Communication
by: Xu, Michael F, et al.
Published: (2026)

MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation
by: Liu, Zhuoyang, et al.
Published: (2025)

EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching
by: Chen, Xinwang, et al.
Published: (2024)

Goal State Generation for Robotic Manipulation Based on Linguistically Guided Hybrid Gaussian Diffusion
by: Xu, Yichen, et al.
Published: (2024)

Designing Telepresence Robots to Support Place Attachment
by: Hu, Yaxin, et al.
Published: (2025)

A Microgravity Simulation Experimental Platform For Small Space Robots In Orbit
by: Luo, Hang, et al.
Published: (2025)

Bringing Robots Home: The Rise of AI Robots in Consumer Electronics
by: Dong, Haiwei, et al.
Published: (2024)

BadRobot: Jailbreaking Embodied LLMs in the Physical World
by: Zhang, Hangtao, et al.
Published: (2024)

MotuBrain: An Advanced World Action Model for Robot Control
by: MotuBrain Team, et al.
Published: (2026)

RoboAug: One Annotation to Hundreds of Scenes via Region-Contrastive Data Augmentation for Robotic Manipulation
by: Wang, Xinhua, et al.
Published: (2026)

ActiveUMI: Robotic Manipulation with Active Perception from Robot-Free Human Demonstrations
by: Zeng, Qiyuan, et al.
Published: (2025)

Hi-WM: Human-in-the-World-Model for Scalable Robot Post-Training
by: Li, Yaxuan, et al.
Published: (2026)

Modelling and Optimization of Magnetic Navigation Systems for Passive Robots in Minimally Invasive Brain Surgery
by: Xu Tang, et al.
Published: (2025)

dWorldEval: Scalable Robotic Policy Evaluation via Discrete Diffusion World Model
by: Li, Yaxuan, et al.
Published: (2026)

LLMs for Coding and Robotics Education
by: Shu, Peng, et al.
Published: (2024)

Embodiment Transfer Learning for Vision-Language-Action Models
by: Li, Chengmeng, et al.
Published: (2025)

OmniNxt: A Fully Open-source and Compact Aerial Robot with Omnidirectional Visual Perception
by: Liu, Peize, et al.
Published: (2024)

Failure Mechanisms and Risk Estimation for Legged Robot Locomotion on Granular Slopes
by: Liao, Xingjue, et al.
Published: (2026)