Saved in:
| Main Authors: | Li, Jinming, Zhu, Yichen, Xu, Zhiyuan, Gu, Jindong, Zhu, Minjie, Liu, Xin, Liu, Ning, Peng, Yaxin, Feng, Feifei, Tang, Jian |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.19693 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Scaling Diffusion Policy in Transformer to 1 Billion Parameters for Robotic Manipulation
by: Zhu, Minjie, et al.
Published: (2024)
by: Zhu, Minjie, et al.
Published: (2024)
Object-Centric Instruction Augmentation for Robotic Manipulation
by: Wen, Junjie, et al.
Published: (2024)
by: Wen, Junjie, et al.
Published: (2024)
Language-Conditioned Robotic Manipulation with Fast and Slow Thinking
by: Zhu, Minjie, et al.
Published: (2024)
by: Zhu, Minjie, et al.
Published: (2024)
TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation
by: Wen, Junjie, et al.
Published: (2024)
by: Wen, Junjie, et al.
Published: (2024)
Visual Robotic Manipulation with Depth-Aware Pretraining
by: Wang, Wanying, et al.
Published: (2024)
by: Wang, Wanying, et al.
Published: (2024)
ChatVLA: Unified Multimodal Understanding and Robot Control with Vision-Language-Action Model
by: Zhou, Zhongyi, et al.
Published: (2025)
by: Zhou, Zhongyi, et al.
Published: (2025)
Diffusion-VLA: Generalizable and Interpretable Robot Foundation Model via Self-Generated Reasoning
by: Wen, Junjie, et al.
Published: (2024)
by: Wen, Junjie, et al.
Published: (2024)
CoA-VLA: Improving Vision-Language-Action Models via Visual-Textual Chain-of-Affordance
by: Li, Jinming, et al.
Published: (2024)
by: Li, Jinming, et al.
Published: (2024)
ObjectVLA: End-to-End Open-World Object Manipulation Without Demonstration
by: Zhu, Minjie, et al.
Published: (2025)
by: Zhu, Minjie, et al.
Published: (2025)
Discrete Policy: Learning Disentangled Action Space for Multi-Task Robotic Manipulation
by: Wu, Kun, et al.
Published: (2024)
by: Wu, Kun, et al.
Published: (2024)
Mipha: A Comprehensive Overhaul of Multimodal Assistant with Small Language Models
by: Zhu, Minjie, et al.
Published: (2024)
by: Zhu, Minjie, et al.
Published: (2024)
Let Me Show You: Learning by Retrieving from Egocentric Video for Robotic Manipulation
by: Zhu, Yichen, et al.
Published: (2025)
by: Zhu, Yichen, et al.
Published: (2025)
DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control
by: Wen, Junjie, et al.
Published: (2025)
by: Wen, Junjie, et al.
Published: (2025)
PointVLA: Injecting the 3D World into Vision-Language-Action Models
by: Li, Chengmeng, et al.
Published: (2025)
by: Li, Chengmeng, et al.
Published: (2025)
A Survey on Robotics with Foundation Models: toward Embodied AI
by: Xu, Zhiyuan, et al.
Published: (2024)
by: Xu, Zhiyuan, et al.
Published: (2024)
dVLA: Diffusion Vision-Language-Action Model with Multimodal Chain-of-Thought
by: Wen, Junjie, et al.
Published: (2025)
by: Wen, Junjie, et al.
Published: (2025)
Learning from Imperfect Demonstrations with Self-Supervision for Robotic Manipulation
by: Wu, Kun, et al.
Published: (2024)
by: Wu, Kun, et al.
Published: (2024)
HACTS: a Human-As-Copilot Teleoperation System for Robot Learning
by: Xu, Zhiyuan, et al.
Published: (2025)
by: Xu, Zhiyuan, et al.
Published: (2025)
Learning Generalizable Robot Policy with Human Demonstration Video as a Prompt
by: Zhu, Xiang, et al.
Published: (2025)
by: Zhu, Xiang, et al.
Published: (2025)
HARP-VLA: Human-Robot Aligned Representation Learning for Vision-Language-Action Model
by: Zhu, Xiang, et al.
Published: (2026)
by: Zhu, Xiang, et al.
Published: (2026)
Retrieval-Augmented Embodied Agents
by: Zhu, Yichen, et al.
Published: (2024)
by: Zhu, Yichen, et al.
Published: (2024)
Load-Aware Locomotion Control for Humanoid Robots in Industrial Transportation Tasks
by: Fu, Lequn, et al.
Published: (2026)
by: Fu, Lequn, et al.
Published: (2026)
Designing Robots to Support Parent-Child Connections: Opportunities Through Robot-Mediated Communication
by: Xu, Michael F, et al.
Published: (2026)
by: Xu, Michael F, et al.
Published: (2026)
MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation
by: Liu, Zhuoyang, et al.
Published: (2025)
by: Liu, Zhuoyang, et al.
Published: (2025)
EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching
by: Chen, Xinwang, et al.
Published: (2024)
by: Chen, Xinwang, et al.
Published: (2024)
Goal State Generation for Robotic Manipulation Based on Linguistically Guided Hybrid Gaussian Diffusion
by: Xu, Yichen, et al.
Published: (2024)
by: Xu, Yichen, et al.
Published: (2024)
Designing Telepresence Robots to Support Place Attachment
by: Hu, Yaxin, et al.
Published: (2025)
by: Hu, Yaxin, et al.
Published: (2025)
A Microgravity Simulation Experimental Platform For Small Space Robots In Orbit
by: Luo, Hang, et al.
Published: (2025)
by: Luo, Hang, et al.
Published: (2025)
Bringing Robots Home: The Rise of AI Robots in Consumer Electronics
by: Dong, Haiwei, et al.
Published: (2024)
by: Dong, Haiwei, et al.
Published: (2024)
BadRobot: Jailbreaking Embodied LLMs in the Physical World
by: Zhang, Hangtao, et al.
Published: (2024)
by: Zhang, Hangtao, et al.
Published: (2024)
MotuBrain: An Advanced World Action Model for Robot Control
by: MotuBrain Team, et al.
Published: (2026)
by: MotuBrain Team, et al.
Published: (2026)
RoboAug: One Annotation to Hundreds of Scenes via Region-Contrastive Data Augmentation for Robotic Manipulation
by: Wang, Xinhua, et al.
Published: (2026)
by: Wang, Xinhua, et al.
Published: (2026)
ActiveUMI: Robotic Manipulation with Active Perception from Robot-Free Human Demonstrations
by: Zeng, Qiyuan, et al.
Published: (2025)
by: Zeng, Qiyuan, et al.
Published: (2025)
Hi-WM: Human-in-the-World-Model for Scalable Robot Post-Training
by: Li, Yaxuan, et al.
Published: (2026)
by: Li, Yaxuan, et al.
Published: (2026)
Modelling and Optimization of Magnetic Navigation Systems for Passive Robots in Minimally Invasive Brain Surgery
by: Xu Tang, et al.
Published: (2025)
by: Xu Tang, et al.
Published: (2025)
dWorldEval: Scalable Robotic Policy Evaluation via Discrete Diffusion World Model
by: Li, Yaxuan, et al.
Published: (2026)
by: Li, Yaxuan, et al.
Published: (2026)
LLMs for Coding and Robotics Education
by: Shu, Peng, et al.
Published: (2024)
by: Shu, Peng, et al.
Published: (2024)
Embodiment Transfer Learning for Vision-Language-Action Models
by: Li, Chengmeng, et al.
Published: (2025)
by: Li, Chengmeng, et al.
Published: (2025)
OmniNxt: A Fully Open-source and Compact Aerial Robot with Omnidirectional Visual Perception
by: Liu, Peize, et al.
Published: (2024)
by: Liu, Peize, et al.
Published: (2024)
Failure Mechanisms and Risk Estimation for Legged Robot Locomotion on Granular Slopes
by: Liao, Xingjue, et al.
Published: (2026)
by: Liao, Xingjue, et al.
Published: (2026)
Similar Items
-
Scaling Diffusion Policy in Transformer to 1 Billion Parameters for Robotic Manipulation
by: Zhu, Minjie, et al.
Published: (2024) -
Object-Centric Instruction Augmentation for Robotic Manipulation
by: Wen, Junjie, et al.
Published: (2024) -
Language-Conditioned Robotic Manipulation with Fast and Slow Thinking
by: Zhu, Minjie, et al.
Published: (2024) -
TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic Manipulation
by: Wen, Junjie, et al.
Published: (2024) -
Visual Robotic Manipulation with Depth-Aware Pretraining
by: Wang, Wanying, et al.
Published: (2024)