| Main Authors: | Liu, Zhirui, Ji, Kaiyang, Yang, Ke, Fan, Yahao, Yu, Jingyi, Shi, Ye, Wang, Jingya |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.22963 |
Similar Items
DreamPolicy: A Unified World-model Policy for Scalable Humanoid Locomotion
by: Fan, Yahao, et al.
Published: (2025)
Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis
by: Ji, Kaiyang, et al.
Published: (2025)
UniHM: Unified Dexterous Hand Manipulation with Vision Language Model
by: Zhang, Zhenhao, et al.
Published: (2026)
A Unified Diffusion Framework for Scene-aware Human Motion Estimation from Sparse Signals
by: Tang, Jiangnan, et al.
Published: (2024)
SCRIPT: Scalable Diffusion Policy with Multi-stage Training for Language-driven Physics-based Humanoid Control
by: Zhang, Jingyan, et al.
Published: (2026)
UniAct: Unified Motion Generation and Action Streaming for Humanoid Robots
by: Jiang, Nan, et al.
Published: (2025)
XR-1: Towards Versatile Vision-Language-Action Models via Learning Unified Vision-Motion Representations
by: Fan, Shichao, et al.
Published: (2025)
Empathetic Motion Generation for Humanoid Educational Robots via Reasoning-Guided Vision--Language--Motion Diffusion Architecture
by: Sun, Fuze, et al.
Published: (2026)
A Universal Large Language Model -- Drone Command and Control Interface
by: Ramos-Silva, Javier N., et al.
Published: (2026)
Harmon: Whole-Body Motion Generation of Humanoid Robots from Language Descriptions
by: Jiang, Zhenyu, et al.
Published: (2024)
MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation
by: Liu, Yang, et al.
Published: (2026)
ELAN4D: Embodiment-Centric 4D Supervision for Vision-Language-Action Models via Plug-and-Play Adaptation
by: He, Zeyuan, et al.
Published: (2026)
Precise Robot Command Understanding Using Grammar-Constrained Large Language Models
by: Huo, Xinyun, et al.
Published: (2026)
RynnVLA-002: A Unified Vision-Language-Action and World Model
by: Cen, Jun, et al.
Published: (2025)
Unified Walking, Running, and Recovery for Humanoids via State-Dependent Adversarial Motion Priors
by: Lu, Yidan, et al.
Published: (2026)
BINDER: Instantly Adaptive Mobile Manipulation with Open-Vocabulary Commands
by: Cho, Seongwon, et al.
Published: (2025)
ElegantVLA: Learning When to Think for Efficient Vision-Language-Action Models
by: Li, Ye, et al.
Published: (2026)
RedVLA: Physical Red Teaming for Vision-Language-Action Models
by: Zhang, Yuhao, et al.
Published: (2026)
Token Expand-Merge: Training-Free Token Compression for Vision-Language-Action Models
by: Ye, Yifan, et al.
Published: (2025)
Unified Humanoid Fall-Safety Policy from a Few Demonstrations
by: Xu, Zhengjie, et al.
Published: (2025)
RLinf-VLA: A Unified and Efficient Framework for Reinforcement Learning of Vision-Language-Action Models
by: Zang, Hongzhi, et al.
Published: (2025)
Unified Vision-Language-Action Model
by: Wang, Yuqi, et al.
Published: (2025)
AffordDP: Generalizable Diffusion Policy with Transferable Affordance
by: Wu, Shijie, et al.
Published: (2024)
Adversarial Locomotion and Motion Imitation for Humanoid Policy Learning
by: Shi, Jiyuan, et al.
Published: (2025)
Optimal Scene Graph Planning with Large Language Model Guidance
by: Dai, Zhirui, et al.
Published: (2023)
UniTracker: Learning Universal Whole-Body Motion Tracker for Humanoid Robots
by: Yin, Kangning, et al.
Published: (2025)
RL from Physical Feedback: Aligning Large Motion Models with Humanoid Control
by: Yue, Junpeng, et al.
Published: (2025)
Towards Efficient Motion Planning for UAVs: Lazy A* Search with Motion Primitives
by: Wang, Wentao, et al.
Published: (2024)
ProbeFlow: Training-Free Adaptive Flow Matching for Vision-Language-Action Models
by: Fang, Zhou, et al.
Published: (2026)
Minimal Self in Humanoid Robot "Alter3" Driven by Large Language Model
by: Yoshida, Takahide, et al.
Published: (2024)
EGM: Efficiently Learning General Motion Tracking Policy for High Dynamic Humanoid Whole-Body Control
by: Yang, Chao, et al.
Published: (2025)
CO-RFT: Efficient Fine-Tuning of Vision-Language-Action Models through Chunked Offline Reinforcement Learning
by: Huang, Dongchi, et al.
Published: (2025)
Robust and Generalized Humanoid Motion Tracking
by: Ma, Yubiao, et al.
Published: (2026)
ACTLLM: Action Consistency Tuned Large Language Model
by: Bi, Jing, et al.
Published: (2025)
CoFreeVLA: Collision-Free Dual-Arm Manipulation via Vision-Language-Action Model and Risk Estimation
by: Zhai, Xuanran, et al.
Published: (2026)
Preference-Conditioned Multi-Objective RL for Integrated Command Tracking and Force Compliance in Humanoid Locomotion
by: Leng, Tingxuan, et al.
Published: (2025)
SENTINEL: A Fully End-to-End Language-Action Model for Humanoid Whole Body Control
by: Wang, Yuxuan, et al.
Published: (2025)
Retargeting Matters: General Motion Retargeting for Humanoid Motion Tracking
by: Araujo, Joao Pedro, et al.
Published: (2025)
GMT: General Motion Tracking for Humanoid Whole-Body Control
by: Chen, Zixuan, et al.
Published: (2025)
UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling
by: Chen, Boyu, et al.
Published: (2026)