:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Liu, Zhirui, Ji, Kaiyang, Yang, Ke, Fan, Yahao, Yu, Jingyi, Shi, Ye, Wang, Jingya
Format:	Preprint
Published:	2025
Subjects:	Robotics Artificial Intelligence
Online Access:	https://arxiv.org/abs/2511.22963
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

DreamPolicy: A Unified World-model Policy for Scalable Humanoid Locomotion
by: Fan, Yahao, et al.
Published: (2025)

Towards Immersive Human-X Interaction: A Real-Time Framework for Physically Plausible Motion Synthesis
by: Ji, Kaiyang, et al.
Published: (2025)

UniHM: Unified Dexterous Hand Manipulation with Vision Language Model
by: Zhang, Zhenhao, et al.
Published: (2026)

A Unified Diffusion Framework for Scene-aware Human Motion Estimation from Sparse Signals
by: Tang, Jiangnan, et al.
Published: (2024)

SCRIPT: Scalable Diffusion Policy with Multi-stage Training for Language-driven Physics-based Humanoid Control
by: Zhang, Jingyan, et al.
Published: (2026)

UniAct: Unified Motion Generation and Action Streaming for Humanoid Robots
by: Jiang, Nan, et al.
Published: (2025)

XR-1: Towards Versatile Vision-Language-Action Models via Learning Unified Vision-Motion Representations
by: Fan, Shichao, et al.
Published: (2025)

Empathetic Motion Generation for Humanoid Educational Robots via Reasoning-Guided Vision--Language--Motion Diffusion Architecture
by: Sun, Fuze, et al.
Published: (2026)

A Universal Large Language Model -- Drone Command and Control Interface
by: Ramos-Silva, Javier N., et al.
Published: (2026)

Harmon: Whole-Body Motion Generation of Humanoid Robots from Language Descriptions
by: Jiang, Zhenyu, et al.
Published: (2024)

MMaDA-VLA: Large Diffusion Vision-Language-Action Model with Unified Multi-Modal Instruction and Generation
by: Liu, Yang, et al.
Published: (2026)

ELAN4D: Embodiment-Centric 4D Supervision for Vision-Language-Action Models via Plug-and-Play Adaptation
by: He, Zeyuan, et al.
Published: (2026)

Precise Robot Command Understanding Using Grammar-Constrained Large Language Models
by: Huo, Xinyun, et al.
Published: (2026)

RynnVLA-002: A Unified Vision-Language-Action and World Model
by: Cen, Jun, et al.
Published: (2025)

Unified Walking, Running, and Recovery for Humanoids via State-Dependent Adversarial Motion Priors
by: Lu, Yidan, et al.
Published: (2026)

BINDER: Instantly Adaptive Mobile Manipulation with Open-Vocabulary Commands
by: Cho, Seongwon, et al.
Published: (2025)

ElegantVLA: Learning When to Think for Efficient Vision-Language-Action Models
by: Li, Ye, et al.
Published: (2026)

RedVLA: Physical Red Teaming for Vision-Language-Action Models
by: Zhang, Yuhao, et al.
Published: (2026)

Token Expand-Merge: Training-Free Token Compression for Vision-Language-Action Models
by: Ye, Yifan, et al.
Published: (2025)

Unified Humanoid Fall-Safety Policy from a Few Demonstrations
by: Xu, Zhengjie, et al.
Published: (2025)

RLinf-VLA: A Unified and Efficient Framework for Reinforcement Learning of Vision-Language-Action Models
by: Zang, Hongzhi, et al.
Published: (2025)

Unified Vision-Language-Action Model
by: Wang, Yuqi, et al.
Published: (2025)

AffordDP: Generalizable Diffusion Policy with Transferable Affordance
by: Wu, Shijie, et al.
Published: (2024)

Adversarial Locomotion and Motion Imitation for Humanoid Policy Learning
by: Shi, Jiyuan, et al.
Published: (2025)

Optimal Scene Graph Planning with Large Language Model Guidance
by: Dai, Zhirui, et al.
Published: (2023)

UniTracker: Learning Universal Whole-Body Motion Tracker for Humanoid Robots
by: Yin, Kangning, et al.
Published: (2025)

RL from Physical Feedback: Aligning Large Motion Models with Humanoid Control
by: Yue, Junpeng, et al.
Published: (2025)

Towards Efficient Motion Planning for UAVs: Lazy A* Search with Motion Primitives
by: Wang, Wentao, et al.
Published: (2024)

ProbeFlow: Training-Free Adaptive Flow Matching for Vision-Language-Action Models
by: Fang, Zhou, et al.
Published: (2026)

Minimal Self in Humanoid Robot "Alter3" Driven by Large Language Model
by: Yoshida, Takahide, et al.
Published: (2024)

EGM: Efficiently Learning General Motion Tracking Policy for High Dynamic Humanoid Whole-Body Control
by: Yang, Chao, et al.
Published: (2025)

CO-RFT: Efficient Fine-Tuning of Vision-Language-Action Models through Chunked Offline Reinforcement Learning
by: Huang, Dongchi, et al.
Published: (2025)

Robust and Generalized Humanoid Motion Tracking
by: Ma, Yubiao, et al.
Published: (2026)

ACTLLM: Action Consistency Tuned Large Language Model
by: Bi, Jing, et al.
Published: (2025)

CoFreeVLA: Collision-Free Dual-Arm Manipulation via Vision-Language-Action Model and Risk Estimation
by: Zhai, Xuanran, et al.
Published: (2026)

Preference-Conditioned Multi-Objective RL for Integrated Command Tracking and Force Compliance in Humanoid Locomotion
by: Leng, Tingxuan, et al.
Published: (2025)

SENTINEL: A Fully End-to-End Language-Action Model for Humanoid Whole Body Control
by: Wang, Yuxuan, et al.
Published: (2025)

Retargeting Matters: General Motion Retargeting for Humanoid Motion Tracking
by: Araujo, Joao Pedro, et al.
Published: (2025)

GMT: General Motion Tracking for Humanoid Whole-Body Control
by: Chen, Zixuan, et al.
Published: (2025)

UniT: Toward a Unified Physical Language for Human-to-Humanoid Policy Learning and World Modeling
by: Chen, Boyu, et al.
Published: (2026)