| Main Authors: | Chen, Boyu, Chen, Yi, Qiu, Lu, Bai, Jerry, Ge, Yuying, Ge, Yixiao |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.19734 |
Similar Items
DIAL: Decoupling Intent and Action via Latent World Modeling for End-to-End VLA
by: Chen, Yi, et al.
Published: (2026)
Humanoid Policy ~ Human Policy
by: Qiu, Ri-Zhao, et al.
Published: (2025)
EgoPlan-Bench2: A Benchmark for Multimodal Large Language Model Planning in Real-World Scenarios
by: Qiu, Lu, et al.
Published: (2024)
Moto: Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
by: Chen, Yi, et al.
Published: (2024)
UniJEPA: Enhancing Robot Policy via Unified Continuous and Discrete Representation Learning
by: Zhang, Jianke, et al.
Published: (2025)
AdaWorldPolicy: World-Model-Driven Diffusion Policy with Online Adaptive Learning for Robotic Manipulation
by: Yuan, Ge, et al.
Published: (2026)
Humanoid World Models: Open World Foundation Models for Humanoid Robotics
by: Ali, Muhammad Qasim, et al.
Published: (2025)
LHM-Humanoid: Learning a Unified Policy for Long-Horizon Humanoid Whole-Body Loco-Manipulation in Diverse Messy Environments
by: Zhang, Haozhuo, et al.
Published: (2025)
UniT: Unified Multimodal Chain-of-Thought Test-time Scaling
by: Chen, Leon Liangyu, et al.
Published: (2026)
Align-Then-stEer: Adapting the Vision-Language Action Models through Unified Latent Guidance
by: Zhang, Yang, et al.
Published: (2025)
Learning Smooth Humanoid Locomotion through Lipschitz-Constrained Policies
by: Chen, Zixuan, et al.
Published: (2024)
Coordinated Humanoid Manipulation with Choice Policies
by: Qi, Haozhi, et al.
Published: (2025)
Learning Human-Like Badminton Skills for Humanoid Robots
by: Chen, Yeke, et al.
Published: (2026)
Advancing Humanoid Locomotion: Mastering Challenging Terrains with Denoising World Model Learning
by: Gu, Xinyang, et al.
Published: (2024)
Commanding Humanoid by Free-form Language: A Large Language Action Model with Unified Motion Vocabulary
by: Liu, Zhirui, et al.
Published: (2025)
Mobile-TeleVision: Predictive Motion Priors for Humanoid Whole-Body Control
by: Lu, Chenhao, et al.
Published: (2024)
MetaWorld-X: Hierarchical World Modeling via VLM-Orchestrated Experts for Humanoid Loco-Manipulation
by: Shen, Yutong, et al.
Published: (2026)
Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1
by: Chen, Yi, et al.
Published: (2025)
Endowing GPT-4 with a Humanoid Body: Building the Bridge Between Off-the-Shelf VLMs and the Physical World
by: Jian, Yingzhao, et al.
Published: (2025)
Generative World Modelling for Humanoids: 1X World Model Challenge Technical Report
by: Mereu, Riccardo, et al.
Published: (2025)
Success in Humanoid Reinforcement Learning under Partial Observation
by: Wang, Wuhao, et al.
Published: (2025)
Human Cognition in Machines: A Unified Perspective of World Models
by: Rupprecht, Timothy, et al.
Published: (2026)
Learning Whole-Body Human-Humanoid Interaction from Human-Human Demonstrations
by: Huang, Wei-Jin, et al.
Published: (2026)
InterReal: A Unified Physics-Based Imitation Framework for Learning Human-Object Interaction Skills
by: Liang, Dayang, et al.
Published: (2026)
KungfuBot: Physics-Based Humanoid Whole-Body Control for Learning Highly-Dynamic Skills
by: Xie, Weiji, et al.
Published: (2025)
Towards Adaptable Humanoid Control via Adaptive Motion Tracking
by: Huang, Tao, et al.
Published: (2025)
Cognition to Control - Multi-Agent Learning for Human-Humanoid Collaborative Transport
by: Zhang, Hao, et al.
Published: (2026)
Learning Multi-Modal Whole-Body Control for Real-World Humanoid Robots
by: Dugar, Pranay, et al.
Published: (2024)
Learning Bilevel Policies over Symbolic World Models for Long-Horizon Planning
by: Chen, Dillon Z., et al.
Published: (2026)
UniT: Data Efficient Tactile Representation with Generalization to Unseen Objects
by: Xu, Zhengtong, et al.
Published: (2024)
Human-Humanoid Robots Cross-Embodiment Behavior-Skill Transfer Using Decomposed Adversarial Learning from Demonstration
by: Liu, Junjia, et al.
Published: (2024)
PhysHSI: Towards a Real-World Generalizable and Natural Humanoid-Scene Interaction System
by: Wang, Huayi, et al.
Published: (2025)
REFINE-DP: Diffusion Policy Fine-tuning for Humanoid Loco-manipulation via Reinforcement Learning
by: Gu, Zhaoyuan, et al.
Published: (2026)
SENTINEL: A Fully End-to-End Language-Action Model for Humanoid Whole Body Control
by: Wang, Yuxuan, et al.
Published: (2025)
Ego-Vision World Model for Humanoid Contact Planning
by: Liu, Hang, et al.
Published: (2025)
Incremental Learning of Humanoid Robot Behavior from Natural Interaction and Large Language Models
by: Bärmann, Leonard, et al.
Published: (2023)
Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning
by: Kim, Moo Jin, et al.
Published: (2026)
Act2Goal: From World Model To General Goal-conditioned Policy
by: Zhou, Pengfei, et al.
Published: (2025)
PRIOR: Perceptive Learning for Humanoid Locomotion with Reference Gait Priors
by: Han, Chenxi, et al.
Published: (2026)
ExBody2: Advanced Expressive Humanoid Whole-Body Control
by: Ji, Mazeyu, et al.
Published: (2024)