Saved in:
| Main Authors: | Shen, Yutong, Liu, Hangxu, Zhang, Lei, Liu, Penghui, Liu, Yinqi, Yang, Liuxiang, Feng, Tongtong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.20721 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DETACH: Cross-domain Learning for Long-Horizon Tasks via Mixture of Disentangled Experts
by: Shen, Yutong, et al.
Published: (2025)
by: Shen, Yutong, et al.
Published: (2025)
MetaWorld: Skill Transfer and Composition in a Hierarchical World Model for Grounding High-Level Instructions
by: Shen, Yutong, et al.
Published: (2026)
by: Shen, Yutong, et al.
Published: (2026)
MetaWorld-X: Hierarchical World Modeling via VLM-Orchestrated Experts for Humanoid Loco-Manipulation
by: Shen, Yutong, et al.
Published: (2026)
by: Shen, Yutong, et al.
Published: (2026)
Efficient Long-Horizon Vision-Language-Action Models via Static-Dynamic Disentanglement
by: Qiu, Weikang, et al.
Published: (2026)
by: Qiu, Weikang, et al.
Published: (2026)
Goal2Skill: Long-Horizon Manipulation with Adaptive Planning and Reflection
by: Liu, Zhen, et al.
Published: (2026)
by: Liu, Zhen, et al.
Published: (2026)
Anticipation-VLA: Solving Long-Horizon Embodied Tasks via Anticipation-based Subgoal Generation
by: Zhang, Zhilong, et al.
Published: (2026)
by: Zhang, Zhilong, et al.
Published: (2026)
Action-Sketcher: From Reasoning to Action via Visual Sketches for Long-Horizon Robotic Manipulation
by: Tan, Huajie, et al.
Published: (2026)
by: Tan, Huajie, et al.
Published: (2026)
Trace-Focused Diffusion Policy for Multi-Modal Action Disambiguation in Long-Horizon Robotic Manipulation
by: Hu, Yuxuan, et al.
Published: (2026)
by: Hu, Yuxuan, et al.
Published: (2026)
BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation
by: Hu, Yucheng, et al.
Published: (2026)
by: Hu, Yucheng, et al.
Published: (2026)
Long-VLA: Unleashing Long-Horizon Capability of Vision Language Action Model for Robot Manipulation
by: Fan, Yiguo, et al.
Published: (2025)
by: Fan, Yiguo, et al.
Published: (2025)
AsyncVLA: Asynchronous Flow Matching for Vision-Language-Action Models
by: Jiang, Yuhua, et al.
Published: (2025)
by: Jiang, Yuhua, et al.
Published: (2025)
EvolvingAgent: Curriculum Self-evolving Agent with Continual World Model for Long-Horizon Tasks
by: Feng, Tongtong, et al.
Published: (2025)
by: Feng, Tongtong, et al.
Published: (2025)
Learning Long-Horizon Robot Manipulation Skills via Privileged Action
by: Mao, Xiaofeng, et al.
Published: (2025)
by: Mao, Xiaofeng, et al.
Published: (2025)
Non-Markovian Long-Horizon Robot Manipulation via Keyframe Chaining
by: Chen, Yipeng, et al.
Published: (2026)
by: Chen, Yipeng, et al.
Published: (2026)
LongNav-R1: Horizon-Adaptive Multi-Turn RL for Long-Horizon VLA Navigation
by: Hu, Yue, et al.
Published: (2026)
by: Hu, Yue, et al.
Published: (2026)
Long-Horizon Manipulation via Trace-Conditioned VLA Planning
by: Liu, Isabella, et al.
Published: (2026)
by: Liu, Isabella, et al.
Published: (2026)
VLM-TDP: VLM-guided Trajectory-conditioned Diffusion Policy for Robust Long-Horizon Manipulation
by: Huang, Kefeng, et al.
Published: (2025)
by: Huang, Kefeng, et al.
Published: (2025)
SeqVLA: Sequential Task Execution for Long-Horizon Manipulation with Completion-Aware Vision-Language-Action Model
by: Yang, Ran, et al.
Published: (2025)
by: Yang, Ran, et al.
Published: (2025)
Beyond Policy Optimization: A Data Curation Flywheel for Sparse-Reward Long-Horizon Planning
by: Wang, Yutong, et al.
Published: (2025)
by: Wang, Yutong, et al.
Published: (2025)
Mixture of Horizons in Action Chunking
by: Jing, Dong, et al.
Published: (2025)
by: Jing, Dong, et al.
Published: (2025)
DeepSight: Long-Horizon World Modeling via Latent States Prediction for End-to-End Autonomous Driving
by: Zhang, Lingjun, et al.
Published: (2026)
by: Zhang, Lingjun, et al.
Published: (2026)
StreamingVLA: Streaming Vision-Language-Action Model with Action Flow Matching and Adaptive Early Observation
by: Shi, Yiran, et al.
Published: (2026)
by: Shi, Yiran, et al.
Published: (2026)
AsyncShield: A Plug-and-Play Edge Adapter for Asynchronous Cloud-based VLA Navigation
by: Yang, Kai, et al.
Published: (2026)
by: Yang, Kai, et al.
Published: (2026)
VAG: Dual-Stream Video-Action Generation for Embodied Data Synthesis
by: Lang, Xiaolei, et al.
Published: (2026)
by: Lang, Xiaolei, et al.
Published: (2026)
ELVIS: Ensemble-Calibrated Latent Imagination for Long-Horizon Visual MPC
by: Du, Yurui, et al.
Published: (2026)
by: Du, Yurui, et al.
Published: (2026)
PALM: Progress-Aware Policy Learning via Affordance Reasoning for Long-Horizon Robotic Manipulation
by: Liu, Yuanzhe, et al.
Published: (2026)
by: Liu, Yuanzhe, et al.
Published: (2026)
F\textsuperscript{2}LP-AP: Fast \& Flexible Label Propagation with Adaptive Propagation Kernel
by: Shen, Yutong, et al.
Published: (2026)
by: Shen, Yutong, et al.
Published: (2026)
Any House Any Task: Scalable Long-Horizon Planning for Abstract Human Tasks
by: Liu, Zhihong, et al.
Published: (2026)
by: Liu, Zhihong, et al.
Published: (2026)
FORGE-Tree: Diffusion-Forcing Tree Search for Long-Horizon Robot Manipulation
by: Huang, Yanjia, et al.
Published: (2025)
by: Huang, Yanjia, et al.
Published: (2025)
HarmoWAM: Harmonizing Generalizable and Precise Manipulation via Adaptive World Action Models
by: Feng, Qiuxuan, et al.
Published: (2026)
by: Feng, Qiuxuan, et al.
Published: (2026)
LoLA: Long Horizon Latent Action Learning for General Robot Manipulation
by: Wang, Xiaofan, et al.
Published: (2025)
by: Wang, Xiaofan, et al.
Published: (2025)
AsyncVLA: An Asynchronous VLA for Fast and Robust Navigation on the Edge
by: Hirose, Noriaki, et al.
Published: (2026)
by: Hirose, Noriaki, et al.
Published: (2026)
TempoFit: Plug-and-Play Layer-Wise Temporal KV Memory for Long-Horizon Vision-Language-Action Manipulation
by: Sun, Jun, et al.
Published: (2026)
by: Sun, Jun, et al.
Published: (2026)
Triple-S: A Collaborative Multi-LLM Framework for Solving Long-Horizon Implicative Tasks in Robotics
by: Jia, Zixi, et al.
Published: (2025)
by: Jia, Zixi, et al.
Published: (2025)
Discrete Policy: Learning Disentangled Action Space for Multi-Task Robotic Manipulation
by: Wu, Kun, et al.
Published: (2024)
by: Wu, Kun, et al.
Published: (2024)
Learning Multi-Agent Loco-Manipulation for Long-Horizon Quadrupedal Pushing
by: Feng, Yuming, et al.
Published: (2024)
by: Feng, Yuming, et al.
Published: (2024)
Label-Free Long-Horizon 3D UAV Trajectory Prediction via Motion-Aligned RGB and Event Cues
by: Liang, Hanfang, et al.
Published: (2025)
by: Liang, Hanfang, et al.
Published: (2025)
CookBench: A Long-Horizon Embodied Planning Benchmark for Complex Cooking Scenarios
by: Cai, Muzhen, et al.
Published: (2025)
by: Cai, Muzhen, et al.
Published: (2025)
ESCAPE: Episodic Spatial Memory and Adaptive Execution Policy for Long-Horizon Mobile Manipulation
by: Qian, Jingjing, et al.
Published: (2026)
by: Qian, Jingjing, et al.
Published: (2026)
AsyncMDE: Real-Time Monocular Depth Estimation via Asynchronous Spatial Memory
by: Ma, Lianjie, et al.
Published: (2026)
by: Ma, Lianjie, et al.
Published: (2026)
Similar Items
-
DETACH: Cross-domain Learning for Long-Horizon Tasks via Mixture of Disentangled Experts
by: Shen, Yutong, et al.
Published: (2025) -
MetaWorld: Skill Transfer and Composition in a Hierarchical World Model for Grounding High-Level Instructions
by: Shen, Yutong, et al.
Published: (2026) -
MetaWorld-X: Hierarchical World Modeling via VLM-Orchestrated Experts for Humanoid Loco-Manipulation
by: Shen, Yutong, et al.
Published: (2026) -
Efficient Long-Horizon Vision-Language-Action Models via Static-Dynamic Disentanglement
by: Qiu, Weikang, et al.
Published: (2026) -
Goal2Skill: Long-Horizon Manipulation with Adaptive Planning and Reflection
by: Liu, Zhen, et al.
Published: (2026)