:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Guan, Jiarui, Zhao, Wenshuai, Pei, Yue, Chen, Ziliang, Solin, Arno, Kannala, Juho
Format:	Preprint
Published:	2026
Subjects:	Robotics
Online Access:	https://arxiv.org/abs/2605.23856
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Latent-Compressed Variational Autoencoder for Video Diffusion Models
by: Guan, Jiarui, et al.
Published: (2026)

Efficient Reinforcement Learning by Guiding Generalist World Models with Non-Curated Data
by: Zhao, Yi, et al.
Published: (2025)

Sources of Uncertainty in 3D Scene Reconstruction
by: Klasson, Marcus, et al.
Published: (2024)

Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection
by: Kok, Manon, et al.
Published: (2024)

Sparsely Supervised Diffusion
by: Zhao, Wenshuai, et al.
Published: (2026)

Rao-Blackwellized Particle Smoothing for Simultaneous Localization and Mapping
by: Kok, Manon, et al.
Published: (2023)

Distant Object Localisation from Noisy Image Segmentation Sequences
by: Pesonen, Julius, et al.
Published: (2025)

FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking
by: Gunes, Ulas, et al.
Published: (2025)

HybVIO: Pushing the Limits of Real-time Visual-inertial Odometry
by: Seiskari, Otto, et al.
Published: (2021)

DeSplat: Decomposed Gaussian Splatting for Distractor-Free Rendering
by: Wang, Yihao, et al.
Published: (2024)

Continuous Monte Carlo Graph Search
by: Kujanpää, Kalle, et al.
Published: (2022)

RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands
by: Zhao, Yi, et al.
Published: (2024)

Dexterous Robotic Piano Playing at Scale
by: Chen, Le, et al.
Published: (2025)

Optimistic Multi-Agent Policy Gradient
by: Zhao, Wenshuai, et al.
Published: (2023)

Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models
by: Dong, Hao, et al.
Published: (2025)

SuperFusion: Multilevel LiDAR-Camera Fusion for Long-Range HD Map Generation
by: Dong, Hao, et al.
Published: (2022)

PAWS: Perception of Articulation in the Wild at Scale from Egocentric Videos
by: Wang, Yihao, et al.
Published: (2026)

Gaussian Splatting on the Move: Blur and Rolling Shutter Compensation for Natural Camera Motion
by: Seiskari, Otto, et al.
Published: (2024)

When to Trust Imagination: Adaptive Action Execution for World Action Models
by: Wang, Rui, et al.
Published: (2026)

WorldVLA: Towards Autoregressive Action World Model
by: Cen, Jun, et al.
Published: (2025)

Bi-Level Motion Imitation for Humanoid Robots
by: Zhao, Wenshuai, et al.
Published: (2024)

World Guidance: World Modeling in Condition Space for Action Generation
by: Su, Yue, et al.
Published: (2026)

RISE: Self-Improving Robot Policy with Compositional World Model
by: Yang, Jiazhi, et al.
Published: (2026)

PointACT: Vision-Language-Action Models with Multi-Scale Point-Action Interaction
by: Chen, Shizhe, et al.
Published: (2026)

Track A*: Fast Visibility-Aware Trajectory Planning for Active Target Tracking
by: Chen, Hanxuan, et al.
Published: (2026)

World Action Verifier: Self-Improving World Models via Forward-Inverse Asymmetry
by: Liu, Yuejiang, et al.
Published: (2026)

Bridging the Embodiment Gap: Disentangled Cross-Embodiment Video Editing
by: Li, Zhiyuan, et al.
Published: (2026)

Improving Discrete Diffusion Models via Structured Preferential Generation
by: Rissanen, Severi, et al.
Published: (2024)

$τ_0$-WM: A Unified Video-Action World Model for Robotic Manipulation
by: Zhou, Pengfei, et al.
Published: (2026)

Registering the 4D Millimeter Wave Radar Point Clouds Via Generalized Method of Moments
by: Li, Xingyi, et al.
Published: (2025)

NoiseGate: Learning Per-Latent Timestep Schedules as Information Gating in World Action Models
by: Huang, Wen, et al.
Published: (2026)

RynnVLA-002: A Unified Vision-Language-Action and World Model
by: Cen, Jun, et al.
Published: (2025)

MotuBrain: An Advanced World Action Model for Robot Control
by: MotuBrain Team, et al.
Published: (2026)

PointVLA: Injecting the 3D World into Vision-Language-Action Models
by: Li, Chengmeng, et al.
Published: (2025)

Privileged Foresight Distillation: Zero-Cost Future Correction for World Action Models
by: Fang, Pengcheng, et al.
Published: (2026)

WorldVLN: Autoregressive World Action Model for Aerial Vision-Language Navigation
by: Zhao, Baining, et al.
Published: (2026)

A Survey on Vision-Language-Action Models: An Action Tokenization Perspective
by: Zhong, Yifan, et al.
Published: (2025)

STARRY: Spatial-Temporal Action-Centric World Modeling for Robotic Manipulation
by: Tian, Yuxuan, et al.
Published: (2026)

Learning to Approximate Particle Smoothing Trajectories via Diffusion Generative Models
by: Tamir, Ella, et al.
Published: (2024)

VLAW: Iterative Co-Improvement of Vision-Language-Action Policy and World Model
by: Guo, Yanjiang, et al.
Published: (2026)