Saved in:
| Main Authors: | Guan, Jiarui, Zhao, Wenshuai, Pei, Yue, Chen, Ziliang, Solin, Arno, Kannala, Juho |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.23856 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Latent-Compressed Variational Autoencoder for Video Diffusion Models
by: Guan, Jiarui, et al.
Published: (2026)
by: Guan, Jiarui, et al.
Published: (2026)
Efficient Reinforcement Learning by Guiding Generalist World Models with Non-Curated Data
by: Zhao, Yi, et al.
Published: (2025)
by: Zhao, Yi, et al.
Published: (2025)
Sources of Uncertainty in 3D Scene Reconstruction
by: Klasson, Marcus, et al.
Published: (2024)
by: Klasson, Marcus, et al.
Published: (2024)
Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection
by: Kok, Manon, et al.
Published: (2024)
by: Kok, Manon, et al.
Published: (2024)
Sparsely Supervised Diffusion
by: Zhao, Wenshuai, et al.
Published: (2026)
by: Zhao, Wenshuai, et al.
Published: (2026)
Rao-Blackwellized Particle Smoothing for Simultaneous Localization and Mapping
by: Kok, Manon, et al.
Published: (2023)
by: Kok, Manon, et al.
Published: (2023)
Distant Object Localisation from Noisy Image Segmentation Sequences
by: Pesonen, Julius, et al.
Published: (2025)
by: Pesonen, Julius, et al.
Published: (2025)
FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking
by: Gunes, Ulas, et al.
Published: (2025)
by: Gunes, Ulas, et al.
Published: (2025)
HybVIO: Pushing the Limits of Real-time Visual-inertial Odometry
by: Seiskari, Otto, et al.
Published: (2021)
by: Seiskari, Otto, et al.
Published: (2021)
DeSplat: Decomposed Gaussian Splatting for Distractor-Free Rendering
by: Wang, Yihao, et al.
Published: (2024)
by: Wang, Yihao, et al.
Published: (2024)
Continuous Monte Carlo Graph Search
by: Kujanpää, Kalle, et al.
Published: (2022)
by: Kujanpää, Kalle, et al.
Published: (2022)
RP1M: A Large-Scale Motion Dataset for Piano Playing with Bi-Manual Dexterous Robot Hands
by: Zhao, Yi, et al.
Published: (2024)
by: Zhao, Yi, et al.
Published: (2024)
Dexterous Robotic Piano Playing at Scale
by: Chen, Le, et al.
Published: (2025)
by: Chen, Le, et al.
Published: (2025)
Optimistic Multi-Agent Policy Gradient
by: Zhao, Wenshuai, et al.
Published: (2023)
by: Zhao, Wenshuai, et al.
Published: (2023)
Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models
by: Dong, Hao, et al.
Published: (2025)
by: Dong, Hao, et al.
Published: (2025)
SuperFusion: Multilevel LiDAR-Camera Fusion for Long-Range HD Map Generation
by: Dong, Hao, et al.
Published: (2022)
by: Dong, Hao, et al.
Published: (2022)
PAWS: Perception of Articulation in the Wild at Scale from Egocentric Videos
by: Wang, Yihao, et al.
Published: (2026)
by: Wang, Yihao, et al.
Published: (2026)
Gaussian Splatting on the Move: Blur and Rolling Shutter Compensation for Natural Camera Motion
by: Seiskari, Otto, et al.
Published: (2024)
by: Seiskari, Otto, et al.
Published: (2024)
When to Trust Imagination: Adaptive Action Execution for World Action Models
by: Wang, Rui, et al.
Published: (2026)
by: Wang, Rui, et al.
Published: (2026)
WorldVLA: Towards Autoregressive Action World Model
by: Cen, Jun, et al.
Published: (2025)
by: Cen, Jun, et al.
Published: (2025)
Bi-Level Motion Imitation for Humanoid Robots
by: Zhao, Wenshuai, et al.
Published: (2024)
by: Zhao, Wenshuai, et al.
Published: (2024)
World Guidance: World Modeling in Condition Space for Action Generation
by: Su, Yue, et al.
Published: (2026)
by: Su, Yue, et al.
Published: (2026)
RISE: Self-Improving Robot Policy with Compositional World Model
by: Yang, Jiazhi, et al.
Published: (2026)
by: Yang, Jiazhi, et al.
Published: (2026)
PointACT: Vision-Language-Action Models with Multi-Scale Point-Action Interaction
by: Chen, Shizhe, et al.
Published: (2026)
by: Chen, Shizhe, et al.
Published: (2026)
Track A*: Fast Visibility-Aware Trajectory Planning for Active Target Tracking
by: Chen, Hanxuan, et al.
Published: (2026)
by: Chen, Hanxuan, et al.
Published: (2026)
World Action Verifier: Self-Improving World Models via Forward-Inverse Asymmetry
by: Liu, Yuejiang, et al.
Published: (2026)
by: Liu, Yuejiang, et al.
Published: (2026)
Bridging the Embodiment Gap: Disentangled Cross-Embodiment Video Editing
by: Li, Zhiyuan, et al.
Published: (2026)
by: Li, Zhiyuan, et al.
Published: (2026)
Improving Discrete Diffusion Models via Structured Preferential Generation
by: Rissanen, Severi, et al.
Published: (2024)
by: Rissanen, Severi, et al.
Published: (2024)
$τ_0$-WM: A Unified Video-Action World Model for Robotic Manipulation
by: Zhou, Pengfei, et al.
Published: (2026)
by: Zhou, Pengfei, et al.
Published: (2026)
Registering the 4D Millimeter Wave Radar Point Clouds Via Generalized Method of Moments
by: Li, Xingyi, et al.
Published: (2025)
by: Li, Xingyi, et al.
Published: (2025)
NoiseGate: Learning Per-Latent Timestep Schedules as Information Gating in World Action Models
by: Huang, Wen, et al.
Published: (2026)
by: Huang, Wen, et al.
Published: (2026)
RynnVLA-002: A Unified Vision-Language-Action and World Model
by: Cen, Jun, et al.
Published: (2025)
by: Cen, Jun, et al.
Published: (2025)
MotuBrain: An Advanced World Action Model for Robot Control
by: MotuBrain Team, et al.
Published: (2026)
by: MotuBrain Team, et al.
Published: (2026)
PointVLA: Injecting the 3D World into Vision-Language-Action Models
by: Li, Chengmeng, et al.
Published: (2025)
by: Li, Chengmeng, et al.
Published: (2025)
Privileged Foresight Distillation: Zero-Cost Future Correction for World Action Models
by: Fang, Pengcheng, et al.
Published: (2026)
by: Fang, Pengcheng, et al.
Published: (2026)
WorldVLN: Autoregressive World Action Model for Aerial Vision-Language Navigation
by: Zhao, Baining, et al.
Published: (2026)
by: Zhao, Baining, et al.
Published: (2026)
A Survey on Vision-Language-Action Models: An Action Tokenization Perspective
by: Zhong, Yifan, et al.
Published: (2025)
by: Zhong, Yifan, et al.
Published: (2025)
STARRY: Spatial-Temporal Action-Centric World Modeling for Robotic Manipulation
by: Tian, Yuxuan, et al.
Published: (2026)
by: Tian, Yuxuan, et al.
Published: (2026)
Learning to Approximate Particle Smoothing Trajectories via Diffusion Generative Models
by: Tamir, Ella, et al.
Published: (2024)
by: Tamir, Ella, et al.
Published: (2024)
VLAW: Iterative Co-Improvement of Vision-Language-Action Policy and World Model
by: Guo, Yanjiang, et al.
Published: (2026)
by: Guo, Yanjiang, et al.
Published: (2026)
Similar Items
-
Latent-Compressed Variational Autoencoder for Video Diffusion Models
by: Guan, Jiarui, et al.
Published: (2026) -
Efficient Reinforcement Learning by Guiding Generalist World Models with Non-Curated Data
by: Zhao, Yi, et al.
Published: (2025) -
Sources of Uncertainty in 3D Scene Reconstruction
by: Klasson, Marcus, et al.
Published: (2024) -
Online One-Dimensional Magnetic Field SLAM with Loop-Closure Detection
by: Kok, Manon, et al.
Published: (2024) -
Sparsely Supervised Diffusion
by: Zhao, Wenshuai, et al.
Published: (2026)