Saved in:
| Main Authors: | Dong, Jiahua, Man, Yunze, Tokmakov, Pavel, Wang, Yu-Xiong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.04880 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception
by: Man, Yunze, et al.
Published: (2023)
by: Man, Yunze, et al.
Published: (2023)
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
by: Man, Yunze, et al.
Published: (2024)
by: Man, Yunze, et al.
Published: (2024)
Video Generators are Robot Policies
by: Liang, Junbang, et al.
Published: (2025)
by: Liang, Junbang, et al.
Published: (2025)
AnchorDream: Repurposing Video Diffusion for Embodiment-Aware Robot Data Synthesis
by: Ye, Junjie, et al.
Published: (2025)
by: Ye, Junjie, et al.
Published: (2025)
Understanding Video Transformers via Universal Concept Discovery
by: Kowal, Matthew, et al.
Published: (2024)
by: Kowal, Matthew, et al.
Published: (2024)
Dreamitate: Real-World Visuomotor Policy Learning via Video Generation
by: Liang, Junbang, et al.
Published: (2024)
by: Liang, Junbang, et al.
Published: (2024)
Reactive Planning based Control for Mobile Robots in Obstacle-Cluttered Environments
by: Tan, Li, et al.
Published: (2026)
by: Tan, Li, et al.
Published: (2026)
Real-Time Auto-Optimization in Unknown Environments via Structure-Exploiting Dual Control for Exploration and Exploitation
by: Dong, Shiying, et al.
Published: (2026)
by: Dong, Shiying, et al.
Published: (2026)
RoboDream: Compositional World Models for Scalable Robot Data Synthesis
by: Ye, Junjie, et al.
Published: (2026)
by: Ye, Junjie, et al.
Published: (2026)
GeoMatch++: Morphology Conditioned Geometry Matching for Multi-Embodiment Grasping
by: Wei, Yunze, et al.
Published: (2024)
by: Wei, Yunze, et al.
Published: (2024)
SwarmDiff: Swarm Robotic Trajectory Planning in Cluttered Environments via Diffusion Transformer
by: Ding, Kang, et al.
Published: (2025)
by: Ding, Kang, et al.
Published: (2025)
Do Visual-Language Grid Maps Capture Latent Semantics?
by: Pekkanen, Matti, et al.
Published: (2024)
by: Pekkanen, Matti, et al.
Published: (2024)
Evaluating Collaborative Autonomy in Opposed Environments using Maritime Capture-the-Flag Competitions
by: Beason, Jordan, et al.
Published: (2024)
by: Beason, Jordan, et al.
Published: (2024)
Never-Ending Behavior-Cloning Agent for Robotic Manipulation
by: Liang, Wenqi, et al.
Published: (2024)
by: Liang, Wenqi, et al.
Published: (2024)
R-VoxelMap: Accurate Voxel Mapping with Recursive Plane Fitting for Online LiDAR Odometry
by: Xi, Haobo, et al.
Published: (2026)
by: Xi, Haobo, et al.
Published: (2026)
Capture Point Control in Thruster-Assisted Bipedal Locomotion
by: Pitroda, Shreyansh, et al.
Published: (2024)
by: Pitroda, Shreyansh, et al.
Published: (2024)
Lifelong Embodied Navigation Learning
by: Wang, Xudong, et al.
Published: (2026)
by: Wang, Xudong, et al.
Published: (2026)
HAFO: A Force-Adaptive Control Framework for Humanoid Robots in Intense Interaction Environments
by: Dong, Chenhui, et al.
Published: (2025)
by: Dong, Chenhui, et al.
Published: (2025)
Control-Barrier-Aided Teleoperation with Visual-Inertial SLAM for Safe MAV Navigation in Complex Environments
by: Zhou, Siqi, et al.
Published: (2024)
by: Zhou, Siqi, et al.
Published: (2024)
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis
by: Van Hoorick, Basile, et al.
Published: (2024)
by: Van Hoorick, Basile, et al.
Published: (2024)
Design and Control of a Perching Drone Inspired by the Prey-Capturing Mechanism of Venus Flytrap
by: Li, Ye, et al.
Published: (2025)
by: Li, Ye, et al.
Published: (2025)
Enhanced Capture Point Control Using Thruster Dynamics and QP-Based Optimization for Harpy
by: Pitroda, Shreyansh, et al.
Published: (2024)
by: Pitroda, Shreyansh, et al.
Published: (2024)
Collision-Free Robot Navigation in Crowded Environments using Learning based Convex Model Predictive Control
by: Wen, Zhuanglei, et al.
Published: (2024)
by: Wen, Zhuanglei, et al.
Published: (2024)
Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning
by: Huang, Chi-Pin, et al.
Published: (2026)
by: Huang, Chi-Pin, et al.
Published: (2026)
HE-Nav: A High-Performance and Efficient Navigation System for Aerial-Ground Robots in Cluttered Environments
by: Wang, Junming, et al.
Published: (2024)
by: Wang, Junming, et al.
Published: (2024)
Advancing Audio-Visual Navigation Through Multi-Agent Collaboration in 3D Environments
by: Zhang, Hailong, et al.
Published: (2025)
by: Zhang, Hailong, et al.
Published: (2025)
Control Synthesis in Partially Observable Environments for Complex Perception-Related Objectives
by: Xuan, Zetong, et al.
Published: (2025)
by: Xuan, Zetong, et al.
Published: (2025)
Offline-Online Hierarchical 3D Global Relocalization With Synthetic LiDAR Sensing and Descriptor-Space Retrieval
by: Ren, Jiahua, et al.
Published: (2026)
by: Ren, Jiahua, et al.
Published: (2026)
A Robotic Cyber-Physical System for Automated Reality Capture and Visualization in Construction Progress Monitoring
by: Halder, Srijeet, et al.
Published: (2024)
by: Halder, Srijeet, et al.
Published: (2024)
RESC: A Reinforcement Learning Based Search-to-Control Framework for Quadrotor Local Planning in Dense Environments
by: Liu, Zhaohong, et al.
Published: (2024)
by: Liu, Zhaohong, et al.
Published: (2024)
HMT-Grasp: A Hybrid Mamba-Transformer Approach for Robot Grasping in Cluttered Environments
by: Xiong, Songsong, et al.
Published: (2024)
by: Xiong, Songsong, et al.
Published: (2024)
Generalizable Collaborative Search-and-Capture in Cluttered Environments via Path-Guided MAPPO and Directional Frontier Allocation
by: Ying, Jialin, et al.
Published: (2025)
by: Ying, Jialin, et al.
Published: (2025)
AnyView: Synthesizing Any Novel View in Dynamic Scenes
by: Van Hoorick, Basile, et al.
Published: (2026)
by: Van Hoorick, Basile, et al.
Published: (2026)
Frozen Transformers in Language Models Are Effective Visual Encoder Layers
by: Pang, Ziqi, et al.
Published: (2023)
by: Pang, Ziqi, et al.
Published: (2023)
Multi-Robot Rendezvous in Unknown Environment with Limited Communication
by: Song, Kun, et al.
Published: (2024)
by: Song, Kun, et al.
Published: (2024)
Ubiquitous Robot Control Through Multimodal Motion Capture Using Smartwatch and Smartphone Data
by: Weigend, Fabian C, et al.
Published: (2024)
by: Weigend, Fabian C, et al.
Published: (2024)
RGBGrasp: Image-based Object Grasping by Capturing Multiple Views during Robot Arm Movement with Neural Radiance Fields
by: Liu, Chang, et al.
Published: (2023)
by: Liu, Chang, et al.
Published: (2023)
OVAL: Open-Vocabulary Augmented Memory Model for Lifelong Object Goal Navigation
by: Pei, Jiahua, et al.
Published: (2026)
by: Pei, Jiahua, et al.
Published: (2026)
Visualizing Latent Phase Structures in Locomotion Policies: A Multi-Environment Study with Temporal Feature Extension
by: Yasui, Daisuke, et al.
Published: (2026)
by: Yasui, Daisuke, et al.
Published: (2026)
GeNIE: A Generalizable Navigation System for In-the-Wild Environments
by: Wang, Jiaming, et al.
Published: (2025)
by: Wang, Jiaming, et al.
Published: (2025)
Similar Items
-
DualCross: Cross-Modality Cross-Domain Adaptation for Monocular BEV Perception
by: Man, Yunze, et al.
Published: (2023) -
Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding
by: Man, Yunze, et al.
Published: (2024) -
Video Generators are Robot Policies
by: Liang, Junbang, et al.
Published: (2025) -
AnchorDream: Repurposing Video Diffusion for Embodiment-Aware Robot Data Synthesis
by: Ye, Junjie, et al.
Published: (2025) -
Understanding Video Transformers via Universal Concept Discovery
by: Kowal, Matthew, et al.
Published: (2024)