| Main Authors: | Liu, Liu; Wang, Xiaofeng; Zhao, Guosheng; Li, Keyu; Qin, Wenkang; Zhu, Jiagang; Qiu, Jiaxiong; Zhu, Zheng; Huang, Guan; Su, Zhizhong |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.23171 |
Similar Items
DriveDreamer-Policy: A Geometry-Grounded World-Action Model for Unified Generation and Planning
by: Zhou, Yang, et al.
Published: (2026)
GLS: Geometry-aware 3D Language Gaussian Splatting
by: Qiu, Jiaxiong, et al.
Published: (2024)
ReconDreamer++: Harmonizing Generative and Reconstructive Models for Driving Scene Representation
by: Zhao, Guosheng, et al.
Published: (2025)
ReconDreamer-RL: Enhancing Reinforcement Learning via Diffusion-based Scene Reconstruction
by: Ni, Chaojun, et al.
Published: (2025)
DriveGen3D: Boosting Feed-Forward Driving Scene Generation with Efficient Video Diffusion
by: Wang, Weijie, et al.
Published: (2025)
EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer
by: Dong, Zhehao, et al.
Published: (2025)
Gaussian Object Carver: Object-Compositional Gaussian Splatting with surfaces completion
by: Liu, Liu, et al.
Published: (2024)
WonderTurbo: Generating Interactive 3D World in 0.72 Seconds
by: Ni, Chaojun, et al.
Published: (2025)
DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation
by: Yin, Ze-Xin, et al.
Published: (2025)
DeCo: Decoupled Human-Centered Diffusion Video Editing with Motion Consistency
by: Zhong, Xiaojing, et al.
Published: (2024)
DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation
by: Zhao, Guosheng, et al.
Published: (2024)
Cross Domain Policy Transfer with Effect Cycle-Consistency
by: Zhu, Ruiqi, et al.
Published: (2024)
DIPO: Dual-State Images Controlled Articulated Object Generation Powered by Diverse Data
by: Wu, Ruiqi, et al.
Published: (2025)
GeoFlow-SLAM: A Robust Tightly-Coupled RGBD-Inertial and Legged Odometry Fusion SLAM for Dynamic Legged Robotics
by: Xiao, Tingyang, et al.
Published: (2025)
HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation
by: Wang, Boyuan, et al.
Published: (2025)
EgoVid-5M: A Large-Scale Video-Action Dataset for Egocentric Video Generation
by: Wang, Xiaofeng, et al.
Published: (2024)
WonderFree: Enhancing Novel View Quality and Cross-View Consistency for 3D Scene Exploration
by: Ni, Chaojun, et al.
Published: (2025)
DataTransfer: Neural network based interpolation across non-nested meshes
by: Hao, Jiaxiong, et al.
Published: (2025)
AffordDP: Generalizable Diffusion Policy with Transferable Affordance
by: Wu, Shijie, et al.
Published: (2024)
VAG: Dual-Stream Video-Action Generation for Embodied Data Synthesis
by: Lang, Xiaolei, et al.
Published: (2026)
EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence
by: Wang, Xinjie, et al.
Published: (2025)
SEM: Enhancing Spatial Understanding for Robust Robot Manipulation
by: Lin, Xuewu, et al.
Published: (2025)
RoboTrustBench: Benchmarking the Trustworthiness of Video World Models for Robotic Manipulation
by: Li, Huiqiong, et al.
Published: (2026)
H-RDT: Human Manipulation Enhanced Bimanual Robotic Manipulation
by: Bi, Hongzhe, et al.
Published: (2025)
LAMARL: LLM-Aided Multi-Agent Reinforcement Learning for Cooperative Policy Generation
by: Zhu, Guobin, et al.
Published: (2025)
RoboTron-Mani: All-in-One Multimodal Large Model for Robotic Manipulation
by: Yan, Feng, et al.
Published: (2024)
Hierarchical Equivariant Policy via Frame Transfer
by: Zhao, Haibo, et al.
Published: (2025)
EmbodieDreamer: Advancing Real2Sim2Real Transfer for Policy Training via Embodied World Modeling
by: Wang, Boyuan, et al.
Published: (2025)
RoboRouter: Training-Free Policy Routing for Robotic Manipulation
by: Chen, Yiteng, et al.
Published: (2026)
Vidar: Embodied Video Diffusion Model for Generalist Manipulation
by: Feng, Yao, et al.
Published: (2025)
Can Video Diffusion Models Predict Past Frames? Bidirectional Cycle Consistency for Reversible Interpolation
by: Liu, Lingyu, et al.
Published: (2026)
ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration
by: Ni, Chaojun, et al.
Published: (2024)
EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation
by: Wang, Cong, et al.
Published: (2024)
Geometry-Aware Rotary Position Embedding for Consistent Video World Model
by: Xiang, Chendong, et al.
Published: (2026)
Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models
by: Bao, Fan, et al.
Published: (2024)
Can Video Diffusion Model Reconstruct 4D Geometry?
by: Mai, Jinjie, et al.
Published: (2025)
RoboPearls: Editable Video Simulation for Robot Manipulation
by: Tang, Tao, et al.
Published: (2025)
VidSplat: Gaussian Splatting Reconstruction with Geometry-Guided Video Diffusion Priors
by: Tang, Jimin, et al.
Published: (2026)
ADEPT: Adaptive Diffusion Environment for Policy Transfer Sim-to-Real
by: Yu, Youwei, et al.
Published: (2025)
FLUX-Makeup: High-Fidelity, Identity-Consistent, and Robust Makeup Transfer via Diffusion Transformer
by: Zhu, Jian, et al.
Published: (2025)