Saved in:
| Main Authors: | Xu, Xiuwei, Ma, Angyuan, Li, Hankun, Yu, Bingyao, Zhu, Zheng, Zhou, Jie, Lu, Jiwen |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.08547 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ShapeGen: Robotic Data Generation for Category-Level Manipulation
by: Wang, Yirui, et al.
Published: (2026)
by: Wang, Yirui, et al.
Published: (2026)
F2F-AP: Flow-to-Future Asynchronous Policy for Real-time Dynamic Manipulation
by: Wei, Haoyu, et al.
Published: (2026)
by: Wei, Haoyu, et al.
Published: (2026)
EmbodiedSAM: Online Segment Any 3D Thing in Real Time
by: Xu, Xiuwei, et al.
Published: (2024)
by: Xu, Xiuwei, et al.
Published: (2024)
SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
by: Yin, Hang, et al.
Published: (2024)
by: Yin, Hang, et al.
Published: (2024)
IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation
by: Guo, Wenxuan, et al.
Published: (2025)
by: Guo, Wenxuan, et al.
Published: (2025)
MoTo: A Zero-shot Plug-in Interaction-aware Navigation for General Mobile Manipulation
by: Wu, Zhenyu, et al.
Published: (2025)
by: Wu, Zhenyu, et al.
Published: (2025)
UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
by: Yin, Hang, et al.
Published: (2025)
by: Yin, Hang, et al.
Published: (2025)
GC-VLN: Instruction as Graph Constraints for Training-free Vision-and-Language Navigation
by: Yin, Hang, et al.
Published: (2025)
by: Yin, Hang, et al.
Published: (2025)
Anyview: Generalizable Indoor 3D Object Detection with Variable Frames
by: Wu, Zhenyu, et al.
Published: (2023)
by: Wu, Zhenyu, et al.
Published: (2023)
MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation
by: Wu, Zhenyu, et al.
Published: (2025)
by: Wu, Zhenyu, et al.
Published: (2025)
CMP: Robust Whole-Body Tracking for Loco-Manipulation via Competence Manifold Projection
by: Cheng, Ziyang, et al.
Published: (2026)
by: Cheng, Ziyang, et al.
Published: (2026)
AwareVLN: Reasoning with Self-awareness for Vision-Language Navigation
by: Guo, Wenxuan, et al.
Published: (2026)
by: Guo, Wenxuan, et al.
Published: (2026)
ComSim: Building Scalable Real-World Robot Data Generation via Compositional Simulation
by: Qin, Yiran, et al.
Published: (2026)
by: Qin, Yiran, et al.
Published: (2026)
R3DP: Real-Time 3D-Aware Policy for Embodied Manipulation
by: Zhang, Yuhao, et al.
Published: (2026)
by: Zhang, Yuhao, et al.
Published: (2026)
3D Small Object Detection with Dynamic Spatial Pruning
by: Xu, Xiuwei, et al.
Published: (2023)
by: Xu, Xiuwei, et al.
Published: (2023)
NeXT-IMDL: Build Benchmark for NeXT-Generation Image Manipulation Detection & Localization
by: Li, Yifei, et al.
Published: (2025)
by: Li, Yifei, et al.
Published: (2025)
iGaussian: Real-Time Camera Pose Estimation via Feed-Forward 3D Gaussian Splatting Inversion
by: Wang, Hao, et al.
Published: (2025)
by: Wang, Hao, et al.
Published: (2025)
RealD$^2$iff: Bridging Real-World Gap in Robot Manipulation via Depth Diffusion
by: Liang, Xiujian, et al.
Published: (2025)
by: Liang, Xiujian, et al.
Published: (2025)
UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detection
by: Zhang, Yanran, et al.
Published: (2026)
by: Zhang, Yanran, et al.
Published: (2026)
REMAC: Self-Reflective and Self-Evolving Multi-Agent Collaboration for Long-Horizon Robot Manipulation
by: Yuan, Puzhen, et al.
Published: (2025)
by: Yuan, Puzhen, et al.
Published: (2025)
Measuring 3D Spatial Geometric Consistency in Dynamic Generated Videos
by: Dou, Weijia, et al.
Published: (2026)
by: Dou, Weijia, et al.
Published: (2026)
$\bf{D^3}$QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection
by: Zhang, Yanran, et al.
Published: (2025)
by: Zhang, Yanran, et al.
Published: (2025)
Vega: Learning to Drive with Natural Language Instructions
by: Zuo, Sicheng, et al.
Published: (2026)
by: Zuo, Sicheng, et al.
Published: (2026)
A Real-to-Sim-to-Real Approach to Robotic Manipulation with VLM-Generated Iterative Keypoint Rewards
by: Patel, Shivansh, et al.
Published: (2025)
by: Patel, Shivansh, et al.
Published: (2025)
RC-NF: Robot-Conditioned Normalizing Flow for Real-Time Anomaly Detection in Robotic Manipulation
by: Zhou, Shijie, et al.
Published: (2026)
by: Zhou, Shijie, et al.
Published: (2026)
Embodied Instruction Following in Unknown Environments
by: Wu, Zhenyu, et al.
Published: (2024)
by: Wu, Zhenyu, et al.
Published: (2024)
ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation
by: Lu, Guanxing, et al.
Published: (2024)
by: Lu, Guanxing, et al.
Published: (2024)
TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding
by: Guo, Wenxuan, et al.
Published: (2025)
by: Guo, Wenxuan, et al.
Published: (2025)
GPD-1: Generative Pre-training for Driving
by: Xie, Zixun, et al.
Published: (2024)
by: Xie, Zixun, et al.
Published: (2024)
ManipArena: Comprehensive Real-world Evaluation of Reasoning-Oriented Generalist Robot Manipulation
by: Sun, Yu, et al.
Published: (2026)
by: Sun, Yu, et al.
Published: (2026)
Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory
by: Wu, Yuqi, et al.
Published: (2025)
by: Wu, Yuqi, et al.
Published: (2025)
Real2Edit2Real: Generating Robotic Demonstrations via a 3D Control Interface
by: Zhao, Yujie, et al.
Published: (2025)
by: Zhao, Yujie, et al.
Published: (2025)
Part-Guided 3D RL for Sim2Real Articulated Object Manipulation
by: Xie, Pengwei, et al.
Published: (2024)
by: Xie, Pengwei, et al.
Published: (2024)
AsyncMDE: Real-Time Monocular Depth Estimation via Asynchronous Spatial Memory
by: Ma, Lianjie, et al.
Published: (2026)
by: Ma, Lianjie, et al.
Published: (2026)
MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning
by: Hao, Jinkun, et al.
Published: (2025)
by: Hao, Jinkun, et al.
Published: (2025)
Multimodal Signal Processing For Thermo-Visible-Lidar Fusion In Real-time 3D Semantic Mapping
by: Sun, Jiajun, et al.
Published: (2026)
by: Sun, Jiajun, et al.
Published: (2026)
Bench2Drive-R: Turning Real World Data into Reactive Closed-Loop Autonomous Driving Benchmark by Generative Model
by: You, Junqi, et al.
Published: (2024)
by: You, Junqi, et al.
Published: (2024)
FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner
by: Zhao, Wenliang, et al.
Published: (2024)
by: Zhao, Wenliang, et al.
Published: (2024)
SCENEREPLICA: Benchmarking Real-World Robot Manipulation by Creating Replicable Scenes
by: Khargonkar, Ninad, et al.
Published: (2023)
by: Khargonkar, Ninad, et al.
Published: (2023)
ManiVID-3D: Generalizable View-Invariant Reinforcement Learning for Robotic Manipulation via Disentangled 3D Representations
by: Li, Zheng, et al.
Published: (2025)
by: Li, Zheng, et al.
Published: (2025)
Similar Items
-
ShapeGen: Robotic Data Generation for Category-Level Manipulation
by: Wang, Yirui, et al.
Published: (2026) -
F2F-AP: Flow-to-Future Asynchronous Policy for Real-time Dynamic Manipulation
by: Wei, Haoyu, et al.
Published: (2026) -
EmbodiedSAM: Online Segment Any 3D Thing in Real Time
by: Xu, Xiuwei, et al.
Published: (2024) -
SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
by: Yin, Hang, et al.
Published: (2024) -
IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation
by: Guo, Wenxuan, et al.
Published: (2025)