:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Xu, Xiuwei, Ma, Angyuan, Li, Hankun, Yu, Bingyao, Zhu, Zheng, Zhou, Jie, Lu, Jiwen
Format:	Preprint
Published:	2025
Subjects:	Robotics Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2510.08547
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

ShapeGen: Robotic Data Generation for Category-Level Manipulation
by: Wang, Yirui, et al.
Published: (2026)

F2F-AP: Flow-to-Future Asynchronous Policy for Real-time Dynamic Manipulation
by: Wei, Haoyu, et al.
Published: (2026)

EmbodiedSAM: Online Segment Any 3D Thing in Real Time
by: Xu, Xiuwei, et al.
Published: (2024)

SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation
by: Yin, Hang, et al.
Published: (2024)

IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation
by: Guo, Wenxuan, et al.
Published: (2025)

MoTo: A Zero-shot Plug-in Interaction-aware Navigation for General Mobile Manipulation
by: Wu, Zhenyu, et al.
Published: (2025)

UniGoal: Towards Universal Zero-shot Goal-oriented Navigation
by: Yin, Hang, et al.
Published: (2025)

GC-VLN: Instruction as Graph Constraints for Training-free Vision-and-Language Navigation
by: Yin, Hang, et al.
Published: (2025)

Anyview: Generalizable Indoor 3D Object Detection with Variable Frames
by: Wu, Zhenyu, et al.
Published: (2023)

MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation
by: Wu, Zhenyu, et al.
Published: (2025)

CMP: Robust Whole-Body Tracking for Loco-Manipulation via Competence Manifold Projection
by: Cheng, Ziyang, et al.
Published: (2026)

AwareVLN: Reasoning with Self-awareness for Vision-Language Navigation
by: Guo, Wenxuan, et al.
Published: (2026)

ComSim: Building Scalable Real-World Robot Data Generation via Compositional Simulation
by: Qin, Yiran, et al.
Published: (2026)

R3DP: Real-Time 3D-Aware Policy for Embodied Manipulation
by: Zhang, Yuhao, et al.
Published: (2026)

3D Small Object Detection with Dynamic Spatial Pruning
by: Xu, Xiuwei, et al.
Published: (2023)

NeXT-IMDL: Build Benchmark for NeXT-Generation Image Manipulation Detection & Localization
by: Li, Yifei, et al.
Published: (2025)

iGaussian: Real-Time Camera Pose Estimation via Feed-Forward 3D Gaussian Splatting Inversion
by: Wang, Hao, et al.
Published: (2025)

RealD$^2$iff: Bridging Real-World Gap in Robot Manipulation via Depth Diffusion
by: Liang, Xiujian, et al.
Published: (2025)

UniGenDet: A Unified Generative-Discriminative Framework for Co-Evolutionary Image Generation and Generated Image Detection
by: Zhang, Yanran, et al.
Published: (2026)

REMAC: Self-Reflective and Self-Evolving Multi-Agent Collaboration for Long-Horizon Robot Manipulation
by: Yuan, Puzhen, et al.
Published: (2025)

Measuring 3D Spatial Geometric Consistency in Dynamic Generated Videos
by: Dou, Weijia, et al.
Published: (2026)

$\bf{D^3}$QE: Learning Discrete Distribution Discrepancy-aware Quantization Error for Autoregressive-Generated Image Detection
by: Zhang, Yanran, et al.
Published: (2025)

Vega: Learning to Drive with Natural Language Instructions
by: Zuo, Sicheng, et al.
Published: (2026)

A Real-to-Sim-to-Real Approach to Robotic Manipulation with VLM-Generated Iterative Keypoint Rewards
by: Patel, Shivansh, et al.
Published: (2025)

RC-NF: Robot-Conditioned Normalizing Flow for Real-Time Anomaly Detection in Robotic Manipulation
by: Zhou, Shijie, et al.
Published: (2026)

Embodied Instruction Following in Unknown Environments
by: Wu, Zhenyu, et al.
Published: (2024)

ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation
by: Lu, Guanxing, et al.
Published: (2024)

TSP3D: Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding
by: Guo, Wenxuan, et al.
Published: (2025)

GPD-1: Generative Pre-training for Driving
by: Xie, Zixun, et al.
Published: (2024)

ManipArena: Comprehensive Real-world Evaluation of Reasoning-Oriented Generalist Robot Manipulation
by: Sun, Yu, et al.
Published: (2026)

Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory
by: Wu, Yuqi, et al.
Published: (2025)

Real2Edit2Real: Generating Robotic Demonstrations via a 3D Control Interface
by: Zhao, Yujie, et al.
Published: (2025)

Part-Guided 3D RL for Sim2Real Articulated Object Manipulation
by: Xie, Pengwei, et al.
Published: (2024)

AsyncMDE: Real-Time Monocular Depth Estimation via Asynchronous Spatial Memory
by: Ma, Lianjie, et al.
Published: (2026)

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning
by: Hao, Jinkun, et al.
Published: (2025)

Multimodal Signal Processing For Thermo-Visible-Lidar Fusion In Real-time 3D Semantic Mapping
by: Sun, Jiajun, et al.
Published: (2026)

Bench2Drive-R: Turning Real World Data into Reactive Closed-Loop Autonomous Driving Benchmark by Generative Model
by: You, Junqi, et al.
Published: (2024)

FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner
by: Zhao, Wenliang, et al.
Published: (2024)

SCENEREPLICA: Benchmarking Real-World Robot Manipulation by Creating Replicable Scenes
by: Khargonkar, Ninad, et al.
Published: (2023)

ManiVID-3D: Generalizable View-Invariant Reinforcement Learning for Robotic Manipulation via Disentangled 3D Representations
by: Li, Zheng, et al.
Published: (2025)