:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kang, Zhihan, Wang, Boyu
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Robotics
Online Access:	https://arxiv.org/abs/2507.09459
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

O$^3$Afford: One-Shot 3D Object-to-Object Affordance Grounding for Generalizable Robotic Manipulation
by: Tian, Tongxuan, et al.
Published: (2025)

DVPE: Divided View Position Embedding for Multi-View 3D Object Detection
by: Wang, Jiasen, et al.
Published: (2024)

Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots
by: Ravipati, Siva Krishna, et al.
Published: (2024)

DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects
by: Liang, Tingbang, et al.
Published: (2025)

SemVecNet: Generalizable Vector Map Generation for Arbitrary Sensor Configurations
by: Ranganatha, Narayanan Elavathur, et al.
Published: (2024)

CompassAD: Intent-Driven 3D Affordance Grounding in Functionally Competing Objects
by: Li, Jingliang, et al.
Published: (2026)

3D-CDRGP: Towards Cross-Device Robotic Grasping Policy in 3D Open World
by: Zhao, Weiguang, et al.
Published: (2024)

Towards Long-Range 3D Object Detection for Autonomous Vehicles
by: Khoche, Ajinkya, et al.
Published: (2023)

Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking
by: Ishaq, Ayesha, et al.
Published: (2024)

Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection
by: Chen, Zhili, et al.
Published: (2024)

OCRA: Object-Centric Learning with 3D and Tactile Priors for Human-to-Robot Action Transfer
by: Wang, Kuanning, et al.
Published: (2026)

TranSplat: Surface Embedding-guided 3D Gaussian Splatting for Transparent Object Manipulation
by: Kim, Jeongyun, et al.
Published: (2025)

PIRATR: Parametric Object Inference for Robotic Applications with Transformers in 3D Point Clouds
by: Schwingshackl, Michael, et al.
Published: (2026)

RobotSeg: A Model and Dataset for Segmenting Robots in Image and Video
by: Mei, Haiyang, et al.
Published: (2025)

Clutt3R-Seg: Sparse-view 3D Instance Segmentation for Language-grounded Grasping in Cluttered Scenes
by: Noh, Jeongho, et al.
Published: (2026)

CoIn3D: Revisiting Configuration-Invariant Multi-Camera 3D Object Detection
by: Kuang, Zhaonian, et al.
Published: (2026)

3D-MVP: 3D Multiview Pretraining for Robotic Manipulation
by: Qian, Shengyi, et al.
Published: (2024)

SHOW3D: Capturing Scenes of 3D Hands and Objects in the Wild
by: Rim, Patrick, et al.
Published: (2026)

DIO: Dataset of 3D Mesh Models of Indoor Objects for Robotics and Computer Vision Applications
by: Nimal, Nillan, et al.
Published: (2024)

3D Object Visibility Prediction in Autonomous Driving
by: Luo, Chuanyu, et al.
Published: (2024)

NEDS-SLAM: A Neural Explicit Dense Semantic SLAM Framework using 3D Gaussian Splatting
by: Ji, Yiming, et al.
Published: (2024)

Perspective-Invariant 3D Object Detection
by: Liang, Ao, et al.
Published: (2025)

Robot See Robot Do: Imitating Articulated Object Manipulation with Monocular 4D Reconstruction
by: Kerr, Justin, et al.
Published: (2024)

Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy
by: Garcia, Ricardo, et al.
Published: (2024)

Challenges for Monocular 6D Object Pose Estimation in Robotics
by: Thalhammer, Stefan, et al.
Published: (2023)

ViTA-Seg: Vision Transformer for Amodal Segmentation in Robotics
by: Caramia, Donato, et al.
Published: (2025)

Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection
by: Sun, Yue, et al.
Published: (2025)

RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation
by: Li, Li, et al.
Published: (2024)

SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping
by: Zhang, Mingxu, et al.
Published: (2025)

Inverse++: Vision-Centric 3D Semantic Occupancy Prediction Assisted with 3D Object Detection
by: Ming, Zhenxing, et al.
Published: (2025)

FOMO-3D: Using Vision Foundation Models for Long-Tailed 3D Object Detection
by: Yang, Anqi Joyce, et al.
Published: (2026)

3D Feature Distillation with Object-Centric Priors
by: Tziafas, Georgios, et al.
Published: (2024)

Articulate AnyMesh: Open-Vocabulary 3D Articulated Objects Modeling
by: Qiu, Xiaowen, et al.
Published: (2025)

A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery
by: Xu, Mengya, et al.
Published: (2024)

Anyview: Generalizable Indoor 3D Object Detection with Variable Frames
by: Wu, Zhenyu, et al.
Published: (2023)

DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation
by: Kim, Young Hun, et al.
Published: (2025)

TrackDeform3D: Markerless and Autonomous 3D Keypoint Tracking and Dataset Collection for Deformable Objects
by: Zong, Yeheng, et al.
Published: (2026)

Towards Accurate State Estimation: Kalman Filter Incorporating Motion Dynamics for 3D Multi-Object Tracking
by: Nagy, Mohamed, et al.
Published: (2025)

6D Object Pose Tracking in Internet Videos for Robotic Manipulation
by: Ponimatkin, Georgy, et al.
Published: (2025)

Rethink 3D Object Detection from Physical World
by: Tanaka, Satoshi, et al.
Published: (2025)