| Main Authors: | Kang, Zhihan, Wang, Boyu |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.09459 |
Similar Items
O$^3$Afford: One-Shot 3D Object-to-Object Affordance Grounding for Generalizable Robotic Manipulation
by: Tian, Tongxuan, et al.
Published: (2025)
DVPE: Divided View Position Embedding for Multi-View 3D Object Detection
by: Wang, Jiasen, et al.
Published: (2024)
Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots
by: Ravipati, Siva Krishna, et al.
Published: (2024)
DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects
by: Liang, Tingbang, et al.
Published: (2025)
SemVecNet: Generalizable Vector Map Generation for Arbitrary Sensor Configurations
by: Ranganatha, Narayanan Elavathur, et al.
Published: (2024)
CompassAD: Intent-Driven 3D Affordance Grounding in Functionally Competing Objects
by: Li, Jingliang, et al.
Published: (2026)
3D-CDRGP: Towards Cross-Device Robotic Grasping Policy in 3D Open World
by: Zhao, Weiguang, et al.
Published: (2024)
Towards Long-Range 3D Object Detection for Autonomous Vehicles
by: Khoche, Ajinkya, et al.
Published: (2023)
Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking
by: Ishaq, Ayesha, et al.
Published: (2024)
Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection
by: Chen, Zhili, et al.
Published: (2024)
OCRA: Object-Centric Learning with 3D and Tactile Priors for Human-to-Robot Action Transfer
by: Wang, Kuanning, et al.
Published: (2026)
TranSplat: Surface Embedding-guided 3D Gaussian Splatting for Transparent Object Manipulation
by: Kim, Jeongyun, et al.
Published: (2025)
PIRATR: Parametric Object Inference for Robotic Applications with Transformers in 3D Point Clouds
by: Schwingshackl, Michael, et al.
Published: (2026)
RobotSeg: A Model and Dataset for Segmenting Robots in Image and Video
by: Mei, Haiyang, et al.
Published: (2025)
Clutt3R-Seg: Sparse-view 3D Instance Segmentation for Language-grounded Grasping in Cluttered Scenes
by: Noh, Jeongho, et al.
Published: (2026)
CoIn3D: Revisiting Configuration-Invariant Multi-Camera 3D Object Detection
by: Kuang, Zhaonian, et al.
Published: (2026)
3D-MVP: 3D Multiview Pretraining for Robotic Manipulation
by: Qian, Shengyi, et al.
Published: (2024)
SHOW3D: Capturing Scenes of 3D Hands and Objects in the Wild
by: Rim, Patrick, et al.
Published: (2026)
DIO: Dataset of 3D Mesh Models of Indoor Objects for Robotics and Computer Vision Applications
by: Nimal, Nillan, et al.
Published: (2024)
3D Object Visibility Prediction in Autonomous Driving
by: Luo, Chuanyu, et al.
Published: (2024)
NEDS-SLAM: A Neural Explicit Dense Semantic SLAM Framework using 3D Gaussian Splatting
by: Ji, Yiming, et al.
Published: (2024)
Perspective-Invariant 3D Object Detection
by: Liang, Ao, et al.
Published: (2025)
Robot See Robot Do: Imitating Articulated Object Manipulation with Monocular 4D Reconstruction
by: Kerr, Justin, et al.
Published: (2024)
Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy
by: Garcia, Ricardo, et al.
Published: (2024)
Challenges for Monocular 6D Object Pose Estimation in Robotics
by: Thalhammer, Stefan, et al.
Published: (2023)
ViTA-Seg: Vision Transformer for Amodal Segmentation in Robotics
by: Caramia, Donato, et al.
Published: (2025)
Depth-aware Fusion Method based on Image and 4D Radar Spectrum for 3D Object Detection
by: Sun, Yue, et al.
Published: (2025)
RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation
by: Li, Li, et al.
Published: (2024)
SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping
by: Zhang, Mingxu, et al.
Published: (2025)
Inverse++: Vision-Centric 3D Semantic Occupancy Prediction Assisted with 3D Object Detection
by: Ming, Zhenxing, et al.
Published: (2025)
FOMO-3D: Using Vision Foundation Models for Long-Tailed 3D Object Detection
by: Yang, Anqi Joyce, et al.
Published: (2026)
3D Feature Distillation with Object-Centric Priors
by: Tziafas, Georgios, et al.
Published: (2024)
Articulate AnyMesh: Open-Vocabulary 3D Articulated Objects Modeling
by: Qiu, Xiaowen, et al.
Published: (2025)
A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery
by: Xu, Mengya, et al.
Published: (2024)
Anyview: Generalizable Indoor 3D Object Detection with Variable Frames
by: Wu, Zhenyu, et al.
Published: (2023)
DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation
by: Kim, Young Hun, et al.
Published: (2025)
TrackDeform3D: Markerless and Autonomous 3D Keypoint Tracking and Dataset Collection for Deformable Objects
by: Zong, Yeheng, et al.
Published: (2026)
Towards Accurate State Estimation: Kalman Filter Incorporating Motion Dynamics for 3D Multi-Object Tracking
by: Nagy, Mohamed, et al.
Published: (2025)
6D Object Pose Tracking in Internet Videos for Robotic Manipulation
by: Ponimatkin, Georgy, et al.
Published: (2025)
Rethink 3D Object Detection from Physical World
by: Tanaka, Satoshi, et al.
Published: (2025)