Saved in:
| Main Authors: | Ku, Chahyon, Winge, Carl, Diaz, Ryan, Yuan, Wentao, Desingh, Karthik |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2310.09943 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
AugInsert: Learning Robust Visual-Force Policies via Data Augmentation for Object Assembly Tasks
by: Diaz, Ryan, et al.
Published: (2024)
by: Diaz, Ryan, et al.
Published: (2024)
Talk Through It: End User Directed Manipulation Learning
by: Winge, Carl, et al.
Published: (2024)
by: Winge, Carl, et al.
Published: (2024)
SLAM Adversarial Lab: An Extensible Framework for Visual SLAM Robustness Evaluation under Adverse Conditions
by: Hefny, Mohamed, et al.
Published: (2026)
by: Hefny, Mohamed, et al.
Published: (2026)
STRNet: Visual Navigation with Spatio-Temporal Representation through Dynamic Graph Aggregation
by: Ren, Hao, et al.
Published: (2026)
by: Ren, Hao, et al.
Published: (2026)
Two by Two: Learning Multi-Task Pairwise Objects Assembly for Generalizable Robot Manipulation
by: Qi, Yu, et al.
Published: (2025)
by: Qi, Yu, et al.
Published: (2025)
Semantic Object-level Modeling for Robust Visual Camera Relocalization
by: Zhu, Yifan, et al.
Published: (2024)
by: Zhu, Yifan, et al.
Published: (2024)
OTPL-VIO: Robust Visual-Inertial Odometry with Optimal Transport Line Association and Adaptive Uncertainty
by: Chen, Zikun, et al.
Published: (2026)
by: Chen, Zikun, et al.
Published: (2026)
Component Selection for Craft Assembly Tasks
by: Isume, Vitor Hideyo, et al.
Published: (2024)
by: Isume, Vitor Hideyo, et al.
Published: (2024)
VOOM: Robust Visual Object Odometry and Mapping using Hierarchical Landmarks
by: Wang, Yutong, et al.
Published: (2024)
by: Wang, Yutong, et al.
Published: (2024)
Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks
by: Eisner, Ben, et al.
Published: (2024)
by: Eisner, Ben, et al.
Published: (2024)
Learning Visual Information Utility with PIXER
by: Turkar, Yash, et al.
Published: (2024)
by: Turkar, Yash, et al.
Published: (2024)
VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition
by: Ramtoula, Benjamin, et al.
Published: (2024)
by: Ramtoula, Benjamin, et al.
Published: (2024)
Understanding Spatio-Temporal Relations in Human-Object Interaction using Pyramid Graph Convolutional Network
by: Xing, Hao, et al.
Published: (2024)
by: Xing, Hao, et al.
Published: (2024)
What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models
by: Deng, Tianchen, et al.
Published: (2025)
by: Deng, Tianchen, et al.
Published: (2025)
SNOW: Spatio-Temporal Scene Understanding with World Knowledge for Open-World Embodied Reasoning
by: Sohn, Tin Stribor, et al.
Published: (2025)
by: Sohn, Tin Stribor, et al.
Published: (2025)
SceneFunRI: Reasoning the Invisible for Task-Driven Functional Object Localization
by: Chen, Posheng, et al.
Published: (2026)
by: Chen, Posheng, et al.
Published: (2026)
A Neural Representation Framework with LLM-Driven Spatial Reasoning for Open-Vocabulary 3D Visual Grounding
by: Liu, Zhenyang, et al.
Published: (2025)
by: Liu, Zhenyang, et al.
Published: (2025)
CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition
by: Lu, Feng, et al.
Published: (2024)
by: Lu, Feng, et al.
Published: (2024)
Attentive Feature Aggregation or: How Policies Learn to Stop Worrying about Robustness and Attend to Task-Relevant Visual Cues
by: Tsagkas, Nikolaos, et al.
Published: (2025)
by: Tsagkas, Nikolaos, et al.
Published: (2025)
CloSE: A Geometric Shape-Agnostic Cloth State Representation
by: Kamat, Jay, et al.
Published: (2025)
by: Kamat, Jay, et al.
Published: (2025)
Choreographing a World of Dynamic Objects
by: Lyu, Yanzhe, et al.
Published: (2026)
by: Lyu, Yanzhe, et al.
Published: (2026)
R4: Retrieval-Augmented Reasoning for Vision-Language Models in 4D Spatio-Temporal Space
by: Sohn, Tin Stribor, et al.
Published: (2025)
by: Sohn, Tin Stribor, et al.
Published: (2025)
Disentangled Object-Centric Image Representation for Robotic Manipulation
by: Emukpere, David, et al.
Published: (2025)
by: Emukpere, David, et al.
Published: (2025)
Uncertainty Quantification for Visual Object Pose Estimation
by: Shaikewitz, Lorenzo, et al.
Published: (2025)
by: Shaikewitz, Lorenzo, et al.
Published: (2025)
S.T.A.R.-Track: Latent Motion Models for End-to-End 3D Object Tracking with Adaptive Spatio-Temporal Appearance Representations
by: Doll, Simon, et al.
Published: (2023)
by: Doll, Simon, et al.
Published: (2023)
Guided Diffusion-based Generation of Adversarial Objects for Real-World Monocular Depth Estimation Attacks
by: Chen, Yongtao, et al.
Published: (2025)
by: Chen, Yongtao, et al.
Published: (2025)
GTR: Gaussian Splatting Tracking and Reconstruction of Unknown Objects Based on Appearance and Geometric Complexity
by: Ikeda, Takuya, et al.
Published: (2025)
by: Ikeda, Takuya, et al.
Published: (2025)
Co-Me: Confidence-Guided Token Merging for Visual Geometric Transformers
by: Chen, Yutian, et al.
Published: (2025)
by: Chen, Yutian, et al.
Published: (2025)
A Synthetic Data Pipeline for Supporting Manufacturing SMEs in Visual Assembly Control
by: Werheid, Jonas, et al.
Published: (2025)
by: Werheid, Jonas, et al.
Published: (2025)
Global Truncated Loss Minimization for Robust and Threshold-Resilient Geometric Estimation
by: Huang, Tianyu, et al.
Published: (2026)
by: Huang, Tianyu, et al.
Published: (2026)
Latent Representations for Visual Proprioception in Inexpensive Robots
by: Sheikholeslami, Sahara, et al.
Published: (2025)
by: Sheikholeslami, Sahara, et al.
Published: (2025)
RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation
by: Patel, Naman, et al.
Published: (2025)
by: Patel, Naman, et al.
Published: (2025)
Realtime Robust Shape Estimation of Deformable Linear Object
by: Zhang, Jiaming, et al.
Published: (2024)
by: Zhang, Jiaming, et al.
Published: (2024)
Robust Fusion of Object-Level V2X for Learned 3D Object Detection
by: Ostendorf, Lukas, et al.
Published: (2026)
by: Ostendorf, Lukas, et al.
Published: (2026)
Using Visual Anomaly Detection for Task Execution Monitoring
by: Thoduka, Santosh, et al.
Published: (2021)
by: Thoduka, Santosh, et al.
Published: (2021)
BOP-ASK: Object-Interaction Reasoning for Vision-Language Models
by: Bhat, Vineet, et al.
Published: (2025)
by: Bhat, Vineet, et al.
Published: (2025)
TaskGround: Structured Executable Task Inference for Full-Scene Household Reasoning
by: Feng, ZhiYuan, et al.
Published: (2026)
by: Feng, ZhiYuan, et al.
Published: (2026)
MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning
by: Hao, Jinkun, et al.
Published: (2025)
by: Hao, Jinkun, et al.
Published: (2025)
Robust Surgical Tool Tracking with Pixel-based Probabilities for Projected Geometric Primitives
by: D'Ambrosia, Christopher, et al.
Published: (2024)
by: D'Ambrosia, Christopher, et al.
Published: (2024)
OW-Rep: Open World Object Detection with Instance Representation Learning
by: Lee, Sunoh, et al.
Published: (2024)
by: Lee, Sunoh, et al.
Published: (2024)
Similar Items
-
AugInsert: Learning Robust Visual-Force Policies via Data Augmentation for Object Assembly Tasks
by: Diaz, Ryan, et al.
Published: (2024) -
Talk Through It: End User Directed Manipulation Learning
by: Winge, Carl, et al.
Published: (2024) -
SLAM Adversarial Lab: An Extensible Framework for Visual SLAM Robustness Evaluation under Adverse Conditions
by: Hefny, Mohamed, et al.
Published: (2026) -
STRNet: Visual Navigation with Spatio-Temporal Representation through Dynamic Graph Aggregation
by: Ren, Hao, et al.
Published: (2026) -
Two by Two: Learning Multi-Task Pairwise Objects Assembly for Generalizable Robot Manipulation
by: Qi, Yu, et al.
Published: (2025)