:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Ku, Chahyon, Winge, Carl, Diaz, Ryan, Yuan, Wentao, Desingh, Karthik
Format:	Preprint
Published:	2023
Subjects:	Robotics Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2310.09943
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

AugInsert: Learning Robust Visual-Force Policies via Data Augmentation for Object Assembly Tasks
by: Diaz, Ryan, et al.
Published: (2024)

Talk Through It: End User Directed Manipulation Learning
by: Winge, Carl, et al.
Published: (2024)

SLAM Adversarial Lab: An Extensible Framework for Visual SLAM Robustness Evaluation under Adverse Conditions
by: Hefny, Mohamed, et al.
Published: (2026)

STRNet: Visual Navigation with Spatio-Temporal Representation through Dynamic Graph Aggregation
by: Ren, Hao, et al.
Published: (2026)

Two by Two: Learning Multi-Task Pairwise Objects Assembly for Generalizable Robot Manipulation
by: Qi, Yu, et al.
Published: (2025)

Semantic Object-level Modeling for Robust Visual Camera Relocalization
by: Zhu, Yifan, et al.
Published: (2024)

OTPL-VIO: Robust Visual-Inertial Odometry with Optimal Transport Line Association and Adaptive Uncertainty
by: Chen, Zikun, et al.
Published: (2026)

Component Selection for Craft Assembly Tasks
by: Isume, Vitor Hideyo, et al.
Published: (2024)

VOOM: Robust Visual Object Odometry and Mapping using Hierarchical Landmarks
by: Wang, Yutong, et al.
Published: (2024)

Deep SE(3)-Equivariant Geometric Reasoning for Precise Placement Tasks
by: Eisner, Ben, et al.
Published: (2024)

Learning Visual Information Utility with PIXER
by: Turkar, Yash, et al.
Published: (2024)

VDNA-PR: Using General Dataset Representations for Robust Sequential Visual Place Recognition
by: Ramtoula, Benjamin, et al.
Published: (2024)

Understanding Spatio-Temporal Relations in Human-Object Interaction using Pyramid Graph Convolutional Network
by: Xing, Hao, et al.
Published: (2024)

What Is The Best 3D Scene Representation for Robotics? From Geometric to Foundation Models
by: Deng, Tianchen, et al.
Published: (2025)

SNOW: Spatio-Temporal Scene Understanding with World Knowledge for Open-World Embodied Reasoning
by: Sohn, Tin Stribor, et al.
Published: (2025)

SceneFunRI: Reasoning the Invisible for Task-Driven Functional Object Localization
by: Chen, Posheng, et al.
Published: (2026)

A Neural Representation Framework with LLM-Driven Spatial Reasoning for Open-Vocabulary 3D Visual Grounding
by: Liu, Zhenyang, et al.
Published: (2025)

CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition
by: Lu, Feng, et al.
Published: (2024)

Attentive Feature Aggregation or: How Policies Learn to Stop Worrying about Robustness and Attend to Task-Relevant Visual Cues
by: Tsagkas, Nikolaos, et al.
Published: (2025)

CloSE: A Geometric Shape-Agnostic Cloth State Representation
by: Kamat, Jay, et al.
Published: (2025)

Choreographing a World of Dynamic Objects
by: Lyu, Yanzhe, et al.
Published: (2026)

R4: Retrieval-Augmented Reasoning for Vision-Language Models in 4D Spatio-Temporal Space
by: Sohn, Tin Stribor, et al.
Published: (2025)

Disentangled Object-Centric Image Representation for Robotic Manipulation
by: Emukpere, David, et al.
Published: (2025)

Uncertainty Quantification for Visual Object Pose Estimation
by: Shaikewitz, Lorenzo, et al.
Published: (2025)

S.T.A.R.-Track: Latent Motion Models for End-to-End 3D Object Tracking with Adaptive Spatio-Temporal Appearance Representations
by: Doll, Simon, et al.
Published: (2023)

Guided Diffusion-based Generation of Adversarial Objects for Real-World Monocular Depth Estimation Attacks
by: Chen, Yongtao, et al.
Published: (2025)

GTR: Gaussian Splatting Tracking and Reconstruction of Unknown Objects Based on Appearance and Geometric Complexity
by: Ikeda, Takuya, et al.
Published: (2025)

Co-Me: Confidence-Guided Token Merging for Visual Geometric Transformers
by: Chen, Yutian, et al.
Published: (2025)

A Synthetic Data Pipeline for Supporting Manufacturing SMEs in Visual Assembly Control
by: Werheid, Jonas, et al.
Published: (2025)

Global Truncated Loss Minimization for Robust and Threshold-Resilient Geometric Estimation
by: Huang, Tianyu, et al.
Published: (2026)

Latent Representations for Visual Proprioception in Inexpensive Robots
by: Sheikholeslami, Sahara, et al.
Published: (2025)

RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation
by: Patel, Naman, et al.
Published: (2025)

Realtime Robust Shape Estimation of Deformable Linear Object
by: Zhang, Jiaming, et al.
Published: (2024)

Robust Fusion of Object-Level V2X for Learned 3D Object Detection
by: Ostendorf, Lukas, et al.
Published: (2026)

Using Visual Anomaly Detection for Task Execution Monitoring
by: Thoduka, Santosh, et al.
Published: (2021)

BOP-ASK: Object-Interaction Reasoning for Vision-Language Models
by: Bhat, Vineet, et al.
Published: (2025)

TaskGround: Structured Executable Task Inference for Full-Scene Household Reasoning
by: Feng, ZhiYuan, et al.
Published: (2026)

MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning
by: Hao, Jinkun, et al.
Published: (2025)

Robust Surgical Tool Tracking with Pixel-based Probabilities for Projected Geometric Primitives
by: D'Ambrosia, Christopher, et al.
Published: (2024)

OW-Rep: Open World Object Detection with Instance Representation Learning
by: Lee, Sunoh, et al.
Published: (2024)