:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kapelyukh, Ivan, Ren, Yifei, Alzugaray, Ignacio, Johns, Edward
Format:	Preprint
Published:	2023
Subjects:	Robotics Computer Vision and Pattern Recognition Machine Learning
Online Access:	https://arxiv.org/abs/2312.04533
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Learning in ImaginationLand: Omnidirectional Policies through 3D Generative Models (OP-Gen)
by: Ren, Yifei, et al.
Published: (2025)

DegustaBot: Zero-Shot Visual Preference Estimation for Personalized Multi-Object Rearrangement
by: Newman, Benjamin A., et al.
Published: (2024)

DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation
by: Kim, Young Hun, et al.
Published: (2025)

Zero-Shot 3D Visual Grounding from Vision-Language Models
by: Li, Rong, et al.
Published: (2025)

Constraint-Aware Zero-Shot Vision-Language Navigation in Continuous Environments
by: Chen, Kehan, et al.
Published: (2024)

One-Shot Dual-Arm Imitation Learning
by: Wang, Yilong, et al.
Published: (2025)

D$^3$Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement
by: Wang, Yixuan, et al.
Published: (2023)

PACA: Perspective-Aware Cross-Attention Representation for Zero-Shot Scene Rearrangement
by: Jin, Shutong, et al.
Published: (2024)

MonoDream: Monocular Vision-Language Navigation with Panoramic Dreaming
by: Wang, Shuo, et al.
Published: (2025)

DreamNav: A Trajectory-Based Imaginative Framework for Zero-Shot Vision-and-Language Navigation
by: Wang, Yunheng, et al.
Published: (2025)

DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
by: Zhang, Wenyao, et al.
Published: (2025)

Hyperion -- A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM
by: Hug, David, et al.
Published: (2024)

ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation Models
by: Zhang, Ying, et al.
Published: (2025)

LOC-ZSON: Language-driven Object-Centric Zero-Shot Object Retrieval and Navigation
by: Guan, Tianrui, et al.
Published: (2024)

Leveraging Unknown Objects to Construct Labeled-Unlabeled Meta-Relationships for Zero-Shot Object Navigation
by: Zheng, Yanwei, et al.
Published: (2024)

FetchBot: Learning Generalizable Object Fetching in Cluttered Scenes via Zero-Shot Sim2Real
by: Liu, Weiheng, et al.
Published: (2025)

SORT3D: Spatial Object-centric Reasoning Toolbox for Zero-Shot 3D Grounding Using Large Language Models
by: Zantout, Nader, et al.
Published: (2025)

Improving Zero-Shot ObjectNav with Generative Communication
by: Dorbala, Vishnu Sashank, et al.
Published: (2024)

3DGS-CD: 3D Gaussian Splatting-based Change Detection for Physical Object Rearrangement
by: Lu, Ziqi, et al.
Published: (2024)

AgentGrounder: Zero-Shot 3D Visual Pointcloud Grounding using Multimodal Language Models
by: Huynh, Cuong, et al.
Published: (2026)

ZING-3D: Zero-shot Incremental 3D Scene Graphs via Vision-Language Models
by: Saxena, Pranav, et al.
Published: (2025)

SmartWay: Enhanced Waypoint Prediction and Backtracking for Zero-Shot Vision-and-Language Navigation
by: Shi, Xiangyu, et al.
Published: (2025)

Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement
by: Gkanatsios, Nikolaos, et al.
Published: (2023)

Fast-SmartWay: Panoramic-Free End-to-End Zero-Shot Vision-and-Language Navigation
by: Shi, Xiangyu, et al.
Published: (2025)

Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs
by: Qiao, Yanyuan, et al.
Published: (2024)

Three-Step Nav: A Hierarchical Global-Local Planner for Zero-Shot Vision-and-Language Navigation
by: Zheng, Wanrong, et al.
Published: (2026)

Zero-Shot Peg Insertion: Identifying Mating Holes and Estimating SE(2) Poses with Vision-Language Models
by: Yajima, Masaru, et al.
Published: (2025)

Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching
by: Wen, Bowen, et al.
Published: (2025)

Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow
by: Dharmarajan, Karthik, et al.
Published: (2025)

Color-Pair Guided Robust Zero-Shot 6D Pose Estimation and Tracking of Cluttered Objects on Edge Devices
by: Yang, Xingjian, et al.
Published: (2025)

O$^3$Afford: One-Shot 3D Object-to-Object Affordance Grounding for Generalizable Robotic Manipulation
by: Tian, Tongxuan, et al.
Published: (2025)

High-Speed Vision Improves Zero-Shot Semantic Understanding of Human Actions
by: Cao, Yongpeng, et al.
Published: (2026)

LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks
by: Yang, Liudi, et al.
Published: (2025)

DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments
by: Ma, Ji, et al.
Published: (2024)

AERR-Nav: Adaptive Exploration-Recovery-Reminiscing Strategy for Zero-Shot Object Navigation
by: Huang, Jingzhi, et al.
Published: (2026)

FOMO-3D: Using Vision Foundation Models for Long-Tailed 3D Object Detection
by: Yang, Anqi Joyce, et al.
Published: (2026)

A Comparative Evaluation of Large Vision-Language Models for 2D Object Detection under SOTIF Conditions
by: Zhou, Ji, et al.
Published: (2026)

Observer-Actor: Active Vision Imitation Learning with Sparse-View Gaussian Splatting
by: Wang, Yilong, et al.
Published: (2025)

HiMemVLN: Enhancing Reliability of Open-Source Zero-Shot Vision-and-Language Navigation with Hierarchical Memory System
by: Lyu, Kailin, et al.
Published: (2026)

PlantTrack: Task-Driven Plant Keypoint Tracking with Zero-Shot Sim2Real Transfer
by: Marri, Samhita, et al.
Published: (2024)