Saved in:
| Main Authors: | Kapelyukh, Ivan, Ren, Yifei, Alzugaray, Ignacio, Johns, Edward |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2312.04533 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Learning in ImaginationLand: Omnidirectional Policies through 3D Generative Models (OP-Gen)
by: Ren, Yifei, et al.
Published: (2025)
by: Ren, Yifei, et al.
Published: (2025)
DegustaBot: Zero-Shot Visual Preference Estimation for Personalized Multi-Object Rearrangement
by: Newman, Benjamin A., et al.
Published: (2024)
by: Newman, Benjamin A., et al.
Published: (2024)
DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation
by: Kim, Young Hun, et al.
Published: (2025)
by: Kim, Young Hun, et al.
Published: (2025)
Zero-Shot 3D Visual Grounding from Vision-Language Models
by: Li, Rong, et al.
Published: (2025)
by: Li, Rong, et al.
Published: (2025)
Constraint-Aware Zero-Shot Vision-Language Navigation in Continuous Environments
by: Chen, Kehan, et al.
Published: (2024)
by: Chen, Kehan, et al.
Published: (2024)
One-Shot Dual-Arm Imitation Learning
by: Wang, Yilong, et al.
Published: (2025)
by: Wang, Yilong, et al.
Published: (2025)
D$^3$Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement
by: Wang, Yixuan, et al.
Published: (2023)
by: Wang, Yixuan, et al.
Published: (2023)
PACA: Perspective-Aware Cross-Attention Representation for Zero-Shot Scene Rearrangement
by: Jin, Shutong, et al.
Published: (2024)
by: Jin, Shutong, et al.
Published: (2024)
MonoDream: Monocular Vision-Language Navigation with Panoramic Dreaming
by: Wang, Shuo, et al.
Published: (2025)
by: Wang, Shuo, et al.
Published: (2025)
DreamNav: A Trajectory-Based Imaginative Framework for Zero-Shot Vision-and-Language Navigation
by: Wang, Yunheng, et al.
Published: (2025)
by: Wang, Yunheng, et al.
Published: (2025)
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
by: Zhang, Wenyao, et al.
Published: (2025)
by: Zhang, Wenyao, et al.
Published: (2025)
Hyperion -- A fast, versatile symbolic Gaussian Belief Propagation framework for Continuous-Time SLAM
by: Hug, David, et al.
Published: (2024)
by: Hug, David, et al.
Published: (2024)
ZISVFM: Zero-Shot Object Instance Segmentation in Indoor Robotic Environments with Vision Foundation Models
by: Zhang, Ying, et al.
Published: (2025)
by: Zhang, Ying, et al.
Published: (2025)
LOC-ZSON: Language-driven Object-Centric Zero-Shot Object Retrieval and Navigation
by: Guan, Tianrui, et al.
Published: (2024)
by: Guan, Tianrui, et al.
Published: (2024)
Leveraging Unknown Objects to Construct Labeled-Unlabeled Meta-Relationships for Zero-Shot Object Navigation
by: Zheng, Yanwei, et al.
Published: (2024)
by: Zheng, Yanwei, et al.
Published: (2024)
FetchBot: Learning Generalizable Object Fetching in Cluttered Scenes via Zero-Shot Sim2Real
by: Liu, Weiheng, et al.
Published: (2025)
by: Liu, Weiheng, et al.
Published: (2025)
SORT3D: Spatial Object-centric Reasoning Toolbox for Zero-Shot 3D Grounding Using Large Language Models
by: Zantout, Nader, et al.
Published: (2025)
by: Zantout, Nader, et al.
Published: (2025)
Improving Zero-Shot ObjectNav with Generative Communication
by: Dorbala, Vishnu Sashank, et al.
Published: (2024)
by: Dorbala, Vishnu Sashank, et al.
Published: (2024)
3DGS-CD: 3D Gaussian Splatting-based Change Detection for Physical Object Rearrangement
by: Lu, Ziqi, et al.
Published: (2024)
by: Lu, Ziqi, et al.
Published: (2024)
AgentGrounder: Zero-Shot 3D Visual Pointcloud Grounding using Multimodal Language Models
by: Huynh, Cuong, et al.
Published: (2026)
by: Huynh, Cuong, et al.
Published: (2026)
ZING-3D: Zero-shot Incremental 3D Scene Graphs via Vision-Language Models
by: Saxena, Pranav, et al.
Published: (2025)
by: Saxena, Pranav, et al.
Published: (2025)
SmartWay: Enhanced Waypoint Prediction and Backtracking for Zero-Shot Vision-and-Language Navigation
by: Shi, Xiangyu, et al.
Published: (2025)
by: Shi, Xiangyu, et al.
Published: (2025)
Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement
by: Gkanatsios, Nikolaos, et al.
Published: (2023)
by: Gkanatsios, Nikolaos, et al.
Published: (2023)
Fast-SmartWay: Panoramic-Free End-to-End Zero-Shot Vision-and-Language Navigation
by: Shi, Xiangyu, et al.
Published: (2025)
by: Shi, Xiangyu, et al.
Published: (2025)
Open-Nav: Exploring Zero-Shot Vision-and-Language Navigation in Continuous Environment with Open-Source LLMs
by: Qiao, Yanyuan, et al.
Published: (2024)
by: Qiao, Yanyuan, et al.
Published: (2024)
Three-Step Nav: A Hierarchical Global-Local Planner for Zero-Shot Vision-and-Language Navigation
by: Zheng, Wanrong, et al.
Published: (2026)
by: Zheng, Wanrong, et al.
Published: (2026)
Zero-Shot Peg Insertion: Identifying Mating Holes and Estimating SE(2) Poses with Vision-Language Models
by: Yajima, Masaru, et al.
Published: (2025)
by: Yajima, Masaru, et al.
Published: (2025)
Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching
by: Wen, Bowen, et al.
Published: (2025)
by: Wen, Bowen, et al.
Published: (2025)
Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow
by: Dharmarajan, Karthik, et al.
Published: (2025)
by: Dharmarajan, Karthik, et al.
Published: (2025)
Color-Pair Guided Robust Zero-Shot 6D Pose Estimation and Tracking of Cluttered Objects on Edge Devices
by: Yang, Xingjian, et al.
Published: (2025)
by: Yang, Xingjian, et al.
Published: (2025)
O$^3$Afford: One-Shot 3D Object-to-Object Affordance Grounding for Generalizable Robotic Manipulation
by: Tian, Tongxuan, et al.
Published: (2025)
by: Tian, Tongxuan, et al.
Published: (2025)
High-Speed Vision Improves Zero-Shot Semantic Understanding of Human Actions
by: Cao, Yongpeng, et al.
Published: (2026)
by: Cao, Yongpeng, et al.
Published: (2026)
LiDAR Loop Closure Detection using Semantic Graphs with Graph Attention Networks
by: Yang, Liudi, et al.
Published: (2025)
by: Yang, Liudi, et al.
Published: (2025)
DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments
by: Ma, Ji, et al.
Published: (2024)
by: Ma, Ji, et al.
Published: (2024)
AERR-Nav: Adaptive Exploration-Recovery-Reminiscing Strategy for Zero-Shot Object Navigation
by: Huang, Jingzhi, et al.
Published: (2026)
by: Huang, Jingzhi, et al.
Published: (2026)
FOMO-3D: Using Vision Foundation Models for Long-Tailed 3D Object Detection
by: Yang, Anqi Joyce, et al.
Published: (2026)
by: Yang, Anqi Joyce, et al.
Published: (2026)
A Comparative Evaluation of Large Vision-Language Models for 2D Object Detection under SOTIF Conditions
by: Zhou, Ji, et al.
Published: (2026)
by: Zhou, Ji, et al.
Published: (2026)
Observer-Actor: Active Vision Imitation Learning with Sparse-View Gaussian Splatting
by: Wang, Yilong, et al.
Published: (2025)
by: Wang, Yilong, et al.
Published: (2025)
HiMemVLN: Enhancing Reliability of Open-Source Zero-Shot Vision-and-Language Navigation with Hierarchical Memory System
by: Lyu, Kailin, et al.
Published: (2026)
by: Lyu, Kailin, et al.
Published: (2026)
PlantTrack: Task-Driven Plant Keypoint Tracking with Zero-Shot Sim2Real Transfer
by: Marri, Samhita, et al.
Published: (2024)
by: Marri, Samhita, et al.
Published: (2024)
Similar Items
-
Learning in ImaginationLand: Omnidirectional Policies through 3D Generative Models (OP-Gen)
by: Ren, Yifei, et al.
Published: (2025) -
DegustaBot: Zero-Shot Visual Preference Estimation for Personalized Multi-Object Rearrangement
by: Newman, Benjamin A., et al.
Published: (2024) -
DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation
by: Kim, Young Hun, et al.
Published: (2025) -
Zero-Shot 3D Visual Grounding from Vision-Language Models
by: Li, Rong, et al.
Published: (2025) -
Constraint-Aware Zero-Shot Vision-Language Navigation in Continuous Environments
by: Chen, Kehan, et al.
Published: (2024)