:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Huang, Xuying, Pan, Sicong, Bennewitz, Maren
Format:	Preprint
Published:	2025
Subjects:	Robotics Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2505.07766
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Privacy-Preserving Semantic Segmentation from Ultra-Low-Resolution RGB Inputs
by: Huang, Xuying, et al.
Published: (2025)

DM-OSVP++: One-Shot View Planning Using 3D Diffusion Models for Active RGB-Based Object Reconstruction
by: Pan, Sicong, et al.
Published: (2025)

Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning
by: Pan, Sicong, et al.
Published: (2024)

Designing Privacy-Preserving Visual Perception for Robot Navigation Based on User Privacy Preferences
by: Huang, Xuying, et al.
Published: (2026)

EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images
by: Menon, Rohit, et al.
Published: (2025)

SG-DOR: Learning Scene Graphs with Direction-Conditioned Occlusion Reasoning for Pepper Plants
by: Menon, Rohit, et al.
Published: (2026)

Multi-Objective Reinforcement Learning for Adaptable Personalized Autonomous Driving
by: Surmann, Hendrik, et al.
Published: (2025)

ObjView-Bench: Rethinking Difficulty and Deployment for Object-Centric View Planning
by: Pan, Sicong, et al.
Published: (2026)

RobotSeg: A Model and Dataset for Segmenting Robots in Image and Video
by: Mei, Haiyang, et al.
Published: (2025)

CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration
by: Yao, Gongxin, et al.
Published: (2024)

Multi-Modal Camera-Based Detection of Vulnerable Road Users
by: Brown, Penelope, et al.
Published: (2025)

Privacy Risks in Reinforcement Learning for Household Robots
by: Li, Miao, et al.
Published: (2023)

Robot Manipulation in Salient Vision through Referring Image Segmentation and Geometric Constraints
by: Jiang, Chen, et al.
Published: (2024)

RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation
by: Jiang, Yuming, et al.
Published: (2025)

Robot Goes Fishing: Rapid, High-Resolution Biological Hotspot Mapping in Coral Reefs with Vision-Guided Autonomous Underwater Vehicles
by: Yang, Daniel, et al.
Published: (2023)

Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision
by: Han, Xiaofeng, et al.
Published: (2025)

DriveVLM-RL: Neuroscience-Inspired Reinforcement Learning with Vision-Language Models for Safe and Deployable Autonomous Driving
by: Huang, Zilin, et al.
Published: (2026)

Image-based Geo-localization for Robotics: Are Black-box Vision-Language Models there yet?
by: Waheed, Sania, et al.
Published: (2025)

MEM: Multi-Modal Elevation Mapping for Robotics and Learning
by: Erni, Gian, et al.
Published: (2023)

Low Resolution Next Best View for Robot Packing
by: Preziosa, Giuseppe Fabio, et al.
Published: (2025)

QUAR-VLA: Vision-Language-Action Model for Quadruped Robots
by: Ding, Pengxiang, et al.
Published: (2023)

RoboGround: Robotic Manipulation with Grounded Vision-Language Priors
by: Huang, Haifeng, et al.
Published: (2025)

Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition
by: Shang, Tianyi, et al.
Published: (2025)

Collaborative Representation Learning for Alignment of Tactile, Language, and Vision Modalities
by: Zhou, Yiyun, et al.
Published: (2025)

World Model for Robot Learning: A Comprehensive Survey
by: Hou, Bohan, et al.
Published: (2026)

MobileVLA-R1: Reinforcing Vision-Language-Action for Mobile Robots
by: Huang, Ting, et al.
Published: (2025)

OnSiteVRU: A High-Resolution Trajectory Dataset for High-Density Vulnerable Road Users
by: Yan, Zhangcun, et al.
Published: (2025)

RobotPan: A 360$^\circ$ Surround-View Robotic Vision System for Embodied Perception
by: Ma, Jiahao, et al.
Published: (2026)

A Benchmarking Study of Vision-based Robotic Grasping Algorithms
by: Rameshbabu, Bharath K, et al.
Published: (2025)

Towards an Accurate and Effective Robot Vision (The Problem of Topological Localization for Mobile Robots)
by: Boros, Emanuela
Published: (2025)

AsyMoE: Leveraging Modal Asymmetry for Enhanced Expert Specialization in Large Vision-Language Models
by: Zhang, Heng, et al.
Published: (2025)

ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver
by: Song, Wenxuan, et al.
Published: (2025)

SpikePingpong: Spike Vision-based Fast-Slow Pingpong Robot System
by: Wang, Hao, et al.
Published: (2025)

Vision Language Action Models in Robotic Manipulation: A Systematic Review
by: Din, Muhayy Ud, et al.
Published: (2025)

MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation
by: Shi, Hao, et al.
Published: (2025)

GuideNav: User-Informed Development of a Vision-Only Robotic Navigation Assistant For Blind Travelers
by: Hwang, Hochul, et al.
Published: (2025)

Privacy-Preserving Multi-Stage Fall Detection Framework with Semi-supervised Federated Learning and Robotic Vision Confirmation
by: Azghadi, Seyed Alireza Rahimi, et al.
Published: (2025)

Increasing the Efficiency of DETR for Maritime High-Resolution Images
by: Yehuala, Tinsae, et al.
Published: (2026)

AnyUser: Translating Sketched User Intent into Domestic Robots
by: Yang, Songyuan, et al.
Published: (2026)

Taccel: Scaling Up Vision-based Tactile Robotics via High-performance GPU Simulation
by: Li, Yuyang, et al.
Published: (2025)