| Main Authors: | Huang, Xuying, Pan, Sicong, Bennewitz, Maren |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.07766 |
Similar Items
Privacy-Preserving Semantic Segmentation from Ultra-Low-Resolution RGB Inputs
by: Huang, Xuying, et al.
Published: (2025)
DM-OSVP++: One-Shot View Planning Using 3D Diffusion Models for Active RGB-Based Object Reconstruction
by: Pan, Sicong, et al.
Published: (2025)
Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning
by: Pan, Sicong, et al.
Published: (2024)
Designing Privacy-Preserving Visual Perception for Robot Navigation Based on User Privacy Preferences
by: Huang, Xuying, et al.
Published: (2026)
EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images
by: Menon, Rohit, et al.
Published: (2025)
SG-DOR: Learning Scene Graphs with Direction-Conditioned Occlusion Reasoning for Pepper Plants
by: Menon, Rohit, et al.
Published: (2026)
Multi-Objective Reinforcement Learning for Adaptable Personalized Autonomous Driving
by: Surmann, Hendrik, et al.
Published: (2025)
ObjView-Bench: Rethinking Difficulty and Deployment for Object-Centric View Planning
by: Pan, Sicong, et al.
Published: (2026)
RobotSeg: A Model and Dataset for Segmenting Robots in Image and Video
by: Mei, Haiyang, et al.
Published: (2025)
CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration
by: Yao, Gongxin, et al.
Published: (2024)
Multi-Modal Camera-Based Detection of Vulnerable Road Users
by: Brown, Penelope, et al.
Published: (2025)
Privacy Risks in Reinforcement Learning for Household Robots
by: Li, Miao, et al.
Published: (2023)
Robot Manipulation in Salient Vision through Referring Image Segmentation and Geometric Constraints
by: Jiang, Chen, et al.
Published: (2024)
RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation
by: Jiang, Yuming, et al.
Published: (2025)
Robot Goes Fishing: Rapid, High-Resolution Biological Hotspot Mapping in Coral Reefs with Vision-Guided Autonomous Underwater Vehicles
by: Yang, Daniel, et al.
Published: (2023)
Multimodal Fusion and Vision-Language Models: A Survey for Robot Vision
by: Han, Xiaofeng, et al.
Published: (2025)
DriveVLM-RL: Neuroscience-Inspired Reinforcement Learning with Vision-Language Models for Safe and Deployable Autonomous Driving
by: Huang, Zilin, et al.
Published: (2026)
Image-based Geo-localization for Robotics: Are Black-box Vision-Language Models there yet?
by: Waheed, Sania, et al.
Published: (2025)
MEM: Multi-Modal Elevation Mapping for Robotics and Learning
by: Erni, Gian, et al.
Published: (2023)
Low Resolution Next Best View for Robot Packing
by: Preziosa, Giuseppe Fabio, et al.
Published: (2025)
QUAR-VLA: Vision-Language-Action Model for Quadruped Robots
by: Ding, Pengxiang, et al.
Published: (2023)
RoboGround: Robotic Manipulation with Grounded Vision-Language Priors
by: Huang, Haifeng, et al.
Published: (2025)
Bridging Text and Vision: A Multi-View Text-Vision Registration Approach for Cross-Modal Place Recognition
by: Shang, Tianyi, et al.
Published: (2025)
Collaborative Representation Learning for Alignment of Tactile, Language, and Vision Modalities
by: Zhou, Yiyun, et al.
Published: (2025)
World Model for Robot Learning: A Comprehensive Survey
by: Hou, Bohan, et al.
Published: (2026)
MobileVLA-R1: Reinforcing Vision-Language-Action for Mobile Robots
by: Huang, Ting, et al.
Published: (2025)
OnSiteVRU: A High-Resolution Trajectory Dataset for High-Density Vulnerable Road Users
by: Yan, Zhangcun, et al.
Published: (2025)
RobotPan: A 360$^\circ$ Surround-View Robotic Vision System for Embodied Perception
by: Ma, Jiahao, et al.
Published: (2026)
A Benchmarking Study of Vision-based Robotic Grasping Algorithms
by: Rameshbabu, Bharath K, et al.
Published: (2025)
Towards an Accurate and Effective Robot Vision (The Problem of Topological Localization for Mobile Robots)
by: Boros, Emanuela
Published: (2025)
AsyMoE: Leveraging Modal Asymmetry for Enhanced Expert Specialization in Large Vision-Language Models
by: Zhang, Heng, et al.
Published: (2025)
ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver
by: Song, Wenxuan, et al.
Published: (2025)
SpikePingpong: Spike Vision-based Fast-Slow Pingpong Robot System
by: Wang, Hao, et al.
Published: (2025)
Vision Language Action Models in Robotic Manipulation: A Systematic Review
by: Din, Muhayy Ud, et al.
Published: (2025)
MemoryVLA: Perceptual-Cognitive Memory in Vision-Language-Action Models for Robotic Manipulation
by: Shi, Hao, et al.
Published: (2025)
GuideNav: User-Informed Development of a Vision-Only Robotic Navigation Assistant For Blind Travelers
by: Hwang, Hochul, et al.
Published: (2025)
Privacy-Preserving Multi-Stage Fall Detection Framework with Semi-supervised Federated Learning and Robotic Vision Confirmation
by: Azghadi, Seyed Alireza Rahimi, et al.
Published: (2025)
Increasing the Efficiency of DETR for Maritime High-Resolution Images
by: Yehuala, Tinsae, et al.
Published: (2026)
AnyUser: Translating Sketched User Intent into Domestic Robots
by: Yang, Songyuan, et al.
Published: (2026)
Taccel: Scaling Up Vision-based Tactile Robotics via High-performance GPU Simulation
by: Li, Yuyang, et al.
Published: (2025)