| Main Authors: | Samavati, Taha; Soryani, Mohsen; Mansouri, Sina |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.00900 |
Similar Items
Cortex 2.0: Grounding World Models in Real-World Industrial Deployment
by: Aida, Adriana, et al.
Published: (2026)
Dream to Fly: Model-Based Reinforcement Learning for Vision-Based Drone Flight
by: Romero, Angel, et al.
Published: (2025)
WayFASTER: a Self-Supervised Traversability Prediction for Increased Navigation Awareness
by: Gasparino, Mateus Valverde, et al.
Published: (2024)
Lifelong Learning in Vision-Language Models: Enhanced EWC with Cross-Modal Knowledge Retention
by: Durrani, Hamza Ahmed, et al.
Published: (2026)
Locate 3D: Real-World Object Localization via Self-Supervised Learning in 3D
by: Arnaud, Sergio, et al.
Published: (2025)
A Segmented Robot Grasping Perception Neural Network for Edge AI
by: Bröcheler, Casper, et al.
Published: (2025)
CLARE: Continual Learning for Vision-Language-Action Models via Autonomous Adapter Routing and Expansion
by: Römer, Ralf, et al.
Published: (2026)
ExpReS-VLA: Specializing Vision-Language-Action Models Through Experience Replay and Retrieval
by: Syed, Shahram Najam, et al.
Published: (2025)
Failure Prediction at Runtime for Generative Robot Policies
by: Römer, Ralf, et al.
Published: (2025)
The Impact of 2D Segmentation Backbones on Point Cloud Predictions Using 4D Radar
by: Muckelroy III, William, et al.
Published: (2025)
Infrastructure-Centric World Models: Bridging Temporal Depth and Spatial Breadth for Roadside Perception
by: Meng, Siyuan, et al.
Published: (2026)
Motion Perceiver: Real-Time Occupancy Forecasting for Embedded Systems
by: Ferenczi, Bryce, et al.
Published: (2023)
Curb Your Attention: Causal Attention Gating for Robust Trajectory Prediction in Autonomous Driving
by: Ahmadi, Ehsan, et al.
Published: (2024)
From Demonstrations to Safe Deployment: Path-Consistent Safety Filtering for Diffusion Policies
by: Römer, Ralf, et al.
Published: (2025)
Bayesian Data Augmentation and Training for Perception DNN in Autonomous Aerial Vehicles
by: Rasul, Ashik E, et al.
Published: (2024)
Large Language Models and 3D Vision for Intelligent Robotic Perception and Autonomy
by: Mehta, Vinit, et al.
Published: (2025)
EmbodiedLGR: Integrating Lightweight Graph Representation and Retrieval for Semantic-Spatial Memory in Robotic Agents
by: Riva, Paolo, et al.
Published: (2026)
Contextual Graph Representations for Task-Driven 3D Perception and Planning
by: Agia, Christopher
Published: (2026)
RoboPack: Learning Tactile-Informed Dynamics Models for Dense Packing
by: Ai, Bo, et al.
Published: (2024)
Deployment-Time Reliability of Learned Robot Policies
by: Agia, Christopher
Published: (2026)
Sparse Concept Bottleneck Models: Gumbel Tricks in Contrastive Learning
by: Semenov, Andrei, et al.
Published: (2024)
Accelerating Model-Based Reinforcement Learning with State-Space World Models
by: Krinner, Maria, et al.
Published: (2025)
UAV-assisted Visual SLAM Generating Reconstructed 3D Scene Graphs in GPS-denied Environments
by: Radwan, Ahmed, et al.
Published: (2024)
Part-Level 3D Gaussian Vehicle Generation with Joint and Hinge Axis Estimation
by: Qian, Shiyao, et al.
Published: (2026)
CLIP-Joint-Detect: End-to-End Joint Training of Object Detectors with Contrastive Vision-Language Supervision
by: Raoufi, Behnam, et al.
Published: (2025)
Unpacking Failure Modes of Generative Policies: Runtime Monitoring of Consistency and Progress
by: Agia, Christopher, et al.
Published: (2024)
EduFlow: Advancing MLLMs' Problem-Solving Proficiency through Multi-Stage, Multi-Perspective Critique
by: Zhu, Chenglin, et al.
Published: (2025)
Industrial Robot Motion Planning with GPUs: Integration of cuRobo for Extended DOF Systems
by: Abuelsamen, Luai, et al.
Published: (2025)
Cooperative Perception: A Resource-Efficient Framework for Multi-Drone 3D Scene Reconstruction Using Federated Diffusion and NeRF
by: Pourmandi, Massoud
Published: (2025)
Towards Cognitive Collaborative Robots: Semantic-Level Integration and Explainable Control for Human-Centric Cooperation
by: Oh, Jaehong
Published: (2025)
HumanEgo: Zero-Shot Robot Learning from Minutes of Human Egocentric Videos
by: Wang, Zhi, et al.
Published: (2026)
COBRA-PPM: A Causal Bayesian Reasoning Architecture Using Probabilistic Programming for Robot Manipulation Under Uncertainty
by: Cannizzaro, Ricardo, et al.
Published: (2024)
OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis
by: Chen, Junting, et al.
Published: (2025)
OpenMap: Instruction Grounding via Open-Vocabulary Visual-Language Mapping
by: Li, Danyang, et al.
Published: (2025)
Vision-based Situational Graphs Exploiting Fiducial Markers for the Integration of Semantic Entities
by: Tourani, Ali, et al.
Published: (2023)
Taking Flight with Dialogue: Enabling Natural Language Control for PX4-based Drone Agent
by: Lim, Shoon Kit, et al.
Published: (2025)
StratXplore: Strategic Novelty-seeking and Instruction-aligned Exploration for Vision and Language Navigation
by: Gopinathan, Muraleekrishna, et al.
Published: (2024)
Deep Probabilistic Traversability with Test-time Adaptation for Uncertainty-aware Planetary Rover Navigation
by: Endo, Masafumi, et al.
Published: (2024)
CoMoCAVs: Cohesive Decision-Guided Motion Planning for Connected and Autonomous Vehicles with Multi-Policy Reinforcement Learning
by: Hu, Pan
Published: (2025)
Pointing-Guided Target Estimation via Transformer-Based Attention
by: Müller, Luca, et al.
Published: (2025)