Library Catalog

Bibliographic Details
Main Authors:	Ahmad, Sarat; Hafeez, Maryam; Zaidi, Syed Ali Raza
Format:	Preprint
Published:	2026
Subjects:	Robotics; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2601.14921

Similar Items

Benchmarking Vector, Graph and Hybrid Retrieval Augmented Generation (RAG) Pipelines for Open Radio Access Networks (ORAN)
by: Ahmad, Sarat, et al.
Published: (2025)

Generative AI on the Edge: Architecture and Performance Evaluation
by: Nezami, Zeinab, et al.
Published: (2024)

A Unified Framework for Real-Time Failure Handling in Robotics Using Vision-Language Models, Reactive Planner and Behavior Trees
by: Ahmad, Faseeh, et al.
Published: (2025)

From Connectivity to Autonomy: The Dawn of Self-Evolving Communication Systems
by: Nezami, Zeinab, et al.
Published: (2025)

ROSGPT_Vision: Commanding Robots Using Only Language Models' Prompts
by: Benjdira, Bilel, et al.
Published: (2023)

Decision-Theoretic Safety Assessment of Persona-Driven Multi-Agent Systems in O-RAN
by: Nezami, Zeinab, et al.
Published: (2026)

A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning
by: Zhai, Shaopeng, et al.
Published: (2025)

Safety Aware Task Planning via Large Language Models in Robotics
by: Khan, Azal Ahmad, et al.
Published: (2025)

Real-Time Human-Robot Interaction Intent Detection Using RGB-based Pose and Emotion Cues with Cross-Camera Model Generalization
by: Mohsen, Farida, et al.
Published: (2025)

Edge-Based Multimodal Sensor Data Fusion with Vision Language Models (VLMs) for Real-time Autonomous Vehicle Accident Avoidance
by: Yang, Fengze, et al.
Published: (2025)

Adversarial Attacks on Robotic Vision Language Action Models
by: Jones, Eliot Krzysztof, et al.
Published: (2025)

ActionFlow: A Pipelined Action Acceleration for Vision Language Models on Edge
by: Dai, Yuntao, et al.
Published: (2025)

SignVLA: A Gloss-Free Vision-Language-Action Framework for Real-Time Sign Language-Guided Robotic Manipulation
by: Tan, Xinyu, et al.
Published: (2026)

Real-Time Imitation of Human Head Motions, Blinks and Emotions by Nao Robot: A Closed-Loop Approach
by: Rayati, Keyhan, et al.
Published: (2025)

RobotDesignGPT: Automated Robot Design Synthesis using Vision Language Models
by: Sontakke, Nitish, et al.
Published: (2026)

Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics
by: Wang, Taowen, et al.
Published: (2024)

Vision-Language-Policy Model for Dynamic Robot Task Planning
by: Wang, Jin, et al.
Published: (2025)

Emergence of Human to Robot Transfer in Vision-Language-Action Models
by: Kareer, Simar, et al.
Published: (2025)

Commonsense Reasoning for Legged Robot Adaptation with Vision-Language Models
by: Chen, Annie S., et al.
Published: (2024)

Maestro: Orchestrating Robotics Modules with Vision-Language Models for Zero-Shot Generalist Robots
by: Shi, Junyao, et al.
Published: (2025)

Task-Oriented Edge-Assisted Cross-System Design for Real-Time Human-Robot Interaction in Industrial Metaverse
by: Chen, Kan, et al.
Published: (2025)

Experiences from Benchmarking Vision-Language-Action Models for Robotic Manipulation
by: Zhang, Yihao, et al.
Published: (2025)

CubeRobot: Grounding Language in Rubik's Cube Manipulation via Vision-Language Model
by: Wang, Feiyang, et al.
Published: (2025)

Test-Time Adaptation for Tactile-Vision-Language Models
by: Ye, Chuyang, et al.
Published: (2026)

TTT-Parkour: Rapid Test-Time Training for Perceptive Robot Parkour
by: Zhu, Shaoting, et al.
Published: (2026)

VLMgineer: Vision Language Models as Robotic Toolsmiths
by: Gao, George Jiayuan, et al.
Published: (2025)

Real-Time Instrument Planning and Perception for Novel Measurements of Dynamic Phenomena
by: Zilberstein, Itai, et al.
Published: (2025)

Large Reward Models: Generalizable Online Robot Reward Generation with Vision-Language Models
by: Wu, Yanru, et al.
Published: (2026)

Build on Priors: Vision--Language--Guided Neuro-Symbolic Imitation Learning for Data-Efficient Real-World Robot Manipulation
by: Lorang, Pierrick, et al.
Published: (2026)

ManipBench: Benchmarking Vision-Language Models for Low-Level Robot Manipulation
by: Zhao, Enyu, et al.
Published: (2025)

Planning with Vision-Language Models and a Use Case in Robot-Assisted Teaching
by: Dang, Xuzhe, et al.
Published: (2025)

Leveraging Foundation Models for Enhancing Robot Perception and Action
by: Mirjalili, Reihaneh
Published: (2025)

Vision-Language Foundation Models as Effective Robot Imitators
by: Li, Xinghang, et al.
Published: (2023)

RoboFlamingo-Plus: Fusion of Depth and RGB Perception with Vision-Language Models for Enhanced Robotic Manipulation
by: Wang, Sheng
Published: (2025)

LADEV: A Language-Driven Testing and Evaluation Platform for Vision-Language-Action Models in Robotic Manipulation
by: Wang, Zhijie, et al.
Published: (2024)

LACY: A Vision-Language Model-based Language-Action Cycle for Self-Improving Robotic Manipulation
by: Hong, Youngjin, et al.
Published: (2025)

AppleVLM: End-to-end Autonomous Driving with Advanced Perception and Planning-Enhanced Vision-Language Models
by: Han, Yuxuan, et al.
Published: (2026)

V-VLAPS: Value-Guided Planning for Vision-Language-Action Models
by: Ren, Ke, et al.
Published: (2026)

Characterizing Vision-Language-Action Models across XPUs: Constraints and Acceleration for On-Robot Deployment
by: Zhou, Kaijun, et al.
Published: (2026)

GraspCorrect: Robotic Grasp Correction via Vision-Language Model-Guided Feedback
by: Lee, Sungjae, et al.
Published: (2025)