| Main Authors: | Ahmad, Sarat; Hafeez, Maryam; Zaidi, Syed Ali Raza |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2601.14921 |
Similar Items
Benchmarking Vector, Graph and Hybrid Retrieval Augmented Generation (RAG) Pipelines for Open Radio Access Networks (ORAN)
by: Ahmad, Sarat, et al.
Published: (2025)
Generative AI on the Edge: Architecture and Performance Evaluation
by: Nezami, Zeinab, et al.
Published: (2024)
A Unified Framework for Real-Time Failure Handling in Robotics Using Vision-Language Models, Reactive Planner and Behavior Trees
by: Ahmad, Faseeh, et al.
Published: (2025)
From Connectivity to Autonomy: The Dawn of Self-Evolving Communication Systems
by: Nezami, Zeinab, et al.
Published: (2025)
ROSGPT_Vision: Commanding Robots Using Only Language Models' Prompts
by: Benjdira, Bilel, et al.
Published: (2023)
Decision-Theoretic Safety Assessment of Persona-Driven Multi-Agent Systems in O-RAN
by: Nezami, Zeinab, et al.
Published: (2026)
A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning
by: Zhai, Shaopeng, et al.
Published: (2025)
Safety Aware Task Planning via Large Language Models in Robotics
by: Khan, Azal Ahmad, et al.
Published: (2025)
Real-Time Human-Robot Interaction Intent Detection Using RGB-based Pose and Emotion Cues with Cross-Camera Model Generalization
by: Mohsen, Farida, et al.
Published: (2025)
Edge-Based Multimodal Sensor Data Fusion with Vision Language Models (VLMs) for Real-time Autonomous Vehicle Accident Avoidance
by: Yang, Fengze, et al.
Published: (2025)
Adversarial Attacks on Robotic Vision Language Action Models
by: Jones, Eliot Krzysztof, et al.
Published: (2025)
ActionFlow: A Pipelined Action Acceleration for Vision Language Models on Edge
by: Dai, Yuntao, et al.
Published: (2025)
SignVLA: A Gloss-Free Vision-Language-Action Framework for Real-Time Sign Language-Guided Robotic Manipulation
by: Tan, Xinyu, et al.
Published: (2026)
Real-Time Imitation of Human Head Motions, Blinks and Emotions by Nao Robot: A Closed-Loop Approach
by: Rayati, Keyhan, et al.
Published: (2025)
RobotDesignGPT: Automated Robot Design Synthesis using Vision Language Models
by: Sontakke, Nitish, et al.
Published: (2026)
Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics
by: Wang, Taowen, et al.
Published: (2024)
Vision-Language-Policy Model for Dynamic Robot Task Planning
by: Wang, Jin, et al.
Published: (2025)
Emergence of Human to Robot Transfer in Vision-Language-Action Models
by: Kareer, Simar, et al.
Published: (2025)
Commonsense Reasoning for Legged Robot Adaptation with Vision-Language Models
by: Chen, Annie S., et al.
Published: (2024)
Maestro: Orchestrating Robotics Modules with Vision-Language Models for Zero-Shot Generalist Robots
by: Shi, Junyao, et al.
Published: (2025)
Task-Oriented Edge-Assisted Cross-System Design for Real-Time Human-Robot Interaction in Industrial Metaverse
by: Chen, Kan, et al.
Published: (2025)
Experiences from Benchmarking Vision-Language-Action Models for Robotic Manipulation
by: Zhang, Yihao, et al.
Published: (2025)
CubeRobot: Grounding Language in Rubik's Cube Manipulation via Vision-Language Model
by: Wang, Feiyang, et al.
Published: (2025)
Test-Time Adaptation for Tactile-Vision-Language Models
by: Ye, Chuyang, et al.
Published: (2026)
TTT-Parkour: Rapid Test-Time Training for Perceptive Robot Parkour
by: Zhu, Shaoting, et al.
Published: (2026)
VLMgineer: Vision Language Models as Robotic Toolsmiths
by: Gao, George Jiayuan, et al.
Published: (2025)
Real-Time Instrument Planning and Perception for Novel Measurements of Dynamic Phenomena
by: Zilberstein, Itai, et al.
Published: (2025)
Large Reward Models: Generalizable Online Robot Reward Generation with Vision-Language Models
by: Wu, Yanru, et al.
Published: (2026)
Build on Priors: Vision-Language-Guided Neuro-Symbolic Imitation Learning for Data-Efficient Real-World Robot Manipulation
by: Lorang, Pierrick, et al.
Published: (2026)
ManipBench: Benchmarking Vision-Language Models for Low-Level Robot Manipulation
by: Zhao, Enyu, et al.
Published: (2025)
Planning with Vision-Language Models and a Use Case in Robot-Assisted Teaching
by: Dang, Xuzhe, et al.
Published: (2025)
Leveraging Foundation Models for Enhancing Robot Perception and Action
by: Mirjalili, Reihaneh
Published: (2025)
Vision-Language Foundation Models as Effective Robot Imitators
by: Li, Xinghang, et al.
Published: (2023)
RoboFlamingo-Plus: Fusion of Depth and RGB Perception with Vision-Language Models for Enhanced Robotic Manipulation
by: Wang, Sheng
Published: (2025)
LADEV: A Language-Driven Testing and Evaluation Platform for Vision-Language-Action Models in Robotic Manipulation
by: Wang, Zhijie, et al.
Published: (2024)
LACY: A Vision-Language Model-based Language-Action Cycle for Self-Improving Robotic Manipulation
by: Hong, Youngjin, et al.
Published: (2025)
AppleVLM: End-to-end Autonomous Driving with Advanced Perception and Planning-Enhanced Vision-Language Models
by: Han, Yuxuan, et al.
Published: (2026)
V-VLAPS: Value-Guided Planning for Vision-Language-Action Models
by: Ren, Ke, et al.
Published: (2026)
Characterizing Vision-Language-Action Models across XPUs: Constraints and Acceleration for On-Robot Deployment
by: Zhou, Kaijun, et al.
Published: (2026)
GraspCorrect: Robotic Grasp Correction via Vision-Language Model-Guided Feedback
by: Lee, Sungjae, et al.
Published: (2025)