Library Catalog

Bibliographic Details
Main Authors:	Byrd, Grayson; Rivera, Corban; Kemp, Bethany; Booker, Meghan; Schmidt, Aurora; de Melo, Celso M.; Seenivasan, Lalithkumar; Unberath, Mathias
Format:	Preprint
Published:	2025
Subjects:	Robotics; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2510.06357

Similar Items

EmbodiedRAG: Dynamic 3D Scene Graph Retrieval for Efficient and Scalable Robot Task Planning
by: Booker, Meghan, et al.
Published: (2024)

FLEET: Formal Language-Grounded Scheduling for Heterogeneous Robot Teams
by: Rivera, Corban, et al.
Published: (2025)

ConceptAgent: LLM-Driven Precondition Grounding and Tree Search for Robust Task Planning and Execution
by: Rivera, Corban, et al.
Published: (2024)

Beyond Rigid AI: Towards Natural Human-Machine Symbiosis for Interoperative Surgical Assistance
by: Seenivasan, Lalithkumar, et al.
Published: (2025)

Towards Robust Surgical Automation via Digital Twin Representations from Foundation Models
by: Ding, Hao, et al.
Published: (2024)

AffordTissue: Dense Affordance Prediction for Tool-Action Specific Tissue Interaction
by: Maksutova, Aiza, et al.
Published: (2026)

Investigating Robot Control Policy Learning for Autonomous X-ray-guided Spine Procedures
by: Klitzner, Florence, et al.
Published: (2025)

TwinOR: Photorealistic Digital Twins of Dynamic Operating Rooms for Embodied AI Research
by: Zhang, Han, et al.
Published: (2025)

DualVision ArthroNav: Investigating Opportunities to Enhance Localization and Reconstruction in Image-based Arthroscopy Navigation via External Cameras
by: Shu, Hongchao, et al.
Published: (2025)

StraightTrack: Towards Mixed Reality Navigation System for Percutaneous K-wire Insertion
by: Zhang, Han, et al.
Published: (2024)

Position: Foundation Models Need Digital Twin Representations
by: Shen, Yiqing, et al.
Published: (2025)

Online Reasoning Video Segmentation with Just-in-Time Digital Twins
by: Shen, Yiqing, et al.
Published: (2025)

Surgical-VQLA++: Adversarial Contrastive Learning for Calibrated Robust Visual Question-Localized Answering in Robotic Surgery
by: Bai, Long, et al.
Published: (2024)

Open-Source, Cost-Aware Kinematically Feasible Planning for Mobile and Surface Robotics
by: Macenski, Steve, et al.
Published: (2024)

Unmasking the Illusion of Embodied Reasoning in Vision-Language-Action Models
by: Xu, Haiweng, et al.
Published: (2026)

PFEA: An LLM-based High-Level Natural Language Planning and Feedback Embodied Agent for Human-Centered AI
by: Ding, Wenbin, et al.
Published: (2025)

Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance
by: Shu, Hongchao, et al.
Published: (2024)

Survey of Vision-Language-Action Models for Embodied Manipulation
by: Li, Haoran, et al.
Published: (2025)

X-DiffVLA: X-Embodied Diffusion Action Heads for Vision-Language-Action Models
by: Li, Boyu, et al.
Published: (2026)

Steerable Vision-Language-Action Policies for Embodied Reasoning and Hierarchical Control
by: Chen, William, et al.
Published: (2026)

World-Value-Action Model: Implicit Planning for Vision-Language-Action Systems
by: Li, Runze, et al.
Published: (2026)

3D Path Planning for Robot-assisted Vertebroplasty from Arbitrary Bi-plane X-ray via Differentiable Rendering
by: Inigo, Blanca, et al.
Published: (2025)

Logically Constrained Robotics Transformers for Enhanced Perception-Action Planning
by: Kapoor, Parv, et al.
Published: (2024)

Embodied Interpretability: Linking Causal Understanding to Generalization in Vision-Language-Action Models
by: Zhang, Hanxin, et al.
Published: (2026)

MEM: Multi-Scale Embodied Memory for Vision Language Action Models
by: Torne, Marcel, et al.
Published: (2026)

From Language to Action: Can LLM-Based Agents Be Used for Embodied Robot Cognition?
by: Shaji, Shinas, et al.
Published: (2026)

DM0: An Embodied-Native Vision-Language-Action Model towards Physical AI
by: Yu, En, et al.
Published: (2026)

Model Adaptation for Time Constrained Embodied Control
by: Song, Jaehyun, et al.
Published: (2024)

Jailbreaking Embodied LLMs via Action-level Manipulation
by: Huang, Xinyu, et al.
Published: (2026)

Efficient Vision-Language-Action Models for Embodied Manipulation: A Systematic Survey
by: Guan, Weifan, et al.
Published: (2025)

RealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AI
by: Tai, Cong, et al.
Published: (2025)

HALO: A Unified Vision-Language-Action Model for Embodied Multimodal Chain-of-Thought Reasoning
by: Shou, Quanxin, et al.
Published: (2026)

Agentic Robot: A Brain-Inspired Framework for Vision-Language-Action Models in Embodied Agents
by: Yang, Zhejian, et al.
Published: (2025)

Perceive With Confidence: Statistical Safety Assurances for Navigation with Learning-Based Perception
by: Mei, Zhiting, et al.
Published: (2024)

SAW: Toward a Surgical Action World Model via Controllable and Scalable Video Generation
by: Rapuri, Sampath, et al.
Published: (2026)

Intelligent Control of Robotic X-ray Devices using a Language-promptable Digital Twin
by: Killeen, Benjamin D., et al.
Published: (2024)

Action Contextualization: Adaptive Task Planning and Action Tuning using Large Language Models
by: Gupta, Sthithpragya, et al.
Published: (2024)

Resilience Meets Autonomy: Governing Embodied AI in Critical Infrastructure
by: Sharma, Puneet, et al.
Published: (2026)

PRISM: Planning and Reasoning with Intent in Simulated Embodied Environments
by: Lim, Yunn Kang, et al.
Published: (2026)

Embodied AI in Mobile Robots: Coverage Path Planning with Large Language Models
by: Kong, Xiangrui, et al.
Published: (2024)