| Main Authors: | Byrd, Grayson, Rivera, Corban, Kemp, Bethany, Booker, Meghan, Schmidt, Aurora, de Melo, Celso M, Seenivasan, Lalithkumar, Unberath, Mathias |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.06357 |
Similar Items
EmbodiedRAG: Dynamic 3D Scene Graph Retrieval for Efficient and Scalable Robot Task Planning
by: Booker, Meghan, et al.
Published: (2024)
FLEET: Formal Language-Grounded Scheduling for Heterogeneous Robot Teams
by: Rivera, Corban, et al.
Published: (2025)
ConceptAgent: LLM-Driven Precondition Grounding and Tree Search for Robust Task Planning and Execution
by: Rivera, Corban, et al.
Published: (2024)
Beyond Rigid AI: Towards Natural Human-Machine Symbiosis for Interoperative Surgical Assistance
by: Seenivasan, Lalithkumar, et al.
Published: (2025)
Towards Robust Surgical Automation via Digital Twin Representations from Foundation Models
by: Ding, Hao, et al.
Published: (2024)
AffordTissue: Dense Affordance Prediction for Tool-Action Specific Tissue Interaction
by: Maksutova, Aiza, et al.
Published: (2026)
Investigating Robot Control Policy Learning for Autonomous X-ray-guided Spine Procedures
by: Klitzner, Florence, et al.
Published: (2025)
TwinOR: Photorealistic Digital Twins of Dynamic Operating Rooms for Embodied AI Research
by: Zhang, Han, et al.
Published: (2025)
DualVision ArthroNav: Investigating Opportunities to Enhance Localization and Reconstruction in Image-based Arthroscopy Navigation via External Cameras
by: Shu, Hongchao, et al.
Published: (2025)
StraightTrack: Towards Mixed Reality Navigation System for Percutaneous K-wire Insertion
by: Zhang, Han, et al.
Published: (2024)
Position: Foundation Models Need Digital Twin Representations
by: Shen, Yiqing, et al.
Published: (2025)
Online Reasoning Video Segmentation with Just-in-Time Digital Twins
by: Shen, Yiqing, et al.
Published: (2025)
Surgical-VQLA++: Adversarial Contrastive Learning for Calibrated Robust Visual Question-Localized Answering in Robotic Surgery
by: Bai, Long, et al.
Published: (2024)
Open-Source, Cost-Aware Kinematically Feasible Planning for Mobile and Surface Robotics
by: Macenski, Steve, et al.
Published: (2024)
Unmasking the Illusion of Embodied Reasoning in Vision-Language-Action Models
by: Xu, Haiweng, et al.
Published: (2026)
PFEA: An LLM-based High-Level Natural Language Planning and Feedback Embodied Agent for Human-Centered AI
by: Ding, Wenbin, et al.
Published: (2025)
Seamless Augmented Reality Integration in Arthroscopy: A Pipeline for Articular Reconstruction and Guidance
by: Shu, Hongchao, et al.
Published: (2024)
Survey of Vision-Language-Action Models for Embodied Manipulation
by: Li, Haoran, et al.
Published: (2025)
X-DiffVLA: X-Embodied Diffusion Action Heads for Vision-Language-Action Models
by: Li, Boyu, et al.
Published: (2026)
Steerable Vision-Language-Action Policies for Embodied Reasoning and Hierarchical Control
by: Chen, William, et al.
Published: (2026)
World-Value-Action Model: Implicit Planning for Vision-Language-Action Systems
by: Li, Runze, et al.
Published: (2026)
3D Path Planning for Robot-assisted Vertebroplasty from Arbitrary Bi-plane X-ray via Differentiable Rendering
by: Inigo, Blanca, et al.
Published: (2025)
Logically Constrained Robotics Transformers for Enhanced Perception-Action Planning
by: Kapoor, Parv, et al.
Published: (2024)
Embodied Interpretability: Linking Causal Understanding to Generalization in Vision-Language-Action Models
by: Zhang, Hanxin, et al.
Published: (2026)
MEM: Multi-Scale Embodied Memory for Vision Language Action Models
by: Torne, Marcel, et al.
Published: (2026)
From Language to Action: Can LLM-Based Agents Be Used for Embodied Robot Cognition?
by: Shaji, Shinas, et al.
Published: (2026)
DM0: An Embodied-Native Vision-Language-Action Model towards Physical AI
by: Yu, En, et al.
Published: (2026)
Model Adaptation for Time Constrained Embodied Control
by: Song, Jaehyun, et al.
Published: (2024)
Jailbreaking Embodied LLMs via Action-level Manipulation
by: Huang, Xinyu, et al.
Published: (2026)
Efficient Vision-Language-Action Models for Embodied Manipulation: A Systematic Survey
by: Guan, Weifan, et al.
Published: (2025)
RealMirror: A Comprehensive, Open-Source Vision-Language-Action Platform for Embodied AI
by: Tai, Cong, et al.
Published: (2025)
HALO: A Unified Vision-Language-Action Model for Embodied Multimodal Chain-of-Thought Reasoning
by: Shou, Quanxin, et al.
Published: (2026)
Agentic Robot: A Brain-Inspired Framework for Vision-Language-Action Models in Embodied Agents
by: Yang, Zhejian, et al.
Published: (2025)
Perceive With Confidence: Statistical Safety Assurances for Navigation with Learning-Based Perception
by: Mei, Zhiting, et al.
Published: (2024)
SAW: Toward a Surgical Action World Model via Controllable and Scalable Video Generation
by: Rapuri, Sampath, et al.
Published: (2026)
Intelligent Control of Robotic X-ray Devices using a Language-promptable Digital Twin
by: Killeen, Benjamin D., et al.
Published: (2024)
Action Contextualization: Adaptive Task Planning and Action Tuning using Large Language Models
by: Gupta, Sthithpragya, et al.
Published: (2024)
Resilience Meets Autonomy: Governing Embodied AI in Critical Infrastructure
by: Sharma, Puneet, et al.
Published: (2026)
PRISM: Planning and Reasoning with Intent in Simulated Embodied Environments
by: Lim, Yunn Kang, et al.
Published: (2026)
Embodied AI in Mobile Robots: Coverage Path Planning with Large Language Models
by: Kong, Xiangrui, et al.
Published: (2024)