Saved in:
| Main Authors: | Gong, Xicheng, Li, Qiwei, Xu, Peiran, Mu, Yadong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.25813 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation
by: Xu, Peiran, et al.
Published: (2025)
by: Xu, Peiran, et al.
Published: (2025)
RoboAgent: Chaining Basic Capabilities for Embodied Task Planning
by: Xu, Peiran, et al.
Published: (2026)
by: Xu, Peiran, et al.
Published: (2026)
RotVLA: Rotational Latent Action for Vision-Language-Action Model
by: Li, Qiwei, et al.
Published: (2026)
by: Li, Qiwei, et al.
Published: (2026)
ProgressVLA: Progress-Guided Diffusion Policy for Vision-Language Robotic Manipulation
by: Yan, Hongyu, et al.
Published: (2026)
by: Yan, Hongyu, et al.
Published: (2026)
Memory Centric Power Allocation for Multi-Agent Embodied Question Answering
by: Li, Chengyang, et al.
Published: (2026)
by: Li, Chengyang, et al.
Published: (2026)
FAST-EQA: Efficient Embodied Question Answering with Global and Local Region Relevancy
by: Zhang, Haochen, et al.
Published: (2026)
by: Zhang, Haochen, et al.
Published: (2026)
HIMM: Human-Inspired Long-Term Memory Modeling for Embodied Exploration and Question Answering
by: Li, Ji, et al.
Published: (2026)
by: Li, Ji, et al.
Published: (2026)
Visual Environment-Interactive Planning for Embodied Complex-Question Answering
by: Lan, Ning, et al.
Published: (2025)
by: Lan, Ning, et al.
Published: (2025)
EfficientEQA: An Efficient Approach to Open-Vocabulary Embodied Question Answering
by: Cheng, Kai, et al.
Published: (2024)
by: Cheng, Kai, et al.
Published: (2024)
Is the House Ready For Sleeptime? Generating and Evaluating Situational Queries for Embodied Question Answering
by: Dorbala, Vishnu Sashank, et al.
Published: (2024)
by: Dorbala, Vishnu Sashank, et al.
Published: (2024)
Map-based Modular Approach for Zero-shot Embodied Question Answering
by: Sakamoto, Koya, et al.
Published: (2024)
by: Sakamoto, Koya, et al.
Published: (2024)
Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering
by: Ginting, Muhammad Fadhil, et al.
Published: (2025)
by: Ginting, Muhammad Fadhil, et al.
Published: (2025)
Explore until Confident: Efficient Exploration for Embodied Question Answering
by: Ren, Allen Z., et al.
Published: (2024)
by: Ren, Allen Z., et al.
Published: (2024)
ConEQsA: Concurrent and Asynchronous Embodied Questions Scheduling and Answering
by: Wang, Haisheng, et al.
Published: (2025)
by: Wang, Haisheng, et al.
Published: (2025)
Neural Assembler: Learning to Generate Fine-Grained Robotic Assembly Instructions from Multi-View Images
by: Yan, Hongyu, et al.
Published: (2024)
by: Yan, Hongyu, et al.
Published: (2024)
NoisyEQA: Benchmarking Embodied Question Answering Against Noisy Queries
by: Wu, Tao, et al.
Published: (2024)
by: Wu, Tao, et al.
Published: (2024)
Closed-Loop Long-Horizon Robotic Planning via Equilibrium Sequence Modeling
by: Li, Jinghan, et al.
Published: (2024)
by: Li, Jinghan, et al.
Published: (2024)
RobotPan: A 360$^\circ$ Surround-View Robotic Vision System for Embodied Perception
by: Ma, Jiahao, et al.
Published: (2026)
by: Ma, Jiahao, et al.
Published: (2026)
LLM-Driven Self-Refinement for Embodied Drone Task Planning
by: Zhang, Deyu, et al.
Published: (2025)
by: Zhang, Deyu, et al.
Published: (2025)
Weakly-Supervised Affordance Grounding Guided by Part-Level Semantic Priors
by: Xu, Peiran, et al.
Published: (2025)
by: Xu, Peiran, et al.
Published: (2025)
Prune-Then-Plan: Step-Level Calibration for Stable Frontier Exploration in Embodied Question Answering
by: Frahm, Noah, et al.
Published: (2025)
by: Frahm, Noah, et al.
Published: (2025)
Large Model Empowered Embodied AI: A Survey on Decision-Making and Embodied Learning
by: Liang, Wenlong, et al.
Published: (2025)
by: Liang, Wenlong, et al.
Published: (2025)
Local Occupancy-Enhanced Object Grasping with Multiple Triplanar Projection
by: Ma, Kangqi, et al.
Published: (2024)
by: Ma, Kangqi, et al.
Published: (2024)
HyCodePolicy: Hybrid Language Controllers for Multimodal Monitoring and Decision in Embodied Agents
by: Liu, Yibin, et al.
Published: (2025)
by: Liu, Yibin, et al.
Published: (2025)
Embodied Tactile Perception of Soft Objects Properties
by: Dutta, Anirvan, et al.
Published: (2025)
by: Dutta, Anirvan, et al.
Published: (2025)
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
by: Li, Manling, et al.
Published: (2024)
by: Li, Manling, et al.
Published: (2024)
When Robots Should Say "I Don't Know": Benchmarking Abstention in Embodied Question Answering
by: Wu, Tao, et al.
Published: (2025)
by: Wu, Tao, et al.
Published: (2025)
Safety of Embodied Navigation: A Survey
by: Wang, Zixia, et al.
Published: (2025)
by: Wang, Zixia, et al.
Published: (2025)
EmboMatrix: A Scalable Training-Ground for Embodied Decision-Making
by: Lei, Zixing, et al.
Published: (2025)
by: Lei, Zixing, et al.
Published: (2025)
GraphEQA: Using 3D Semantic Scene Graphs for Real-time Embodied Question Answering
by: Saxena, Saumya, et al.
Published: (2024)
by: Saxena, Saumya, et al.
Published: (2024)
QueryCAD: Grounded Question Answering for CAD Models
by: Kienle, Claudius, et al.
Published: (2024)
by: Kienle, Claudius, et al.
Published: (2024)
MIPD: A Multi-sensory Interactive Perception Dataset for Embodied Intelligent Driving
by: Li, Zhiwei, et al.
Published: (2024)
by: Li, Zhiwei, et al.
Published: (2024)
Embodied Perception for Test-time Grasping Detection Adaptation with Knowledge Infusion
by: Liu, Jin, et al.
Published: (2025)
by: Liu, Jin, et al.
Published: (2025)
EmbodiedClaw: Conversational Workflow Execution for Embodied AI Development
by: Zhou, Xueyang, et al.
Published: (2026)
by: Zhou, Xueyang, et al.
Published: (2026)
AI or Human? Understanding Perceptions of Embodied Robots with LLMs
by: Hriscu, Lavinia, et al.
Published: (2025)
by: Hriscu, Lavinia, et al.
Published: (2025)
Open-Ended Multi-Modal Relational Reasoning for Video Question Answering
by: Luo, Haozheng, et al.
Published: (2020)
by: Luo, Haozheng, et al.
Published: (2020)
Look, Zoom, Understand: The Robotic Eyeball for Embodied Perception
by: Yang, Jiashu, et al.
Published: (2025)
by: Yang, Jiashu, et al.
Published: (2025)
EmbodiedCoder: Parameterized Embodied Mobile Manipulation via Modern Coding Model
by: Lin, Zefu, et al.
Published: (2025)
by: Lin, Zefu, et al.
Published: (2025)
Fourier Controller Networks for Real-Time Decision-Making in Embodied Learning
by: Tan, Hengkai, et al.
Published: (2024)
by: Tan, Hengkai, et al.
Published: (2024)
Agentic Self-Evolutionary Replanning for Embodied Navigation
by: Li, Guoliang, et al.
Published: (2026)
by: Li, Guoliang, et al.
Published: (2026)
Similar Items
-
NavQ: Learning a Q-Model for Foresighted Vision-and-Language Navigation
by: Xu, Peiran, et al.
Published: (2025) -
RoboAgent: Chaining Basic Capabilities for Embodied Task Planning
by: Xu, Peiran, et al.
Published: (2026) -
RotVLA: Rotational Latent Action for Vision-Language-Action Model
by: Li, Qiwei, et al.
Published: (2026) -
ProgressVLA: Progress-Guided Diffusion Policy for Vision-Language Robotic Manipulation
by: Yan, Hongyu, et al.
Published: (2026) -
Memory Centric Power Allocation for Multi-Agent Embodied Question Answering
by: Li, Chengyang, et al.
Published: (2026)