Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Wang, Ruoyu, Li, Xinshu, Wang, Chen, Yao, Lina
Format: Preprint
Veröffentlicht: 2025
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2506.15377
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866913899920490496
author Wang, Ruoyu
Li, Xinshu
Wang, Chen
Yao, Lina
author_facet Wang, Ruoyu
Li, Xinshu
Wang, Chen
Yao, Lina
contents Visual Navigation is a core task in Embodied AI, enabling agents to navigate complex environments toward given objectives. Across diverse settings within Navigation tasks, many necessitate the modelling of sequential data accumulated from preceding time steps. While existing methods perform well, they typically process all historical observations simultaneously, overlooking the internal association structure within the data, which may limit the potential for further improvements in task performance. We address this by examining the unique characteristics of Navigation tasks through the lens of causality, introducing a causal framework to highlight the limitations of conventional sequential methods. Leveraging this insight, we propose Causality-Aware Navigation (CAN), which incorporates a Causal Understanding Module to enhance the agent's environmental understanding capability. Empirical evaluations show that our approach consistently outperforms baselines across various tasks and simulation environments. Extensive ablations studies attribute these gains to the Causal Understanding Module, which generalizes effectively in both Reinforcement and Supervised Learning settings without computational overhead.
format Preprint
id arxiv_https___arxiv_org_abs_2506_15377
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Efficient and Generalizable Environmental Understanding for Visual Navigation
Wang, Ruoyu
Li, Xinshu
Wang, Chen
Yao, Lina
Artificial Intelligence
Visual Navigation is a core task in Embodied AI, enabling agents to navigate complex environments toward given objectives. Across diverse settings within Navigation tasks, many necessitate the modelling of sequential data accumulated from preceding time steps. While existing methods perform well, they typically process all historical observations simultaneously, overlooking the internal association structure within the data, which may limit the potential for further improvements in task performance. We address this by examining the unique characteristics of Navigation tasks through the lens of causality, introducing a causal framework to highlight the limitations of conventional sequential methods. Leveraging this insight, we propose Causality-Aware Navigation (CAN), which incorporates a Causal Understanding Module to enhance the agent's environmental understanding capability. Empirical evaluations show that our approach consistently outperforms baselines across various tasks and simulation environments. Extensive ablations studies attribute these gains to the Causal Understanding Module, which generalizes effectively in both Reinforcement and Supervised Learning settings without computational overhead.
title Efficient and Generalizable Environmental Understanding for Visual Navigation
topic Artificial Intelligence
url https://arxiv.org/abs/2506.15377