Saved in:
Bibliographic Details
Main Authors: Walczak, Mikolaj, Aalishah, Romina, Mackey, Wyatt, Story, Brittany, Boothe Jr., David L., Waytowich, Nicholas, Lin, Xiaomin, Mohsenin, Tinoosh
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2506.03046
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913872482402304
author Walczak, Mikolaj
Aalishah, Romina
Mackey, Wyatt
Story, Brittany
Boothe Jr., David L.
Waytowich, Nicholas
Lin, Xiaomin
Mohsenin, Tinoosh
author_facet Walczak, Mikolaj
Aalishah, Romina
Mackey, Wyatt
Story, Brittany
Boothe Jr., David L.
Waytowich, Nicholas
Lin, Xiaomin
Mohsenin, Tinoosh
contents Deep reinforcement learning agents are often fragile while humans remain adaptive and flexible to varying scenarios. To bridge this gap, we present EDEN, a biologically inspired navigation framework that integrates learned entorhinal-like grid cell representations and reinforcement learning to enable autonomous navigation. Inspired by the mammalian entorhinal-hippocampal system, EDEN allows agents to perform path integration and vector-based navigation using visual and motion sensor data. At the core of EDEN is a grid cell encoder that transforms egocentric motion into periodic spatial codes, producing low-dimensional, interpretable embeddings of position. To generate these activations from raw sensory input, we combine fiducial marker detections in the lightweight MiniWorld simulator and DINO-based visual features in the high-fidelity Gazebo simulator. These spatial representations serve as input to a policy trained with Proximal Policy Optimization (PPO), enabling dynamic, goal-directed navigation. We evaluate EDEN in both MiniWorld, for rapid prototyping, and Gazebo, which offers realistic physics and perception noise. Compared to baseline agents using raw state inputs (e.g., position, velocity) or standard convolutional image encoders, EDEN achieves a 99% success rate, within the simple scenarios, and >94% within complex floorplans with occluded paths with more efficient and reliable step-wise navigation. In addition, as a replacement of ground truth activations, we present a trainable Grid Cell encoder enabling the development of periodic grid-like patterns from vision and motion sensor data, emulating the development of such patterns within biological mammals. This work represents a step toward biologically grounded spatial intelligence in robotics, bridging neural navigation principles with reinforcement learning for scalable deployment.
format Preprint
id arxiv_https___arxiv_org_abs_2506_03046
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle EDEN: Entorhinal Driven Egocentric Navigation Toward Robotic Deployment
Walczak, Mikolaj
Aalishah, Romina
Mackey, Wyatt
Story, Brittany
Boothe Jr., David L.
Waytowich, Nicholas
Lin, Xiaomin
Mohsenin, Tinoosh
Robotics
Artificial Intelligence
Deep reinforcement learning agents are often fragile while humans remain adaptive and flexible to varying scenarios. To bridge this gap, we present EDEN, a biologically inspired navigation framework that integrates learned entorhinal-like grid cell representations and reinforcement learning to enable autonomous navigation. Inspired by the mammalian entorhinal-hippocampal system, EDEN allows agents to perform path integration and vector-based navigation using visual and motion sensor data. At the core of EDEN is a grid cell encoder that transforms egocentric motion into periodic spatial codes, producing low-dimensional, interpretable embeddings of position. To generate these activations from raw sensory input, we combine fiducial marker detections in the lightweight MiniWorld simulator and DINO-based visual features in the high-fidelity Gazebo simulator. These spatial representations serve as input to a policy trained with Proximal Policy Optimization (PPO), enabling dynamic, goal-directed navigation. We evaluate EDEN in both MiniWorld, for rapid prototyping, and Gazebo, which offers realistic physics and perception noise. Compared to baseline agents using raw state inputs (e.g., position, velocity) or standard convolutional image encoders, EDEN achieves a 99% success rate, within the simple scenarios, and >94% within complex floorplans with occluded paths with more efficient and reliable step-wise navigation. In addition, as a replacement of ground truth activations, we present a trainable Grid Cell encoder enabling the development of periodic grid-like patterns from vision and motion sensor data, emulating the development of such patterns within biological mammals. This work represents a step toward biologically grounded spatial intelligence in robotics, bridging neural navigation principles with reinforcement learning for scalable deployment.
title EDEN: Entorhinal Driven Egocentric Navigation Toward Robotic Deployment
topic Robotics
Artificial Intelligence
url https://arxiv.org/abs/2506.03046