Saved in:
Bibliographic Details
Main Authors: Hakenes, Simon, Glasmachers, Tobias
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2504.18300
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • This paper addresses the challenge of navigation in large, visually complex environments with sparse rewards. We propose a method that uses object-oriented macro actions grounded in a topological map, allowing a simple Deep Q-Network (DQN) to learn effective navigation policies. The agent builds a map by detecting objects from RGBD input and selecting discrete macro actions that correspond to navigating to these objects. This abstraction drastically reduces the complexity of the underlying reinforcement learning problem and enables generalization to unseen environments. We evaluate our approach in a photorealistic 3D simulation and show that it significantly outperforms a random baseline under both immediate and terminal reward conditions. Our results demonstrate that topological structure and macro-level abstraction can enable sample-efficient learning even from pixel data.