Saved in:
Bibliographic Details
Main Authors: Gurunathan, Tamil Selvan, Gangopadhyay, Aryya
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2602.19400
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911462548570112
author Gurunathan, Tamil Selvan
Gangopadhyay, Aryya
author_facet Gurunathan, Tamil Selvan
Gangopadhyay, Aryya
contents We present a coverage framework that integrates Hilbert space-filling priors into decentralized multi-robot learning and execution. We augment DQN and PPO with Hilbert-based spatial indices to structure exploration and reduce redundancy in sparse-reward environments, and we evaluate scalability in multi-robot grid coverage. We further describe a waypoint interface that converts Hilbert orderings into curvature-bounded, time-parameterized SE(2) trajectories (planar (x, y, θ)), enabling onboard feasibility on resource-constrained robots. Experiments show improvements in coverage efficiency, redundancy, and convergence speed over DQN/PPO baselines. In addition, we validate the approach on a Boston Dynamics Spot legged robot, executing the generated trajectories in indoor environments and observing reliable coverage with low redundancy. These results indicate that geometric priors improve autonomy and scalability for swarm and legged robotics.
format Preprint
id arxiv_https___arxiv_org_abs_2602_19400
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Hilbert-Augmented Reinforcement Learning for Scalable Multi-Robot Coverage and Exploration
Gurunathan, Tamil Selvan
Gangopadhyay, Aryya
Robotics
Artificial Intelligence
Multiagent Systems
We present a coverage framework that integrates Hilbert space-filling priors into decentralized multi-robot learning and execution. We augment DQN and PPO with Hilbert-based spatial indices to structure exploration and reduce redundancy in sparse-reward environments, and we evaluate scalability in multi-robot grid coverage. We further describe a waypoint interface that converts Hilbert orderings into curvature-bounded, time-parameterized SE(2) trajectories (planar (x, y, θ)), enabling onboard feasibility on resource-constrained robots. Experiments show improvements in coverage efficiency, redundancy, and convergence speed over DQN/PPO baselines. In addition, we validate the approach on a Boston Dynamics Spot legged robot, executing the generated trajectories in indoor environments and observing reliable coverage with low redundancy. These results indicate that geometric priors improve autonomy and scalability for swarm and legged robotics.
title Hilbert-Augmented Reinforcement Learning for Scalable Multi-Robot Coverage and Exploration
topic Robotics
Artificial Intelligence
Multiagent Systems
url https://arxiv.org/abs/2602.19400