Staff View: Hilbert-Augmented Reinforcement Learning for Scalable Multi-Robot Coverage and Exploration :: Library Catalog

Bibliographic Details
Main Authors:	Gurunathan, Tamil Selvan; Gangopadhyay, Aryya
Format:	Preprint
Published:	2026
Subjects:	Robotics; Artificial Intelligence; Multiagent Systems
Online Access:	https://arxiv.org/abs/2602.19400

_version_	1866911462548570112
author	Gurunathan, Tamil Selvan; Gangopadhyay, Aryya
author_facet	Gurunathan, Tamil Selvan; Gangopadhyay, Aryya
contents	We present a coverage framework that integrates Hilbert space-filling priors into decentralized multi-robot learning and execution. We augment DQN and PPO with Hilbert-based spatial indices to structure exploration and reduce redundancy in sparse-reward environments, and we evaluate scalability in multi-robot grid coverage. We further describe a waypoint interface that converts Hilbert orderings into curvature-bounded, time-parameterized SE(2) trajectories (planar (x, y, θ)), enabling onboard feasibility on resource-constrained robots. Experiments show improvements in coverage efficiency, redundancy, and convergence speed over DQN/PPO baselines. In addition, we validate the approach on a Boston Dynamics Spot legged robot, executing the generated trajectories in indoor environments and observing reliable coverage with low redundancy. These results indicate that geometric priors improve autonomy and scalability for swarm and legged robotics.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_19400
institution	arXiv
publishDate	2026
record_format	arxiv
title	Hilbert-Augmented Reinforcement Learning for Scalable Multi-Robot Coverage and Exploration
topic	Robotics; Artificial Intelligence; Multiagent Systems
url	https://arxiv.org/abs/2602.19400
