Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Daubt, Guihlerme, Redder, Adrian
Format:	Preprint
Published:	2026
Subjects:	Systems and Control Machine Learning
Online Access:	https://arxiv.org/abs/2603.24241
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915890797215744
author	Daubt, Guihlerme Redder, Adrian
author_facet	Daubt, Guihlerme Redder, Adrian
contents	Safe navigation in complex environments remains a central challenge for reinforcement learning (RL) in robotics. This paper introduces Continuous Space-Time Empowerment for Physics-informed (C-STEP) safe RL, a novel measure of agent-centric safety tailored to deterministic, continuous domains. This measure can be used to design physics-informed intrinsic rewards by augmenting positive navigation reward functions. The reward incorporates the agents internal states (e.g., initial velocity) and forward dynamics to differentiate safe from risky behavior. By integrating C-STEP with navigation rewards, we obtain an intrinsic reward function that jointly optimizes task completion and collision avoidance. Numerical results demonstrate fewer collisions, reduced proximity to obstacles, and only marginal increases in travel time. Overall, C-STEP offers an interpretable, physics-informed approach to reward shaping in RL, contributing to safety for agentic mobile robotic systems.
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_24241
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	C-STEP: Continuous Space-Time Empowerment for Physics-informed Safe Reinforcement Learning of Mobile Agents Daubt, Guihlerme Redder, Adrian Systems and Control Machine Learning Safe navigation in complex environments remains a central challenge for reinforcement learning (RL) in robotics. This paper introduces Continuous Space-Time Empowerment for Physics-informed (C-STEP) safe RL, a novel measure of agent-centric safety tailored to deterministic, continuous domains. This measure can be used to design physics-informed intrinsic rewards by augmenting positive navigation reward functions. The reward incorporates the agents internal states (e.g., initial velocity) and forward dynamics to differentiate safe from risky behavior. By integrating C-STEP with navigation rewards, we obtain an intrinsic reward function that jointly optimizes task completion and collision avoidance. Numerical results demonstrate fewer collisions, reduced proximity to obstacles, and only marginal increases in travel time. Overall, C-STEP offers an interpretable, physics-informed approach to reward shaping in RL, contributing to safety for agentic mobile robotic systems.
title	C-STEP: Continuous Space-Time Empowerment for Physics-informed Safe Reinforcement Learning of Mobile Agents
topic	Systems and Control Machine Learning
url	https://arxiv.org/abs/2603.24241

Similar Items