Vista Equipo: :: Library Catalog

Guardado en:

Detalles Bibliográficos
Autores principales:	Yüksel, Sadık Bera, Buyukkocak, Ali Tevfik, Aksaray, Derya
Formato:	Preprint
Publicado:	2026
Materias:	Robotics Machine Learning
Acceso en línea:	https://arxiv.org/abs/2603.17152
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

_version_	1866918395194114048
author	Yüksel, Sadık Bera Buyukkocak, Ali Tevfik Aksaray, Derya
author_facet	Yüksel, Sadık Bera Buyukkocak, Ali Tevfik Aksaray, Derya
contents	Reinforcement Learning (RL) has shown promise in various robotics applications, yet its deployment on real systems is still limited due to safety and operational constraints. The safe RL field has gained considerable attention in recent years, which focuses on imposing safety constraints throughout the learning process. However, real systems often require more complex constraints than just safety, such as periodic recharging or time-bounded visits to specific regions. Imposing such spatio-temporal tasks during learning still remains a challenge. Signal Temporal Logic (STL) is a formal language for specifying temporal properties of real-valued signals and provides a way to express such complex tasks. In this paper, we propose a framework that leverages sequential control barrier functions and model-free RL to ensure that the given STL tasks are satisfied throughout the learning process. Our method extends beyond traditional safety constraints by enforcing rich STL specifications, which can involve visits to dynamic targets with unknown trajectories. We also demonstrate the effectiveness of our framework through various simulations.
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_17152
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	Shielded Reinforcement Learning Under Dynamic Temporal Logic Constraints Yüksel, Sadık Bera Buyukkocak, Ali Tevfik Aksaray, Derya Robotics Machine Learning Reinforcement Learning (RL) has shown promise in various robotics applications, yet its deployment on real systems is still limited due to safety and operational constraints. The safe RL field has gained considerable attention in recent years, which focuses on imposing safety constraints throughout the learning process. However, real systems often require more complex constraints than just safety, such as periodic recharging or time-bounded visits to specific regions. Imposing such spatio-temporal tasks during learning still remains a challenge. Signal Temporal Logic (STL) is a formal language for specifying temporal properties of real-valued signals and provides a way to express such complex tasks. In this paper, we propose a framework that leverages sequential control barrier functions and model-free RL to ensure that the given STL tasks are satisfied throughout the learning process. Our method extends beyond traditional safety constraints by enforcing rich STL specifications, which can involve visits to dynamic targets with unknown trajectories. We also demonstrate the effectiveness of our framework through various simulations.
title	Shielded Reinforcement Learning Under Dynamic Temporal Logic Constraints
topic	Robotics Machine Learning
url	https://arxiv.org/abs/2603.17152

Ejemplares similares