Saved in:
Bibliographic Details
Main Author: Alcaraz, Benoît
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.16651
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915931079311360
author Alcaraz, Benoît
author_facet Alcaraz, Benoît
contents In the past decade, artificial intelligence (AI) has developed quickly. With this rapid progression came the need for systems capable of complying with the rules and norms of our society so that they can be successfully and safely integrated into our daily lives. Inspired by the story of Pinocchio in ``Le avventure di Pinocchio - Storia di un burattino'', this thesis proposes a pipeline that addresses the problem of developing norm compliant and context-aware agents. Building on the AJAR, Jiminy, and NGRL architectures, the work introduces \pino, a hybrid model in which reinforcement learning agents are supervised by argumentation-based normative advisors. In order to make this pipeline operational, this thesis also presents a novel algorithm for automatically extracting the arguments and relationships that underlie the advisors' decisions. Finally, this thesis investigates the phenomenon of \textit{norm avoidance}, providing a definition and a mitigation strategy within the context of reinforcement learning agents. Each component of the pipeline is empirically evaluated. The thesis concludes with a discussion of related work, current limitations, and directions for future research.
format Preprint
id arxiv_https___arxiv_org_abs_2603_16651
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle What if Pinocchio Were a Reinforcement Learning Agent: A Normative End-to-End Pipeline
Alcaraz, Benoît
Artificial Intelligence
Multiagent Systems
In the past decade, artificial intelligence (AI) has developed quickly. With this rapid progression came the need for systems capable of complying with the rules and norms of our society so that they can be successfully and safely integrated into our daily lives. Inspired by the story of Pinocchio in ``Le avventure di Pinocchio - Storia di un burattino'', this thesis proposes a pipeline that addresses the problem of developing norm compliant and context-aware agents. Building on the AJAR, Jiminy, and NGRL architectures, the work introduces \pino, a hybrid model in which reinforcement learning agents are supervised by argumentation-based normative advisors. In order to make this pipeline operational, this thesis also presents a novel algorithm for automatically extracting the arguments and relationships that underlie the advisors' decisions. Finally, this thesis investigates the phenomenon of \textit{norm avoidance}, providing a definition and a mitigation strategy within the context of reinforcement learning agents. Each component of the pipeline is empirically evaluated. The thesis concludes with a discussion of related work, current limitations, and directions for future research.
title What if Pinocchio Were a Reinforcement Learning Agent: A Normative End-to-End Pipeline
topic Artificial Intelligence
Multiagent Systems
url https://arxiv.org/abs/2603.16651