Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Walters, Michael, Kaufmann, Rafael, Sefas, Justice, Kopinski, Thomas
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence Machine Learning Multiagent Systems Data Analysis, Statistics and Probability
Online Access:	https://arxiv.org/abs/2502.04249
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929700932157440
author	Walters, Michael Kaufmann, Rafael Sefas, Justice Kopinski, Thomas
author_facet	Walters, Michael Kaufmann, Rafael Sefas, Justice Kopinski, Thomas
contents	We investigate the Free Energy Principle as a foundation for measuring risk in agentic and multi-agent systems. From these principles we introduce a Cumulative Risk Exposure metric that is flexible to differing contexts and needs. We contrast this to other popular theories for safe AI that hinge on massive amounts of data or describing arbitrarily complex world models. In our framework, stakeholders need only specify their preferences over system outcomes, providing straightforward and transparent decision rules for risk governance and mitigation. This framework naturally accounts for uncertainty in both world model and preference model, allowing for decision-making that is epistemically and axiologically humble, parsimonious, and future-proof. We demonstrate this novel approach in a simplified autonomous vehicle environment with multi-agent vehicles whose driving policies are mediated by gatekeepers that evaluate, in an online fashion, the risk to the collective safety in their neighborhood, and intervene through each vehicle's policy when appropriate. We show that the introduction of gatekeepers in an AV fleet, even at low penetration, can generate significant positive externalities in terms of increased system safety.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_04249
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Free Energy Risk Metrics for Systemically Safe AI: Gatekeeping Multi-Agent Study Walters, Michael Kaufmann, Rafael Sefas, Justice Kopinski, Thomas Artificial Intelligence Machine Learning Multiagent Systems Data Analysis, Statistics and Probability We investigate the Free Energy Principle as a foundation for measuring risk in agentic and multi-agent systems. From these principles we introduce a Cumulative Risk Exposure metric that is flexible to differing contexts and needs. We contrast this to other popular theories for safe AI that hinge on massive amounts of data or describing arbitrarily complex world models. In our framework, stakeholders need only specify their preferences over system outcomes, providing straightforward and transparent decision rules for risk governance and mitigation. This framework naturally accounts for uncertainty in both world model and preference model, allowing for decision-making that is epistemically and axiologically humble, parsimonious, and future-proof. We demonstrate this novel approach in a simplified autonomous vehicle environment with multi-agent vehicles whose driving policies are mediated by gatekeepers that evaluate, in an online fashion, the risk to the collective safety in their neighborhood, and intervene through each vehicle's policy when appropriate. We show that the introduction of gatekeepers in an AV fleet, even at low penetration, can generate significant positive externalities in terms of increased system safety.
title	Free Energy Risk Metrics for Systemically Safe AI: Gatekeeping Multi-Agent Study
topic	Artificial Intelligence Machine Learning Multiagent Systems Data Analysis, Statistics and Probability
url	https://arxiv.org/abs/2502.04249

Similar Items