Saved in:
Bibliographic Details
Main Authors: Leibo, Joel Z., Vezhnevets, Alexander Sasha, Diaz, Manfred, Agapiou, John P., Cunningham, William A., Sunehag, Peter, Cross, Logan, Koster, Raphael, Bileschi, Stanley M., Chang, Minsuk, Rahwan, Iyad, Osindero, Simon, Evans, James A.
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2603.14050
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914393541836800
author Leibo, Joel Z.
Vezhnevets, Alexander Sasha
Diaz, Manfred
Agapiou, John P.
Cunningham, William A.
Sunehag, Peter
Cross, Logan
Koster, Raphael
Bileschi, Stanley M.
Chang, Minsuk
Rahwan, Iyad
Osindero, Simon
Evans, James A.
author_facet Leibo, Joel Z.
Vezhnevets, Alexander Sasha
Diaz, Manfred
Agapiou, John P.
Cunningham, William A.
Sunehag, Peter
Cross, Logan
Koster, Raphael
Bileschi, Stanley M.
Chang, Minsuk
Rahwan, Iyad
Osindero, Simon
Evans, James A.
contents We propose a society-first theory of normative appropriateness where individuals, modeled as pre-trained actors with cognitive architectures analogous to Large Language Models (LLMs), generate behavior via predictive pattern completion. Our theory posits that individuals act by completing distributed symbolic patterns based on context, answering questions such as "What does a person such as I do in a situation such as this?". This sense-making mechanism provides a parsimonious account of the key features of human norms: their context-dependence, arbitrariness, automaticity, dynamism, and their support from social sanctioning. It challenges rational-choice theories of social norms by accounting for their key features without needing to exogenously posit scalar rewards or preference relations. By distinguishing between explicit norms, which we associate with in-context adaptation, and implicit norms, which we associate with long-term memory, the theory reconceptualizes several foundational ideas in cognitive science. In particular, it gives an alternative account to the data traditionally seen as supporting dual-process models, and it flips the role of rationality, allowing us to construe it as adherence to culturally-contingent justification standards.
format Preprint
id arxiv_https___arxiv_org_abs_2603_14050
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle A Theory of Appropriateness That Accounts for Norms of Rationality
Leibo, Joel Z.
Vezhnevets, Alexander Sasha
Diaz, Manfred
Agapiou, John P.
Cunningham, William A.
Sunehag, Peter
Cross, Logan
Koster, Raphael
Bileschi, Stanley M.
Chang, Minsuk
Rahwan, Iyad
Osindero, Simon
Evans, James A.
Neural and Evolutionary Computing
We propose a society-first theory of normative appropriateness where individuals, modeled as pre-trained actors with cognitive architectures analogous to Large Language Models (LLMs), generate behavior via predictive pattern completion. Our theory posits that individuals act by completing distributed symbolic patterns based on context, answering questions such as "What does a person such as I do in a situation such as this?". This sense-making mechanism provides a parsimonious account of the key features of human norms: their context-dependence, arbitrariness, automaticity, dynamism, and their support from social sanctioning. It challenges rational-choice theories of social norms by accounting for their key features without needing to exogenously posit scalar rewards or preference relations. By distinguishing between explicit norms, which we associate with in-context adaptation, and implicit norms, which we associate with long-term memory, the theory reconceptualizes several foundational ideas in cognitive science. In particular, it gives an alternative account to the data traditionally seen as supporting dual-process models, and it flips the role of rationality, allowing us to construe it as adherence to culturally-contingent justification standards.
title A Theory of Appropriateness That Accounts for Norms of Rationality
topic Neural and Evolutionary Computing
url https://arxiv.org/abs/2603.14050