Staff View: Whose Alignment? Comparing LLM Process Alignment Across Diverse Organizational Decision Contexts :: Library Catalog

Bibliographic Details
Main Authors:	Weller, Niklas; Barkett, Emilio
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.25256

_version_	1866917530233208832
author	Weller, Niklas; Barkett, Emilio
author_facet	Weller, Niklas; Barkett, Emilio
contents	Aligning AI systems with organizational decision-making is typically framed as a single-target problem: make the model behave like the organization. We argue this framing obscures a deeper pluralistic challenge. We rely on a decision-policy capturing method to measure process alignment: whether an LLM weights information as the organization does, not merely whether it reaches the same conclusions. Applying this method to ECHR Article 6 decisions, process alignment strongly predicts output accuracy (r = 0.85, p < .001) and externalization substantially improves alignment for poorly-aligned models. Applying it to German consumer credit decisions, this relationship collapses (r = 0.15, p = .60): interventions produce inconsistent effects and the benchmark encodes potentially discriminatory historical patterns. This contrast is itself a pluralistic alignment finding: in contested domains, high process alignment is neither achievable via externalization nor unconditionally desirable. Output agreement alone cannot distinguish a model that has internalized an organizational policy from one that merely approximates its outcomes; process-level measurement is a necessary component of any pluralistic alignment evaluation.
format	Preprint
id	arxiv_https___arxiv_org_abs_2605_25256
institution	arXiv
publishDate	2026
record_format	arxiv
title	Whose Alignment? Comparing LLM Process Alignment Across Diverse Organizational Decision Contexts
topic	Artificial Intelligence
url	https://arxiv.org/abs/2605.25256

Similar Items