Saved in:
| Main Authors: | Ripa, Michael, Davies, Jim |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.27338 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Moral Sycophancy in Vision Language Models
by: Rabby, Shadman, et al.
Published: (2026)
by: Rabby, Shadman, et al.
Published: (2026)
Are Large Language Models Moral Hypocrites? A Study Based on Moral Foundations
by: Nunes, José Luiz, et al.
Published: (2024)
by: Nunes, José Luiz, et al.
Published: (2024)
The Moral Mind(s) of Large Language Models
by: Seror, Avner
Published: (2024)
by: Seror, Avner
Published: (2024)
Adversarial Moral Stress Testing of Large Language Models
by: Jamshidi, Saeid, et al.
Published: (2026)
by: Jamshidi, Saeid, et al.
Published: (2026)
MoralBench: Moral Evaluation of LLMs
by: Ji, Jianchao, et al.
Published: (2024)
by: Ji, Jianchao, et al.
Published: (2024)
Do Language Models Understand Morality? Towards a Robust Detection of Moral Content
by: Bulla, Luana, et al.
Published: (2024)
by: Bulla, Luana, et al.
Published: (2024)
Tracing Moral Foundations in Large Language Models
by: Yu, Chenxiao, et al.
Published: (2026)
by: Yu, Chenxiao, et al.
Published: (2026)
Mechanistic Origin of Moral Indifference in Language Models
by: Li, Lingyu, et al.
Published: (2026)
by: Li, Lingyu, et al.
Published: (2026)
Differences in the Moral Foundations of Large Language Models
by: Kirgis, Peter
Published: (2025)
by: Kirgis, Peter
Published: (2025)
Visual Distraction Undermines Moral Reasoning in Vision-Language Models
by: Yang, Xinyi, et al.
Published: (2026)
by: Yang, Xinyi, et al.
Published: (2026)
MM-MoralBench: A MultiModal Moral Evaluation Benchmark for Large Vision-Language Models
by: Yan, Bei, et al.
Published: (2024)
by: Yan, Bei, et al.
Published: (2024)
One Model, Many Morals: Uncovering Cross-Linguistic Misalignments in Computational Moral Reasoning
by: Farid, Sualeha, et al.
Published: (2025)
by: Farid, Sualeha, et al.
Published: (2025)
Morality is Contextual: Learning Interpretable Moral Contexts from Human Data with Probabilistic Clustering and Large Language Models
by: Morlat, Geoffroy, et al.
Published: (2025)
by: Morlat, Geoffroy, et al.
Published: (2025)
Histoires Morales: A French Dataset for Assessing Moral Alignment
by: Leteno, Thibaud, et al.
Published: (2025)
by: Leteno, Thibaud, et al.
Published: (2025)
Normative Evaluation of Large Language Models with Everyday Moral Dilemmas
by: Sachdeva, Pratik S., et al.
Published: (2025)
by: Sachdeva, Pratik S., et al.
Published: (2025)
Exploring Cultural Variations in Moral Judgments with Large Language Models
by: Mohammadi, Hadi, et al.
Published: (2025)
by: Mohammadi, Hadi, et al.
Published: (2025)
SaGE: Evaluating Moral Consistency in Large Language Models
by: Bonagiri, Vamshi Krishna, et al.
Published: (2024)
by: Bonagiri, Vamshi Krishna, et al.
Published: (2024)
Large Language Models as Mirrors of Societal Moral Standards
by: Papadopoulou, Evi, et al.
Published: (2024)
by: Papadopoulou, Evi, et al.
Published: (2024)
Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models
by: Kasat, Aryan, et al.
Published: (2026)
by: Kasat, Aryan, et al.
Published: (2026)
Inertia in Moral and Value Judgments of Large Language Models
by: Lee, Bruce W., et al.
Published: (2024)
by: Lee, Bruce W., et al.
Published: (2024)
MOSAIC: Unveiling the Moral, Social and Individual Dimensions of Large Language Models
by: Coppolillo, Erica, et al.
Published: (2026)
by: Coppolillo, Erica, et al.
Published: (2026)
The Moral Consistency Pipeline: Continuous Ethical Evaluation for Large Language Models
by: Jamshidi, Saeid, et al.
Published: (2025)
by: Jamshidi, Saeid, et al.
Published: (2025)
Moral Persuasion in Large Language Models: Evaluating Susceptibility and Ethical Alignment
by: Huang, Allison, et al.
Published: (2024)
by: Huang, Allison, et al.
Published: (2024)
ProMoral-Bench: Evaluating Prompting Strategies for Moral Reasoning and Safety in LLMs
by: Thomas, Rohan Subramanian, et al.
Published: (2026)
by: Thomas, Rohan Subramanian, et al.
Published: (2026)
Moral Lenses, Political Coordinates: Towards Ideological Positioning of Morally Conditioned LLMs
by: Yuan, Chenchen, et al.
Published: (2026)
by: Yuan, Chenchen, et al.
Published: (2026)
Normative Moral Pluralism for AI: A Framework for Deliberation in Complex Moral Contexts
by: Yaacov, David-Doron
Published: (2025)
by: Yaacov, David-Doron
Published: (2025)
Moral Agency in Silico: Exploring Free Will in Large Language Models
by: Porter, Morgan S.
Published: (2024)
by: Porter, Morgan S.
Published: (2024)
Inducing Human-like Biases in Moral Reasoning Language Models
by: Karpov, Artem, et al.
Published: (2024)
by: Karpov, Artem, et al.
Published: (2024)
The Emergent Moral Ecology: A Novel Framework for AI Moral Responsibility
by: Tan, Kwan Hong
Published: (2025)
by: Tan, Kwan Hong
Published: (2025)
The Morality of Probability: How Implicit Moral Biases in LLMs May Shape the Future of Human-AI Symbiosis
by: O'Doherty, Eoin, et al.
Published: (2025)
by: O'Doherty, Eoin, et al.
Published: (2025)
MoralReason: Generalizable Moral Decision Alignment For LLM Agents Using Reasoning-Level Reinforcement Learning
by: An, Zhiyu, et al.
Published: (2025)
by: An, Zhiyu, et al.
Published: (2025)
Why Machines Can't Be Moral: Turing's Halting Problem and the Moral Limits of Artificial Intelligence
by: Passamonti, Massimo
Published: (2024)
by: Passamonti, Massimo
Published: (2024)
The Moral Turing Test: Evaluating Human-LLM Alignment in Moral Decision-Making
by: Garcia, Basile, et al.
Published: (2024)
by: Garcia, Basile, et al.
Published: (2024)
An Algebraic Exposition of the Theory of Dyadic Morality
by: Varshney, Kush R.
Published: (2026)
by: Varshney, Kush R.
Published: (2026)
Addressing Moral Uncertainty using Large Language Models for Ethical Decision-Making
by: Dubey, Rohit K., et al.
Published: (2025)
by: Dubey, Rohit K., et al.
Published: (2025)
Whose Morality Do They Speak? Unraveling Cultural Bias in Multilingual Language Models
by: Aksoy, Meltem
Published: (2024)
by: Aksoy, Meltem
Published: (2024)
CMoralEval: A Moral Evaluation Benchmark for Chinese Large Language Models
by: Yu, Linhao, et al.
Published: (2024)
by: Yu, Linhao, et al.
Published: (2024)
Do VLMs Have a Moral Backbone? A Study on the Fragile Morality of Vision-Language Models
by: Liu, Zhining, et al.
Published: (2026)
by: Liu, Zhining, et al.
Published: (2026)
Contesting Artificial Moral Agents
by: Aijaz, Aisha
Published: (2026)
by: Aijaz, Aisha
Published: (2026)
MoralityGym: A Benchmark for Evaluating Hierarchical Moral Alignment in Sequential Decision-Making Agents
by: Rosen, Simon, et al.
Published: (2026)
by: Rosen, Simon, et al.
Published: (2026)
Similar Items
-
Moral Sycophancy in Vision Language Models
by: Rabby, Shadman, et al.
Published: (2026) -
Are Large Language Models Moral Hypocrites? A Study Based on Moral Foundations
by: Nunes, José Luiz, et al.
Published: (2024) -
The Moral Mind(s) of Large Language Models
by: Seror, Avner
Published: (2024) -
Adversarial Moral Stress Testing of Large Language Models
by: Jamshidi, Saeid, et al.
Published: (2026) -
MoralBench: Moral Evaluation of LLMs
by: Ji, Jianchao, et al.
Published: (2024)