Saved in:
Bibliographic Details
Main Authors: Liu, Guangliang, Qi, Zimo, Zhang, Xitong, Jiang, Lei, Johnson, Kristen Marie
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.16600
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866908581515755520
author Liu, Guangliang
Qi, Zimo
Zhang, Xitong
Jiang, Lei
Johnson, Kristen Marie
author_facet Liu, Guangliang
Qi, Zimo
Zhang, Xitong
Jiang, Lei
Johnson, Kristen Marie
contents Ensuring that Large Language Models (LLMs) return just responses which adhere to societal values is crucial for their broader application. Prior research has shown that LLMs often fail to perform satisfactorily on tasks requiring moral cognizance, such as ethics-based judgments. While current approaches have focused on fine-tuning LLMs with curated datasets to improve their capabilities on such tasks, choosing the optimal learning paradigm to enhance the ethical responses of LLMs remains an open research debate. In this work, we aim to address this fundamental question: can current learning paradigms enable LLMs to acquire sufficient moral reasoning capabilities? Drawing from distributional semantics theory and the pragmatic nature of moral discourse, our analysis indicates that performance improvements follow a mechanism similar to that of semantic-level tasks, and therefore remain affected by the pragmatic nature of morals latent in discourse, a phenomenon we name the pragmatic dilemma. We conclude that this pragmatic dilemma imposes significant limitations on the generalization ability of current learning paradigms, making it the primary bottleneck for moral reasoning acquisition in LLMs.
format Preprint
id arxiv_https___arxiv_org_abs_2502_16600
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Diagnosing Moral Reasoning Acquisition in Language Models: Pragmatics and Generalization
Liu, Guangliang
Qi, Zimo
Zhang, Xitong
Jiang, Lei
Johnson, Kristen Marie
Computation and Language
Ensuring that Large Language Models (LLMs) return just responses which adhere to societal values is crucial for their broader application. Prior research has shown that LLMs often fail to perform satisfactorily on tasks requiring moral cognizance, such as ethics-based judgments. While current approaches have focused on fine-tuning LLMs with curated datasets to improve their capabilities on such tasks, choosing the optimal learning paradigm to enhance the ethical responses of LLMs remains an open research debate. In this work, we aim to address this fundamental question: can current learning paradigms enable LLMs to acquire sufficient moral reasoning capabilities? Drawing from distributional semantics theory and the pragmatic nature of moral discourse, our analysis indicates that performance improvements follow a mechanism similar to that of semantic-level tasks, and therefore remain affected by the pragmatic nature of morals latent in discourse, a phenomenon we name the pragmatic dilemma. We conclude that this pragmatic dilemma imposes significant limitations on the generalization ability of current learning paradigms, making it the primary bottleneck for moral reasoning acquisition in LLMs.
title Diagnosing Moral Reasoning Acquisition in Language Models: Pragmatics and Generalization
topic Computation and Language
url https://arxiv.org/abs/2502.16600