Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Liu, Guangliang, Qi, Zimo, Zhang, Xitong, Jiang, Lei, Johnson, Kristen Marie
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2502.16600
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866908581515755520
author	Liu, Guangliang Qi, Zimo Zhang, Xitong Jiang, Lei Johnson, Kristen Marie
author_facet	Liu, Guangliang Qi, Zimo Zhang, Xitong Jiang, Lei Johnson, Kristen Marie
contents	Ensuring that Large Language Models (LLMs) return just responses which adhere to societal values is crucial for their broader application. Prior research has shown that LLMs often fail to perform satisfactorily on tasks requiring moral cognizance, such as ethics-based judgments. While current approaches have focused on fine-tuning LLMs with curated datasets to improve their capabilities on such tasks, choosing the optimal learning paradigm to enhance the ethical responses of LLMs remains an open research debate. In this work, we aim to address this fundamental question: can current learning paradigms enable LLMs to acquire sufficient moral reasoning capabilities? Drawing from distributional semantics theory and the pragmatic nature of moral discourse, our analysis indicates that performance improvements follow a mechanism similar to that of semantic-level tasks, and therefore remain affected by the pragmatic nature of morals latent in discourse, a phenomenon we name the pragmatic dilemma. We conclude that this pragmatic dilemma imposes significant limitations on the generalization ability of current learning paradigms, making it the primary bottleneck for moral reasoning acquisition in LLMs.
format	Preprint
id	arxiv_https___arxiv_org_abs_2502_16600
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Diagnosing Moral Reasoning Acquisition in Language Models: Pragmatics and Generalization Liu, Guangliang Qi, Zimo Zhang, Xitong Jiang, Lei Johnson, Kristen Marie Computation and Language Ensuring that Large Language Models (LLMs) return just responses which adhere to societal values is crucial for their broader application. Prior research has shown that LLMs often fail to perform satisfactorily on tasks requiring moral cognizance, such as ethics-based judgments. While current approaches have focused on fine-tuning LLMs with curated datasets to improve their capabilities on such tasks, choosing the optimal learning paradigm to enhance the ethical responses of LLMs remains an open research debate. In this work, we aim to address this fundamental question: can current learning paradigms enable LLMs to acquire sufficient moral reasoning capabilities? Drawing from distributional semantics theory and the pragmatic nature of moral discourse, our analysis indicates that performance improvements follow a mechanism similar to that of semantic-level tasks, and therefore remain affected by the pragmatic nature of morals latent in discourse, a phenomenon we name the pragmatic dilemma. We conclude that this pragmatic dilemma imposes significant limitations on the generalization ability of current learning paradigms, making it the primary bottleneck for moral reasoning acquisition in LLMs.
title	Diagnosing Moral Reasoning Acquisition in Language Models: Pragmatics and Generalization
topic	Computation and Language
url	https://arxiv.org/abs/2502.16600

Similar Items