| Main Authors: | Chen, Bocheng, Chen, Xi, Zi, Han, Mao, Haitao, Qi, Zimo, Zhang, Xitong, Johnson, Kristen, Liu, Guangliang |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.03079 |
Similar Items
Diagnosing Moral Reasoning Acquisition in Language Models: Pragmatics and Generalization
by: Liu, Guangliang, et al.
Published: (2025)
Pragmatic Inference for Moral Reasoning Acquisition: Generalization via Metapragmatic Links
by: Liu, Guangliang, et al.
Published: (2025)
Diagnosing the Performance Trade-off in Moral Alignment: A Case Study on Gender Stereotypes
by: Liu, Guangliang, et al.
Published: (2025)
Discourse Heuristics For Paradoxically Moral Self-Correction
by: Liu, Guangliang, et al.
Published: (2025)
On the Convergence of Moral Self-Correction in Large Language Models
by: Liu, Guangliang, et al.
Published: (2025)
Smaller Large Language Models Can Do Moral Self-Correction
by: Liu, Guangliang, et al.
Published: (2024)
Self-correction is Not An Innate Capability in Language Models
by: Liu, Guangliang, et al.
Published: (2024)
Intrinsic Self-correction for Enhanced Morality: An Analysis of Internal Mechanisms and the Superficial Hypothesis
by: Liu, Guangliang, et al.
Published: (2024)
On the Intrinsic Self-Correction Capability of LLMs: Uncertainty and Latent Concept
by: Liu, Guangliang, et al.
Published: (2024)
A Survey to Recent Progress Towards Understanding In-Context Learning
by: Mao, Haitao, et al.
Published: (2024)
Can Large Language Models Handle Discourse Particles? A Case Study of Colloquial Malay
by: Yusoff, Mariah Al Giptiah Binte, et al.
Published: (2026)
Towards Understanding Task-agnostic Debiasing Through the Lenses of Intrinsic Bias and Forgetfulness
by: Liu, Guangliang, et al.
Published: (2024)
Deactivating Refusal Triggers: Understanding and Mitigating Overrefusal in Safety Alignment
by: Xue, Zhiyu, et al.
Published: (2026)
No Free Lunch for Defending Against Prefilling Attack by In-Context Learning
by: Xue, Zhiyu, et al.
Published: (2024)
Diagnosing Failures in Large Language Models' Answers: Integrating Error Attribution into Evaluation Framework
by: Xu, Zishan, et al.
Published: (2025)
CRITICTOOL: Evaluating Self-Critique Capabilities of Large Language Models in Tool-Calling Error Scenarios
by: Huang, Shiting, et al.
Published: (2025)
Label-free Node Classification on Graphs with Large Language Models (LLMS)
by: Chen, Zhikai, et al.
Published: (2023)
Are Language Models Sensitive to Morally Irrelevant Distractors?
by: Shaw, Andrew, et al.
Published: (2026)
The Dark Side of Human Feedback: Poisoning Large Language Models via User Inputs
by: Chen, Bocheng, et al.
Published: (2024)
BLADE: Enhancing Black-box Large Language Models with Small Domain-Specific Models
by: Li, Haitao, et al.
Published: (2024)
Language Acquisition Device in Large Language Models
by: Mita, Masato, et al.
Published: (2026)
ASR Error Correction using Large Language Models
by: Ma, Rao, et al.
Published: (2024)
Role and Relevance of the Learners’ Errors in Second Language Acquisition
by: Dr. Vinay Kumar Singh
Published: (2017)
Large Language Models Are State-of-the-Art Evaluator for Grammatical Error Correction
by: Kobayashi, Masamune, et al.
Published: (2024)
Rethinking the Roles of Large Language Models in Chinese Grammatical Error Correction
by: Li, Yinghui, et al.
Published: (2024)
Computational Reasoning of Large Language Models
by: Wu, Haitao, et al.
Published: (2025)
Towards Robust Instruction Tuning on Multimodal Large Language Models
by: Han, Wei, et al.
Published: (2024)
A Language-agnostic Model of Child Language Acquisition
by: Mahon, Louis, et al.
Published: (2024)
SMRC: Aligning Large Language Models with Student Reasoning for Mathematical Error Correction
by: Zeng, Biaojie, et al.
Published: (2025)
Inverse-Q*: Token Level Reinforcement Learning for Aligning Large Language Models Without Preference Data
by: Xia, Han, et al.
Published: (2024)
Large Language Models based ASR Error Correction for Child Conversations
by: Xu, Anfeng, et al.
Published: (2025)
MedGo: A Chinese Medical Large Language Model
by: Zhang, Haitao, et al.
Published: (2024)
KRAIL: A Knowledge-Driven Framework for Base Human Reliability Analysis Integrating IDHEAS and Large Language Models
by: Xiao, Xingyu, et al.
Published: (2024)
Graph Machine Learning in the Era of Large Language Models (LLMs)
by: Wang, Shijie, et al.
Published: (2024)
HSKBenchmark: Modeling and Benchmarking Chinese Second Language Acquisition in Large Language Models through Curriculum Tuning
by: Yang, Qihao, et al.
Published: (2025)
Prompting Large Language Models with Human Error Markings for Self-Correcting Machine Translation
by: Berger, Nathaniel, et al.
Published: (2024)
Evaluating the Capability of Large-scale Language Models on Chinese Grammatical Error Correction Task
by: Qu, Fanyi, et al.
Published: (2023)
Toward Honest Language Models for Deductive Reasoning
by: Liu, Jiarui, et al.
Published: (2025)
Jailbreaking Large Language Models with Morality Attacks
by: Su, Ying, et al.
Published: (2026)
Generative Speech Recognition Error Correction with Large Language Models and Task-Activating Prompting
by: Yang, Chao-Han Huck, et al.
Published: (2023)