:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Lu, Yi-Long, Song, Jiajun, Wang, Wei
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2510.27328
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Systematic Bias in Large Language Models: Discrepant Response Patterns in Binary vs. Continuous Judgment Tasks
by: Lu, Yi-Long, et al.
Published: (2025)

UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model
by: Li, Zhaowei, et al.
Published: (2024)

Do Theory of Mind Benchmarks Need Explicit Human-like Reasoning in Language Models?
by: Lu, Yi-Long, et al.
Published: (2025)

Mind the Gap: The Divergence Between Human and LLM-Generated Tasks
by: Lu, Yi-Long, et al.
Published: (2025)

Ask Again, Then Fail: Large Language Models' Vacillations in Judgment
by: Xie, Qiming, et al.
Published: (2023)

Hierarchical Frequency Tagging Probe (HFTP): A Unified Approach to Investigate Syntactic Structure Representations in Large Language Models and the Human Brain
by: An, Jingmin, et al.
Published: (2025)

Diver: Large Language Model Decoding with Span-Level Mutual Information Verification
by: Lu, Jinliang, et al.
Published: (2024)

Fact or Guesswork? Evaluating Large Language Models' Medical Knowledge with Structured One-Hop Judgments
by: Li, Jiaxi, et al.
Published: (2025)

Incoherent Probability Judgments in Large Language Models
by: Zhu, Jian-Qiao, et al.
Published: (2024)

Pretraining Exposure Explains Popularity Judgments in Large Language Models
by: Mozafari, Jamshid, et al.
Published: (2026)

Do Emotions Influence Moral Judgment in Large Language Models?
by: Saim, Mohammad, et al.
Published: (2026)

Inertia in Moral and Value Judgments of Large Language Models
by: Lee, Bruce W., et al.
Published: (2024)

If Probable, Then Acceptable? Understanding Conditional Acceptability Judgments in Large Language Models
by: Orth, Jasmin, et al.
Published: (2025)

Navigating Brain Language Representations: A Comparative Analysis of Neural Language Models and Psychologically Plausible Models
by: Zhang, Yunhao, et al.
Published: (2024)

Exploring Cultural Variations in Moral Judgments with Large Language Models
by: Mohammadi, Hadi, et al.
Published: (2025)

Evaluating and Optimizing Educational Content with Large Language Model Judgments
by: He-Yueya, Joy, et al.
Published: (2024)

Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility
by: Lepori, Michael A., et al.
Published: (2025)

Aligning Large Language Models by On-Policy Self-Judgment
by: Lee, Sangkyu, et al.
Published: (2024)

Rethinking Personalization in Large Language Models at the Token Level
by: Zhang, Chenheng, et al.
Published: (2026)

Athena: Retrieval-augmented Legal Judgment Prediction with Large Language Models
by: Peng, Xiao, et al.
Published: (2024)

ReflectRM: Boosting Generative Reward Models via Self-Reflection within a Unified Judgment Framework
by: Qin, Kai, et al.
Published: (2026)

Reasons to Reject? Aligning Language Models with Judgments
by: Xu, Weiwen, et al.
Published: (2023)

URPO: A Unified Reward & Policy Optimization Framework for Large Language Models
by: Lu, Songshuo, et al.
Published: (2025)

Mitigating Judgment Preference Bias in Large Language Models through Group-Based Polling
by: Liu, Shuliang, et al.
Published: (2025)

A Survey on Unlearning in Large Language Models
by: Qiu, Ruichen, et al.
Published: (2025)

The Fragility Of Moral Judgment In Large Language Models
by: van Nuenen, Tom, et al.
Published: (2026)

Mitigating the Threshold Priming Effect in Large Language Model-Based Relevance Judgments via Personality Infusing
by: Chen, Nuo, et al.
Published: (2025)

Merge, Ensemble, and Cooperate! A Survey on Collaborative Strategies in the Era of Large Language Models
by: Lu, Jinliang, et al.
Published: (2024)

Can We Use Large Language Models to Fill Relevance Judgment Holes?
by: Abbasiantaeb, Zahra, et al.
Published: (2024)

Unified Defense for Large Language Models against Jailbreak and Fine-Tuning Attacks in Education
by: Yi, Xin, et al.
Published: (2025)

Interweaving Memories of a Siamese Large Language Model
by: Song, Xin, et al.
Published: (2024)

Semantic Representation Attack against Aligned Large Language Models
by: Lian, Jiawei, et al.
Published: (2025)

Enhancing Knowledge Distillation of Large Language Models through Efficient Multi-Modal Distribution Alignment
by: Peng, Tianyu, et al.
Published: (2024)

SDGO: Self-Discrimination-Guided Optimization for Consistent Safety in Large Language Models
by: Ding, Peng, et al.
Published: (2025)

Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments
by: Zhou, Han, et al.
Published: (2024)

Exploring the Factual Consistency in Dialogue Comprehension of Large Language Models
by: She, Shuaijie, et al.
Published: (2023)

SR-LLM: Rethinking the Structured Representation in Large Language Model
by: Zhang, Jiahuan, et al.
Published: (2025)

DPPA: Pruning Method for Large Language Model to Model Merging
by: Zhu, Yaochen, et al.
Published: (2024)

Towards a Unified View of Preference Learning for Large Language Models: A Survey
by: Gao, Bofei, et al.
Published: (2024)

Large Language Models as Zero-Shot Keyphrase Extractors: A Preliminary Empirical Study
by: Song, Mingyang, et al.
Published: (2023)