Saved in:
| Main Authors: | Lu, Yi-Long, Song, Jiajun, Wang, Wei |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.27328 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Systematic Bias in Large Language Models: Discrepant Response Patterns in Binary vs. Continuous Judgment Tasks
by: Lu, Yi-Long, et al.
Published: (2025)
by: Lu, Yi-Long, et al.
Published: (2025)
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model
by: Li, Zhaowei, et al.
Published: (2024)
by: Li, Zhaowei, et al.
Published: (2024)
Do Theory of Mind Benchmarks Need Explicit Human-like Reasoning in Language Models?
by: Lu, Yi-Long, et al.
Published: (2025)
by: Lu, Yi-Long, et al.
Published: (2025)
Mind the Gap: The Divergence Between Human and LLM-Generated Tasks
by: Lu, Yi-Long, et al.
Published: (2025)
by: Lu, Yi-Long, et al.
Published: (2025)
Ask Again, Then Fail: Large Language Models' Vacillations in Judgment
by: Xie, Qiming, et al.
Published: (2023)
by: Xie, Qiming, et al.
Published: (2023)
Hierarchical Frequency Tagging Probe (HFTP): A Unified Approach to Investigate Syntactic Structure Representations in Large Language Models and the Human Brain
by: An, Jingmin, et al.
Published: (2025)
by: An, Jingmin, et al.
Published: (2025)
Diver: Large Language Model Decoding with Span-Level Mutual Information Verification
by: Lu, Jinliang, et al.
Published: (2024)
by: Lu, Jinliang, et al.
Published: (2024)
Fact or Guesswork? Evaluating Large Language Models' Medical Knowledge with Structured One-Hop Judgments
by: Li, Jiaxi, et al.
Published: (2025)
by: Li, Jiaxi, et al.
Published: (2025)
Incoherent Probability Judgments in Large Language Models
by: Zhu, Jian-Qiao, et al.
Published: (2024)
by: Zhu, Jian-Qiao, et al.
Published: (2024)
Pretraining Exposure Explains Popularity Judgments in Large Language Models
by: Mozafari, Jamshid, et al.
Published: (2026)
by: Mozafari, Jamshid, et al.
Published: (2026)
Do Emotions Influence Moral Judgment in Large Language Models?
by: Saim, Mohammad, et al.
Published: (2026)
by: Saim, Mohammad, et al.
Published: (2026)
Inertia in Moral and Value Judgments of Large Language Models
by: Lee, Bruce W., et al.
Published: (2024)
by: Lee, Bruce W., et al.
Published: (2024)
If Probable, Then Acceptable? Understanding Conditional Acceptability Judgments in Large Language Models
by: Orth, Jasmin, et al.
Published: (2025)
by: Orth, Jasmin, et al.
Published: (2025)
Navigating Brain Language Representations: A Comparative Analysis of Neural Language Models and Psychologically Plausible Models
by: Zhang, Yunhao, et al.
Published: (2024)
by: Zhang, Yunhao, et al.
Published: (2024)
Exploring Cultural Variations in Moral Judgments with Large Language Models
by: Mohammadi, Hadi, et al.
Published: (2025)
by: Mohammadi, Hadi, et al.
Published: (2025)
Evaluating and Optimizing Educational Content with Large Language Model Judgments
by: He-Yueya, Joy, et al.
Published: (2024)
by: He-Yueya, Joy, et al.
Published: (2024)
Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility
by: Lepori, Michael A., et al.
Published: (2025)
by: Lepori, Michael A., et al.
Published: (2025)
Aligning Large Language Models by On-Policy Self-Judgment
by: Lee, Sangkyu, et al.
Published: (2024)
by: Lee, Sangkyu, et al.
Published: (2024)
Rethinking Personalization in Large Language Models at the Token Level
by: Zhang, Chenheng, et al.
Published: (2026)
by: Zhang, Chenheng, et al.
Published: (2026)
Athena: Retrieval-augmented Legal Judgment Prediction with Large Language Models
by: Peng, Xiao, et al.
Published: (2024)
by: Peng, Xiao, et al.
Published: (2024)
ReflectRM: Boosting Generative Reward Models via Self-Reflection within a Unified Judgment Framework
by: Qin, Kai, et al.
Published: (2026)
by: Qin, Kai, et al.
Published: (2026)
Reasons to Reject? Aligning Language Models with Judgments
by: Xu, Weiwen, et al.
Published: (2023)
by: Xu, Weiwen, et al.
Published: (2023)
URPO: A Unified Reward & Policy Optimization Framework for Large Language Models
by: Lu, Songshuo, et al.
Published: (2025)
by: Lu, Songshuo, et al.
Published: (2025)
Mitigating Judgment Preference Bias in Large Language Models through Group-Based Polling
by: Liu, Shuliang, et al.
Published: (2025)
by: Liu, Shuliang, et al.
Published: (2025)
A Survey on Unlearning in Large Language Models
by: Qiu, Ruichen, et al.
Published: (2025)
by: Qiu, Ruichen, et al.
Published: (2025)
The Fragility Of Moral Judgment In Large Language Models
by: van Nuenen, Tom, et al.
Published: (2026)
by: van Nuenen, Tom, et al.
Published: (2026)
Mitigating the Threshold Priming Effect in Large Language Model-Based Relevance Judgments via Personality Infusing
by: Chen, Nuo, et al.
Published: (2025)
by: Chen, Nuo, et al.
Published: (2025)
Merge, Ensemble, and Cooperate! A Survey on Collaborative Strategies in the Era of Large Language Models
by: Lu, Jinliang, et al.
Published: (2024)
by: Lu, Jinliang, et al.
Published: (2024)
Can We Use Large Language Models to Fill Relevance Judgment Holes?
by: Abbasiantaeb, Zahra, et al.
Published: (2024)
by: Abbasiantaeb, Zahra, et al.
Published: (2024)
Unified Defense for Large Language Models against Jailbreak and Fine-Tuning Attacks in Education
by: Yi, Xin, et al.
Published: (2025)
by: Yi, Xin, et al.
Published: (2025)
Interweaving Memories of a Siamese Large Language Model
by: Song, Xin, et al.
Published: (2024)
by: Song, Xin, et al.
Published: (2024)
Semantic Representation Attack against Aligned Large Language Models
by: Lian, Jiawei, et al.
Published: (2025)
by: Lian, Jiawei, et al.
Published: (2025)
Enhancing Knowledge Distillation of Large Language Models through Efficient Multi-Modal Distribution Alignment
by: Peng, Tianyu, et al.
Published: (2024)
by: Peng, Tianyu, et al.
Published: (2024)
SDGO: Self-Discrimination-Guided Optimization for Consistent Safety in Large Language Models
by: Ding, Peng, et al.
Published: (2025)
by: Ding, Peng, et al.
Published: (2025)
Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments
by: Zhou, Han, et al.
Published: (2024)
by: Zhou, Han, et al.
Published: (2024)
Exploring the Factual Consistency in Dialogue Comprehension of Large Language Models
by: She, Shuaijie, et al.
Published: (2023)
by: She, Shuaijie, et al.
Published: (2023)
SR-LLM: Rethinking the Structured Representation in Large Language Model
by: Zhang, Jiahuan, et al.
Published: (2025)
by: Zhang, Jiahuan, et al.
Published: (2025)
DPPA: Pruning Method for Large Language Model to Model Merging
by: Zhu, Yaochen, et al.
Published: (2024)
by: Zhu, Yaochen, et al.
Published: (2024)
Towards a Unified View of Preference Learning for Large Language Models: A Survey
by: Gao, Bofei, et al.
Published: (2024)
by: Gao, Bofei, et al.
Published: (2024)
Large Language Models as Zero-Shot Keyphrase Extractors: A Preliminary Empirical Study
by: Song, Mingyang, et al.
Published: (2023)
by: Song, Mingyang, et al.
Published: (2023)
Similar Items
-
Systematic Bias in Large Language Models: Discrepant Response Patterns in Binary vs. Continuous Judgment Tasks
by: Lu, Yi-Long, et al.
Published: (2025) -
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model
by: Li, Zhaowei, et al.
Published: (2024) -
Do Theory of Mind Benchmarks Need Explicit Human-like Reasoning in Language Models?
by: Lu, Yi-Long, et al.
Published: (2025) -
Mind the Gap: The Divergence Between Human and LLM-Generated Tasks
by: Lu, Yi-Long, et al.
Published: (2025) -
Ask Again, Then Fail: Large Language Models' Vacillations in Judgment
by: Xie, Qiming, et al.
Published: (2023)