Saved in:
| Main Authors: | Wang, Jiashuo, Wang, Haozhao, Sun, Shichao, Li, Wenjie |
|---|---|
| Format: | Preprint |
| Published: |
2023
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2310.05782 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Aligning MLLM Benchmark With Human Preferences via Structural Equation Modeling
by: Xiong, Shengwu., et al.
Published: (2025)
by: Xiong, Shengwu., et al.
Published: (2025)
CARE: Causality Reasoning for Empathetic Responses by Conditional Graph Generation
by: Wang, Jiashuo, et al.
Published: (2022)
by: Wang, Jiashuo, et al.
Published: (2022)
Aligning Large Language Models with Human Preferences through Representation Engineering
by: Liu, Wenhao, et al.
Published: (2023)
by: Liu, Wenhao, et al.
Published: (2023)
Personalized Large Language Model Assistant with Evolving Conditional Memory
by: Yuan, Ruifeng, et al.
Published: (2023)
by: Yuan, Ruifeng, et al.
Published: (2023)
Dissecting Human and LLM Preferences
by: Li, Junlong, et al.
Published: (2024)
by: Li, Junlong, et al.
Published: (2024)
SpeechAlign: Aligning Speech Generation to Human Preferences
by: Zhang, Dong, et al.
Published: (2024)
by: Zhang, Dong, et al.
Published: (2024)
SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution
by: Wang, Hanlin, et al.
Published: (2025)
by: Wang, Hanlin, et al.
Published: (2025)
How Far Are LLMs from Believable AI? A Benchmark for Evaluating the Believability of Human Behavior Simulation
by: Xiao, Yang, et al.
Published: (2023)
by: Xiao, Yang, et al.
Published: (2023)
Sentence-level Reward Model can Generalize Better for Aligning LLM from Human Preference
by: Qiu, Wenjie, et al.
Published: (2025)
by: Qiu, Wenjie, et al.
Published: (2025)
Aligning Large Language Model Behavior with Human Citation Preferences
by: Ando, Kenichiro, et al.
Published: (2026)
by: Ando, Kenichiro, et al.
Published: (2026)
AlignSum: Data Pyramid Hierarchical Fine-tuning for Aligning with Human Summarization Preference
by: Han, Yang, et al.
Published: (2024)
by: Han, Yang, et al.
Published: (2024)
Multi-modal Preference Alignment Remedies Degradation of Visual Instruction Tuning on Language Models
by: Li, Shengzhi, et al.
Published: (2024)
by: Li, Shengzhi, et al.
Published: (2024)
Aligning Large Language Models with Searcher Preferences
by: Wu, Wei, et al.
Published: (2026)
by: Wu, Wei, et al.
Published: (2026)
MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time
by: Zhang, Mozhi, et al.
Published: (2024)
by: Zhang, Mozhi, et al.
Published: (2024)
Capturing Nuanced Preferences: Preference-Aligned Distillation for Small Language Models
by: Gu, Yanggan, et al.
Published: (2025)
by: Gu, Yanggan, et al.
Published: (2025)
SPECTRA: Revealing the Full Spectrum of User Preferences via Distributional LLM Inference
by: Zhang, Luyang, et al.
Published: (2025)
by: Zhang, Luyang, et al.
Published: (2025)
Bayesian Preference Elicitation with Language Models
by: Handa, Kunal, et al.
Published: (2024)
by: Handa, Kunal, et al.
Published: (2024)
Aligning Large Language Models with Implicit Preferences from User-Generated Content
by: Tan, Zhaoxuan, et al.
Published: (2025)
by: Tan, Zhaoxuan, et al.
Published: (2025)
Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback
by: Wang, Jiashuo, et al.
Published: (2024)
by: Wang, Jiashuo, et al.
Published: (2024)
Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human States
by: Xiao, Yang, et al.
Published: (2025)
by: Xiao, Yang, et al.
Published: (2025)
Foresight Optimization for Strategic Reasoning in Large Language Models
by: Wang, Jiashuo, et al.
Published: (2026)
by: Wang, Jiashuo, et al.
Published: (2026)
Evaluating and Aligning CodeLLMs on Human Preference
by: Yang, Jian, et al.
Published: (2024)
by: Yang, Jian, et al.
Published: (2024)
Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments
by: Zhou, Han, et al.
Published: (2024)
by: Zhou, Han, et al.
Published: (2024)
FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in Large Language Models
by: Li, Yiyuan, et al.
Published: (2024)
by: Li, Yiyuan, et al.
Published: (2024)
Preference-Guided Reflective Sampling for Aligning Language Models
by: Ye, Hai, et al.
Published: (2024)
by: Ye, Hai, et al.
Published: (2024)
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators
by: Liu, Yinhong, et al.
Published: (2024)
by: Liu, Yinhong, et al.
Published: (2024)
Aligning Model Evaluations with Human Preferences: Mitigating Token Count Bias in Language Model Assessments
by: Daynauth, Roland, et al.
Published: (2024)
by: Daynauth, Roland, et al.
Published: (2024)
Shallow Preference Signals: Large Language Model Aligns Even Better with Truncated Data?
by: Qi, Xuan, et al.
Published: (2025)
by: Qi, Xuan, et al.
Published: (2025)
Latent Preference Coding: Aligning Large Language Models via Discrete Latent Codes
by: Gong, Zhuocheng, et al.
Published: (2025)
by: Gong, Zhuocheng, et al.
Published: (2025)
Align Generative Artificial Intelligence with Human Preferences: A Novel Large Language Model Fine-Tuning Method for Online Review Management
by: Wang, Yanan, et al.
Published: (2026)
by: Wang, Yanan, et al.
Published: (2026)
Towards a Client-Centered Assessment of LLM Therapists by Client Simulation
by: Wang, Jiashuo, et al.
Published: (2024)
by: Wang, Jiashuo, et al.
Published: (2024)
Arch-Router: Aligning LLM Routing with Human Preferences
by: Tran, Co, et al.
Published: (2025)
by: Tran, Co, et al.
Published: (2025)
Vibe Checker: Aligning Code Evaluation with Human Preference
by: Zhong, Ming, et al.
Published: (2025)
by: Zhong, Ming, et al.
Published: (2025)
BATON: Aligning Text-to-Audio Model with Human Preference Feedback
by: Liao, Huan, et al.
Published: (2024)
by: Liao, Huan, et al.
Published: (2024)
TPO: Aligning Large Language Models with Multi-branch & Multi-step Preference Trees
by: Liao, Weibin, et al.
Published: (2024)
by: Liao, Weibin, et al.
Published: (2024)
Tool Retrieval Bridge: Aligning Vague Instructions with Retriever Preferences via Bridge Model
by: Chen, Kunfeng, et al.
Published: (2026)
by: Chen, Kunfeng, et al.
Published: (2026)
Preference-Oriented Supervised Fine-Tuning: Favoring Target Model Over Aligned Large Language Models
by: Fan, Yuchen, et al.
Published: (2024)
by: Fan, Yuchen, et al.
Published: (2024)
Prompt Chaining or Stepwise Prompt? Refinement in Text Summarization
by: Sun, Shichao, et al.
Published: (2024)
by: Sun, Shichao, et al.
Published: (2024)
HLPD: Aligning LLMs to Human Language Preference for Machine-Revised Text Detection
by: Dai, Fangqi, et al.
Published: (2025)
by: Dai, Fangqi, et al.
Published: (2025)
AlignCap: Aligning Speech Emotion Captioning to Human Preferences
by: Liang, Ziqi, et al.
Published: (2024)
by: Liang, Ziqi, et al.
Published: (2024)
Similar Items
-
Aligning MLLM Benchmark With Human Preferences via Structural Equation Modeling
by: Xiong, Shengwu., et al.
Published: (2025) -
CARE: Causality Reasoning for Empathetic Responses by Conditional Graph Generation
by: Wang, Jiashuo, et al.
Published: (2022) -
Aligning Large Language Models with Human Preferences through Representation Engineering
by: Liu, Wenhao, et al.
Published: (2023) -
Personalized Large Language Model Assistant with Evolving Conditional Memory
by: Yuan, Ruifeng, et al.
Published: (2023) -
Dissecting Human and LLM Preferences
by: Li, Junlong, et al.
Published: (2024)