:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Zhao, Yang, Wang, Yixin, Yin, Mingzhang
Natura:	Preprint
Pubblicazione:	2024
Soggetti:	Computation and Language
Accesso online:	https://arxiv.org/abs/2410.04346
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

LiPO: Listwise Preference Optimization through Learning-to-Rank
di: Liu, Tianqi, et al.
Pubblicazione: (2024)

Holistic Utility Preference Learning for Listwise Alignment
di: Zhou, Jiacong, et al.
Pubblicazione: (2024)

Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models
di: Tang, Raphael, et al.
Pubblicazione: (2023)

Re-evaluating Automatic LLM System Ranking for Alignment with Human Preference
di: Gao, Mingqi, et al.
Pubblicazione: (2024)

Preference Ranking Optimization for Human Alignment
di: Song, Feifan, et al.
Pubblicazione: (2023)

Permutation-Consensus Listwise Judging for Robust Factuality Evaluation
di: Huang, Tianyi, et al.
Pubblicazione: (2026)

Rank-K: Test-Time Reasoning for Listwise Reranking
di: Yang, Eugene, et al.
Pubblicazione: (2025)

Spread Preference Annotation: Direct Preference Judgment for Efficient LLM Alignment
di: Kim, Dongyoung, et al.
Pubblicazione: (2024)

Self-supervised Attribute-aware Dynamic Preference Ranking Alignment
di: Yang, Hongyu, et al.
Pubblicazione: (2025)

Improving Alignment in LVLMs with Debiased Self-Judgment
di: Yang, Sihan, et al.
Pubblicazione: (2025)

How Many Human Judgments Are Enough? Feasibility Limits of Human Preference Evaluation
di: Lee, Wilson Y.
Pubblicazione: (2026)

LINKAGE: Listwise Ranking among Varied-Quality References for Non-Factoid QA Evaluation via LLMs
di: Yang, Sihui, et al.
Pubblicazione: (2024)

Rethinking the Necessity of Adaptive Retrieval-Augmented Generation through the Lens of Adaptive Listwise Ranking
di: Feng, Jun, et al.
Pubblicazione: (2026)

Listwise Preference Optimization with Element-wise Confusions for Aspect Sentiment Quad Prediction
di: Lai, Wenna, et al.
Pubblicazione: (2025)

AcuRank: Uncertainty-Aware Adaptive Computation for Listwise Reranking
di: Yoon, Soyoung, et al.
Pubblicazione: (2025)

Beyond the Surface: Measuring Self-Preference in LLM Judgments
di: Chen, Zhi-Yuan, et al.
Pubblicazione: (2025)

Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments
di: Zhou, Han, et al.
Pubblicazione: (2024)

Self-Calibrated Listwise Reranking with Large Language Models
di: Ren, Ruiyang, et al.
Pubblicazione: (2024)

Decoding the Ear: A Framework for Objectifying Expressiveness from Human Preference Through Efficient Alignment
di: Lin, Zhiyu, et al.
Pubblicazione: (2025)

Cat, Rat, Meow: On the Alignment of Language Model and Human Term-Similarity Judgments
di: Linhardt, Lorenz, et al.
Pubblicazione: (2025)

Uncovering Factor Level Preferences to Improve Human-Model Alignment
di: Oh, Juhyun, et al.
Pubblicazione: (2024)

Maximizing Signal in Human-Model Preference Alignment
di: Kraus, Kelsey, et al.
Pubblicazione: (2025)

E2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker
di: Liu, Qi, et al.
Pubblicazione: (2025)

MM-SCALE: Grounded Multimodal Moral Reasoning via Scalar Judgment and Listwise Alignment
di: Park, Eunkyu, et al.
Pubblicazione: (2026)

Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data
di: Zhao, Shuai, et al.
Pubblicazione: (2025)

Tracing How Annotators Think: Augmenting Preference Judgments with Reading Processes
di: de Langis, Karin, et al.
Pubblicazione: (2025)

Putting HUMANS first: Efficient LAM Evaluation with Human Preference Alignment
di: Gan, Woody Haosheng, et al.
Pubblicazione: (2026)

Beyond Single-Point Judgment: Distribution Alignment for LLM-as-a-Judge
di: Chen, Luyu, et al.
Pubblicazione: (2025)

Curry-DPO: Enhancing Alignment using Curriculum Learning & Ranked Preferences
di: Pattnaik, Pulkit, et al.
Pubblicazione: (2024)

Human Bias in the Face of AI: Examining Human Judgment Against Text Labeled as AI Generated
di: Zhu, Tiffany, et al.
Pubblicazione: (2024)

ListConRanker: A Contrastive Text Reranker with Listwise Encoding
di: Liu, Junlong, et al.
Pubblicazione: (2025)

Human Preferences for Constructive Interactions in Language Model Alignment
di: Kyrychenko, Yara, et al.
Pubblicazione: (2025)

Learning from Emptiness: De-biasing Listwise Rerankers with Content-Agnostic Probability Calibration
di: Lv, Hang, et al.
Pubblicazione: (2026)

Leveraging Passage Embeddings for Efficient Listwise Reranking with Large Language Models
di: Liu, Qi, et al.
Pubblicazione: (2024)

MaxMin-RLHF: Alignment with Diverse Human Preferences
di: Chakraborty, Souradip, et al.
Pubblicazione: (2024)

ContraSolver: Self-Alignment of Language Models by Resolving Internal Preference Contradictions
di: Zhang, Xu, et al.
Pubblicazione: (2024)

Confounding-Robust Policy Improvement with Human-AI Teams
di: Gao, Ruijiang, et al.
Pubblicazione: (2023)

Adjusting Interpretable Dimensions in Embedding Space with Human Judgments
di: Erk, Katrin, et al.
Pubblicazione: (2024)

Enhancing Human Evaluation in Machine Translation with Comparative Judgment
di: Song, Yixiao, et al.
Pubblicazione: (2025)

Using Natural Language Explanations to Rescale Human Judgments
di: Wadhwa, Manya, et al.
Pubblicazione: (2023)