| Main Authors: | Feng, Shengyu, He, Yun, Ma, Shuang, Li, Beibin, Xiong, Yuanhao, Li, Songlin, Mandyam, Karishma, Katz-Samuels, Julian, Bi, Shengjie, Yu, Licheng, Zhang, Hejia, Sankararaman, Karthik Abinav, Fang, Han, Yang, Yiming, Faruqui, Manaal |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.15242 |
Similar Items
AdvancedIF: Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following
by: He, Yun, et al.
Published: (2025)
Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
by: Munkhdalai, Tsendsuren, et al.
Published: (2024)
Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following
by: He, Yun, et al.
Published: (2024)
Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via Regression
by: Slivkins, Aleksandrs, et al.
Published: (2022)
Recent advances in the Bradley–Terry Model: theory, algorithms, and applications
by: Fang, Shuxing, et al.
Published: (2026)
PageRank and the Bradley-Terry model
by: Selby, David Antony
Published: (2024)
The Bradley-Terry Stochastic Block Model
by: Santi, Lapo, et al.
Published: (2025)
Efficient Portfolio Selection through Preference Aggregation with Quicksort and the Bradley–Terry Model
by: Ge, Yurun, et al.
Published: (2025)
Bradley-Terry and Multi-Objective Reward Modeling Are Complementary
by: Zhang, Zhiwei, et al.
Published: (2025)
The Perfect Blend: Redefining RLHF with Mixture of Judges
by: Xu, Tengyu, et al.
Published: (2024)
Efficient Inference for Covariate-adjusted Bradley-Terry Model with Covariate Shift
by: Li, Xiudi, et al.
Published: (2025)
The many routes to the ubiquitous Bradley-Terry model
by: Hamilton, Ian, et al.
Published: (2023)
Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment
by: Zhang, Yifan, et al.
Published: (2024)
Rethinking Bradley-Terry Models in Preference-Based Reward Modeling: Foundations, Theory, and Alternatives
by: Sun, Hao, et al.
Published: (2024)
Energy-Based Preference Model Offers Better Offline Alignment than the Bradley-Terry Preference Model
by: Hong, Yuzhong, et al.
Published: (2024)
A spectral approach for the dynamic Bradley–Terry model
by: Tian, Xinyu, et al.
Published: (2024)
Minimax Hypothesis Testing for the Bradley-Terry-Luce Model
by: Makur, Anuran, et al.
Published: (2024)
Preference Optimization with Multi-Sample Comparisons
by: Wang, Chaoqi, et al.
Published: (2024)
Neural Bradley-Terry Rating: Quantifying Properties from Comparisons
by: Fujii, Satoru
Published: (2023)
To stay discovered: On tournament mean score sequences and the Bradley–Terry model
by: Aldous, David, et al.
Published: (2018)
OpenDeepThink: Parallel Reasoning via Bradley-Terry Aggregation
by: Zhou, Shang, et al.
Published: (2026)
The Bayesian Intransitive Bradley-Terry Model via Combinatorial Hodge Theory
by: Okahara, Hisaya, et al.
Published: (2026)
Generalized Bradley-Terry Models for Score Estimation from Paired Comparisons
by: Fageot, Julien, et al.
Published: (2023)
Generalized Parallel Scaling with Interdependent Generations
by: Dong, Harry, et al.
Published: (2025)
Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation
by: Vu, Tu, et al.
Published: (2024)
The incomplete Analytic Hierarchy Process and Bradley-Terry model: (in)consistency and information retrieval
by: Gyarmati, László, et al.
Published: (2022)
Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation
by: Krishna, Satyapriya, et al.
Published: (2024)
What Matters for Model Merging at Scale?
by: Yadav, Prateek, et al.
Published: (2024)
HYPO: Hyperspherical Out-of-Distribution Generalization
by: Bai, Haoyue, et al.
Published: (2024)
Inclusive Ranking of Indian States and Union Territories via Bayesian Bradley-Terry Model
by: Rizvi, Arshi, et al.
Published: (2026)
Argument Quality Assessment with Large Language Models: A Pairwise Bradley-Terry Approach
by: Ocampo, Nicolás Benjamín, et al.
Published: (2026)
Scalable Bayesian Inference for Bradley–Terry Models with Ties: An Application to Honour Based Abuse
by: Seymour, Rowland G, et al.
Published: (2024)
AutoMix: Automatically Mixing Language Models
by: Aggarwal, Pranjal, et al.
Published: (2023)
Inference in a generalized Bradley-Terry model for paired comparisons with covariates and a growing number of subjects
by: Yan, Ting
Published: (2025)
An analysis of factors impacting team strengths in the Australian Football League using time-variant Bradley-Terry models
by: Soffner, Carlos Rafael Gonzalez, et al.
Published: (2024)
Reflect-RL: Two-Player Online RL Fine-Tuning for LMs
by: Zhou, Runlong, et al.
Published: (2024)
Reinforcement Learning from User Feedback
by: Han, Eric, et al.
Published: (2025)
On the Equivalence of Graph Convolution and Mixup
by: Han, Xiaotian, et al.
Published: (2023)
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
by: Yu, Zishun, et al.
Published: (2025)
HoSNN: Adversarially-Robust Homeostatic Spiking Neural Networks with Adaptive Firing Thresholds
by: Geng, Hejia, et al.
Published: (2023)