:: Library Catalog

Bibliographic Details
Main Authors:	Feng, Shengyu, He, Yun, Ma, Shuang, Li, Beibin, Xiong, Yuanhao, Li, Songlin, Mandyam, Karishma, Katz-Samuels, Julian, Bi, Shengjie, Yu, Licheng, Zhang, Hejia, Sankararaman, Karthik Abinav, Fang, Han, Yang, Yiming, Faruqui, Manaal
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2510.15242

Similar Items

AdvancedIF: Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following
by: He, Yun, et al.
Published: (2025)

Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
by: Munkhdalai, Tsendsuren, et al.
Published: (2024)

Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following
by: He, Yun, et al.
Published: (2024)

Contextual Bandits with Packing and Covering Constraints: A Modular Lagrangian Approach via Regression
by: Slivkins, Aleksandrs, et al.
Published: (2022)

Recent advances in the Bradley--Terry Model: theory, algorithms, and applications
by: Fang, Shuxing, et al.
Published: (2026)

PageRank and the Bradley-Terry model
by: Selby, David Antony
Published: (2024)

The Bradley-Terry Stochastic Block Model
by: Santi, Lapo, et al.
Published: (2025)

Efficient Portfolio Selection through Preference Aggregation with Quicksort and the Bradley--Terry Model
by: Ge, Yurun, et al.
Published: (2025)

Bradley-Terry and Multi-Objective Reward Modeling Are Complementary
by: Zhang, Zhiwei, et al.
Published: (2025)

The Perfect Blend: Redefining RLHF with Mixture of Judges
by: Xu, Tengyu, et al.
Published: (2024)

Efficient Inference for Covariate-adjusted Bradley-Terry Model with Covariate Shift
by: Li, Xiudi, et al.
Published: (2025)

The many routes to the ubiquitous Bradley-Terry model
by: Hamilton, Ian, et al.
Published: (2023)

Beyond Bradley-Terry Models: A General Preference Model for Language Model Alignment
by: Zhang, Yifan, et al.
Published: (2024)

Rethinking Bradley-Terry Models in Preference-Based Reward Modeling: Foundations, Theory, and Alternatives
by: Sun, Hao, et al.
Published: (2024)

Energy-Based Preference Model Offers Better Offline Alignment than the Bradley-Terry Preference Model
by: Hong, Yuzhong, et al.
Published: (2024)

A spectral approach for the dynamic Bradley–Terry model
by: Tian, Xinyu, et al.
Published: (2024)

Minimax Hypothesis Testing for the Bradley-Terry-Luce Model
by: Makur, Anuran, et al.
Published: (2024)

Preference Optimization with Multi-Sample Comparisons
by: Wang, Chaoqi, et al.
Published: (2024)

Neural Bradley-Terry Rating: Quantifying Properties from Comparisons
by: Fujii, Satoru
Published: (2023)

To stay discovered: On tournament mean score sequences and the Bradley--Terry model
by: Aldous, David, et al.
Published: (2018)

OpenDeepThink: Parallel Reasoning via Bradley-Terry Aggregation
by: Zhou, Shang, et al.
Published: (2026)

The Bayesian Intransitive Bradley-Terry Model via Combinatorial Hodge Theory
by: Okahara, Hisaya, et al.
Published: (2026)

Generalized Bradley-Terry Models for Score Estimation from Paired Comparisons
by: Fageot, Julien, et al.
Published: (2023)

Generalized Parallel Scaling with Interdependent Generations
by: Dong, Harry, et al.
Published: (2025)

Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation
by: Vu, Tu, et al.
Published: (2024)

The incomplete Analytic Hierarchy Process and Bradley-Terry model: (in)consistency and information retrieval
by: Gyarmati, László, et al.
Published: (2022)

Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation
by: Krishna, Satyapriya, et al.
Published: (2024)

What Matters for Model Merging at Scale?
by: Yadav, Prateek, et al.
Published: (2024)

HYPO: Hyperspherical Out-of-Distribution Generalization
by: Bai, Haoyue, et al.
Published: (2024)

Inclusive Ranking of Indian States and Union Territories via Bayesian Bradley-Terry Model
by: Rizvi, Arshi, et al.
Published: (2026)

Argument Quality Assessment with Large Language Models: A Pairwise Bradley-Terry Approach
by: Ocampo, Nicolás Benjamín, et al.
Published: (2026)

Scalable Bayesian Inference for Bradley--Terry Models with Ties: An Application to Honour Based Abuse
by: Seymour, Rowland G, et al.
Published: (2024)

AutoMix: Automatically Mixing Language Models
by: Aggarwal, Pranjal, et al.
Published: (2023)

Inference in a generalized Bradley-Terry model for paired comparisons with covariates and a growing number of subjects
by: Yan, Ting
Published: (2025)

An analysis of factors impacting team strengths in the Australian Football League using time-variant Bradley-Terry models
by: Soffner, Carlos Rafael Gonzalez, et al.
Published: (2024)

Reflect-RL: Two-Player Online RL Fine-Tuning for LMs
by: Zhou, Runlong, et al.
Published: (2024)

Reinforcement Learning from User Feedback
by: Han, Eric, et al.
Published: (2025)

On the Equivalence of Graph Convolution and Mixup
by: Han, Xiaotian, et al.
Published: (2023)

Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
by: Yu, Zishun, et al.
Published: (2025)

HoSNN: Adversarially-Robust Homeostatic Spiking Neural Networks with Adaptive Firing Thresholds
by: Geng, Hejia, et al.
Published: (2023)