Saved in:
Bibliographic Details
Main Authors: Rajabinasab, Muhammad, Nejad, Afsaneh M., Zimek, Arthur
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.23563
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914591624134656
author Rajabinasab, Muhammad
Nejad, Afsaneh M.
Zimek, Arthur
author_facet Rajabinasab, Muhammad
Nejad, Afsaneh M.
Zimek, Arthur
contents Comprehensive evaluation of machine learning models is the key to make sure that they perform as robustly and consistently as desired. In order to summarize the experimental results and pick a winner, Critical Difference (CD) diagrams are used. Standard CD diagrams rely on discrete ranks, discarding the magnitude of performance gaps between models, raising an issue which we call magnitude-blindness. In order to address this issue, we propose Magnitude-Aware Rank Statistics (MARS) that incorporates a relative margin coefficient as a weight for the discrete ranks. This coefficient scales ranks based on the distance between the best and worst performers, with a dynamic projection to handle boundary cases. Followed by the calculation of a CD value, MARS results in a more realistic statistical representation of differences of model performances and more insights on how methods actually perform in vast and extensive experimental settings.
format Preprint
id arxiv_https___arxiv_org_abs_2605_23563
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle MARS: Magnitude-Aware Rank Statistics
Rajabinasab, Muhammad
Nejad, Afsaneh M.
Zimek, Arthur
Machine Learning
Comprehensive evaluation of machine learning models is the key to make sure that they perform as robustly and consistently as desired. In order to summarize the experimental results and pick a winner, Critical Difference (CD) diagrams are used. Standard CD diagrams rely on discrete ranks, discarding the magnitude of performance gaps between models, raising an issue which we call magnitude-blindness. In order to address this issue, we propose Magnitude-Aware Rank Statistics (MARS) that incorporates a relative margin coefficient as a weight for the discrete ranks. This coefficient scales ranks based on the distance between the best and worst performers, with a dynamic projection to handle boundary cases. Followed by the calculation of a CD value, MARS results in a more realistic statistical representation of differences of model performances and more insights on how methods actually perform in vast and extensive experimental settings.
title MARS: Magnitude-Aware Rank Statistics
topic Machine Learning
url https://arxiv.org/abs/2605.23563