Saved in:
Bibliographic Details
Main Authors: Rajabinasab, Muhammad, Nejad, Afsaneh M., Zimek, Arthur
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.23563
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Comprehensive evaluation of machine learning models is the key to make sure that they perform as robustly and consistently as desired. In order to summarize the experimental results and pick a winner, Critical Difference (CD) diagrams are used. Standard CD diagrams rely on discrete ranks, discarding the magnitude of performance gaps between models, raising an issue which we call magnitude-blindness. In order to address this issue, we propose Magnitude-Aware Rank Statistics (MARS) that incorporates a relative margin coefficient as a weight for the discrete ranks. This coefficient scales ranks based on the distance between the best and worst performers, with a dynamic projection to handle boundary cases. Followed by the calculation of a CD value, MARS results in a more realistic statistical representation of differences of model performances and more insights on how methods actually perform in vast and extensive experimental settings.