Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Ferreira, Juliett Suárez, Slavkovik, Marija, Casillas, Jorge
Format: Preprint
Veröffentlicht: 2025
Schlagworte:
Online-Zugang:https://arxiv.org/abs/2504.05923
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
_version_ 1866910906428948480
author Ferreira, Juliett Suárez
Slavkovik, Marija
Casillas, Jorge
author_facet Ferreira, Juliett Suárez
Slavkovik, Marija
Casillas, Jorge
contents Fairness constitutes a concern within machine learning (ML) applications. Currently, there is no study on how disparities in classification complexity between privileged and unprivileged groups could influence the fairness of solutions, which serves as a preliminary indicator of potential unfairness. In this work, we investigate this gap, specifically, we focus on synthetic datasets designed to capture a variety of biases ranging from historical bias to measurement and representational bias to evaluate how various complexity metrics differences correlate with group fairness metrics. We then apply association rule mining to identify patterns that link disproportionate complexity differences between groups with fairness-related outcomes, offering data-centric indicators to guide bias mitigation. Our findings are also validated by their application in real-world problems, providing evidence that quantifying group-wise classification complexity can uncover early indicators of potential fairness challenges. This investigation helps practitioners to proactively address bias in classification tasks.
format Preprint
id arxiv_https___arxiv_org_abs_2504_05923
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Uncovering Fairness through Data Complexity as an Early Indicator
Ferreira, Juliett Suárez
Slavkovik, Marija
Casillas, Jorge
Machine Learning
Artificial Intelligence
Data Structures and Algorithms
Fairness constitutes a concern within machine learning (ML) applications. Currently, there is no study on how disparities in classification complexity between privileged and unprivileged groups could influence the fairness of solutions, which serves as a preliminary indicator of potential unfairness. In this work, we investigate this gap, specifically, we focus on synthetic datasets designed to capture a variety of biases ranging from historical bias to measurement and representational bias to evaluate how various complexity metrics differences correlate with group fairness metrics. We then apply association rule mining to identify patterns that link disproportionate complexity differences between groups with fairness-related outcomes, offering data-centric indicators to guide bias mitigation. Our findings are also validated by their application in real-world problems, providing evidence that quantifying group-wise classification complexity can uncover early indicators of potential fairness challenges. This investigation helps practitioners to proactively address bias in classification tasks.
title Uncovering Fairness through Data Complexity as an Early Indicator
topic Machine Learning
Artificial Intelligence
Data Structures and Algorithms
url https://arxiv.org/abs/2504.05923