Bibliographic Details
Main Authors: Harman, Jason L., Scheuerman, Jaelle
Format: Preprint
Published: 2024
Online Access: https://arxiv.org/abs/2403.11840
Table of Contents:
  • This paper describes a generalizable model evaluation method that can be adapted to evaluate AI/ML models across multiple criteria, including core scientific principles and more practical outcomes. Emerging from prediction competitions in Psychology and Decision Science, the method evaluates a group of candidate models of varying type and structure across multiple scientific, theoretical, and practical criteria. Ordinal rankings of criterion scores are evaluated using voting rules from the field of computational social choice, which allows divergent measures and types of models to be compared in a single holistic evaluation. Additional advantages and applications are discussed.
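
The rank-then-vote idea in the summary above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the summary does not say which voting rule is used, so the Borda count is assumed here purely as one well-known rule from social choice. The model names, criterion names, and scores are hypothetical and exist only to make the example runnable.

```python
from typing import Dict, List


def rank_models(scores: Dict[str, float], higher_is_better: bool = True) -> List[str]:
    """Turn one criterion's raw scores into an ordinal ranking, best model first."""
    return sorted(scores, key=lambda m: scores[m], reverse=higher_is_better)


def borda_count(rankings: List[List[str]]) -> Dict[str, int]:
    """Aggregate rankings with the Borda rule (an assumed choice of voting rule).

    Each criterion acts as one voter: with n models, first place earns
    n - 1 points, second place n - 2, and last place 0.
    """
    totals: Dict[str, int] = {}
    for ranking in rankings:
        n = len(ranking)
        for position, model in enumerate(ranking):
            totals[model] = totals.get(model, 0) + (n - 1 - position)
    return totals


# Hypothetical criteria on incomparable scales: (scores, higher_is_better).
criteria = {
    "predictive_accuracy": ({"model_a": 0.81, "model_b": 0.78, "model_c": 0.90}, True),
    "parameter_count": ({"model_a": 3, "model_b": 12, "model_c": 40}, False),
    "runtime_seconds": ({"model_a": 2.0, "model_b": 1.5, "model_c": 9.0}, False),
}

rankings = [rank_models(scores, hib) for scores, hib in criteria.values()]
totals = borda_count(rankings)
winner = max(totals, key=totals.get)

print(totals)
print(winner)
```

Because only the rank order within each criterion enters the vote, measures with different units (an accuracy, a parameter count, a runtime) can be combined without rescaling. Other voting rules could be substituted for `borda_count` and may produce a different winner on the same rankings.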