Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Peng, Ru, Zou, Heming, Wang, Haobo, Zeng, Yawen, Huang, Zenan, Zhao, Junbo
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence Computation and Language Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2401.12689
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916160015958016
author	Peng, Ru Zou, Heming Wang, Haobo Zeng, Yawen Huang, Zenan Zhao, Junbo
author_facet	Peng, Ru Zou, Heming Wang, Haobo Zeng, Yawen Huang, Zenan Zhao, Junbo
contents	The conventional evaluation protocols on machine learning models rely heavily on a labeled, i.i.d-assumed testing dataset, which is not often present in real world applications. The Automated Model Evaluation (AutoEval) shows an alternative to this traditional workflow, by forming a proximal prediction pipeline of the testing performance without the presence of ground-truth labels. Despite its recent successes, the AutoEval frameworks still suffer from an overconfidence issue, substantial storage and computational cost. In that regard, we propose a novel measure -- Meta-Distribution Energy (MDE) -- that allows the AutoEval framework to be both more efficient and effective. The core of the MDE is to establish a meta-distribution statistic, on the information (energy) associated with individual samples, then offer a smoother representation enabled by energy-based learning. We further provide our theoretical insights by connecting the MDE with the classification loss. We provide extensive experiments across modalities, datasets and different architectural backbones to validate MDE's validity, together with its superiority compared with prior approaches. We also prove MDE's versatility by showing its seamless integration with large-scale models, and easy adaption to learning scenarios with noisy- or imbalanced- labels. Code and data are available: https://github.com/pengr/Energy_AutoEval
format	Preprint
id	arxiv_https___arxiv_org_abs_2401_12689
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Energy-based Automated Model Evaluation Peng, Ru Zou, Heming Wang, Haobo Zeng, Yawen Huang, Zenan Zhao, Junbo Machine Learning Artificial Intelligence Computation and Language Computer Vision and Pattern Recognition The conventional evaluation protocols on machine learning models rely heavily on a labeled, i.i.d-assumed testing dataset, which is not often present in real world applications. The Automated Model Evaluation (AutoEval) shows an alternative to this traditional workflow, by forming a proximal prediction pipeline of the testing performance without the presence of ground-truth labels. Despite its recent successes, the AutoEval frameworks still suffer from an overconfidence issue, substantial storage and computational cost. In that regard, we propose a novel measure -- Meta-Distribution Energy (MDE) -- that allows the AutoEval framework to be both more efficient and effective. The core of the MDE is to establish a meta-distribution statistic, on the information (energy) associated with individual samples, then offer a smoother representation enabled by energy-based learning. We further provide our theoretical insights by connecting the MDE with the classification loss. We provide extensive experiments across modalities, datasets and different architectural backbones to validate MDE's validity, together with its superiority compared with prior approaches. We also prove MDE's versatility by showing its seamless integration with large-scale models, and easy adaption to learning scenarios with noisy- or imbalanced- labels. Code and data are available: https://github.com/pengr/Energy_AutoEval
title	Energy-based Automated Model Evaluation
topic	Machine Learning Artificial Intelligence Computation and Language Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2401.12689

Similar Items