Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Kath, Hannes, Gouvêa, Thiago S., Sonntag, Daniel
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2602.13359
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866918337738440704
author	Kath, Hannes Gouvêa, Thiago S. Sonntag, Daniel
author_facet	Kath, Hannes Gouvêa, Thiago S. Sonntag, Daniel
contents	Machine learning models excel with abundant annotated data, but annotation is often costly and time-intensive. Active learning (AL) aims to improve the performance-to-annotation ratio by using query methods (QMs) to iteratively select the most informative samples. While AL research focuses mainly on QM development, the evaluation of this iterative process lacks appropriate performance metrics. This work reviews eight years of AL evaluation literature and formally introduces the speed-up factor, a quantitative multi-iteration QM performance metric that indicates the fraction of samples needed to match random sampling performance. Using four datasets from diverse domains and seven QMs of various types, we empirically evaluate the speed-up factor and compare it with state-of-the-art AL performance metrics. The results confirm the assumptions underlying the speed-up factor, demonstrate its accuracy in capturing the described fraction, and reveal its superior stability across iterations.
format	Preprint
id	arxiv_https___arxiv_org_abs_2602_13359
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	The Speed-up Factor: A Quantitative Multi-Iteration Active Learning Performance Metric Kath, Hannes Gouvêa, Thiago S. Sonntag, Daniel Machine Learning Machine learning models excel with abundant annotated data, but annotation is often costly and time-intensive. Active learning (AL) aims to improve the performance-to-annotation ratio by using query methods (QMs) to iteratively select the most informative samples. While AL research focuses mainly on QM development, the evaluation of this iterative process lacks appropriate performance metrics. This work reviews eight years of AL evaluation literature and formally introduces the speed-up factor, a quantitative multi-iteration QM performance metric that indicates the fraction of samples needed to match random sampling performance. Using four datasets from diverse domains and seven QMs of various types, we empirically evaluate the speed-up factor and compare it with state-of-the-art AL performance metrics. The results confirm the assumptions underlying the speed-up factor, demonstrate its accuracy in capturing the described fraction, and reveal its superior stability across iterations.
title	The Speed-up Factor: A Quantitative Multi-Iteration Active Learning Performance Metric
topic	Machine Learning
url	https://arxiv.org/abs/2602.13359

Similar Items