Saved in:
Bibliographic Details
Main Authors: Flowers, Anna R., Franck, Christopher T., Gramacy, Robert B., Krometis, Justin A.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2512.20021
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909974503882752
author Flowers, Anna R.
Franck, Christopher T.
Gramacy, Robert B.
Krometis, Justin A.
author_facet Flowers, Anna R.
Franck, Christopher T.
Gramacy, Robert B.
Krometis, Justin A.
contents Collecting operationally realistic data to inform machine learning models can be costly. Before collecting new data, it is helpful to understand where a model is deficient. For example, object detectors trained on images of rare objects may not be good at identification in poorly represented conditions. We offer a way of informing subsequent data acquisition to maximize model performance by leveraging the toolkit of computer experiments and metadata describing the circumstances under which the training data was collected (e.g., season, time of day, location). We do this by evaluating the learner as the training data is varied according to its metadata. A Gaussian process (GP) surrogate fit to that response surface can inform new data acquisitions. This meta-learning approach offers improvements to learner performance as compared to data with randomly selected metadata, which we illustrate on both classic learning examples, and on a motivating application involving the collection of aerial images in search of airplanes.
format Preprint
id arxiv_https___arxiv_org_abs_2512_20021
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Gaussian Process Assisted Meta-learning for Image Classification and Object Detection Models
Flowers, Anna R.
Franck, Christopher T.
Gramacy, Robert B.
Krometis, Justin A.
Machine Learning
Collecting operationally realistic data to inform machine learning models can be costly. Before collecting new data, it is helpful to understand where a model is deficient. For example, object detectors trained on images of rare objects may not be good at identification in poorly represented conditions. We offer a way of informing subsequent data acquisition to maximize model performance by leveraging the toolkit of computer experiments and metadata describing the circumstances under which the training data was collected (e.g., season, time of day, location). We do this by evaluating the learner as the training data is varied according to its metadata. A Gaussian process (GP) surrogate fit to that response surface can inform new data acquisitions. This meta-learning approach offers improvements to learner performance as compared to data with randomly selected metadata, which we illustrate on both classic learning examples, and on a motivating application involving the collection of aerial images in search of airplanes.
title Gaussian Process Assisted Meta-learning for Image Classification and Object Detection Models
topic Machine Learning
url https://arxiv.org/abs/2512.20021