Saved in:
Bibliographic Details
Main Authors: Liang, Zhixuan, Zeng, Xingyu, Zhao, Rui, Luo, Ping
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2310.08387
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916720687448064
author Liang, Zhixuan
Zeng, Xingyu
Zhao, Rui
Luo, Ping
author_facet Liang, Zhixuan
Zeng, Xingyu
Zhao, Rui
Luo, Ping
contents Active learning strategies aim to train high-performance models with minimal labeled data by selecting the most informative instances for labeling. However, existing methods for assessing data informativeness often fail to align directly with task model performance metrics, such as mean average precision (mAP) in object detection. This paper introduces Mean-AP Guided Reinforced Active Learning for Object Detection (MGRAL), a novel approach that leverages the concept of expected model output changes as informativeness for deep detection networks, directly optimizing the sampling strategy using mAP. MGRAL employs a reinforcement learning agent based on LSTM architecture to efficiently navigate the combinatorial challenge of batch sample selection and the non-differentiable nature between performance and selected batches. The agent optimizes selection using policy gradient with mAP improvement as the reward signal. To address the computational intensity of mAP estimation with unlabeled samples, we implement fast look-up tables, ensuring real-world feasibility. We evaluate MGRAL on PASCAL VOC and MS COCO benchmarks across various backbone architectures. Our approach demonstrates strong performance, establishing a new paradigm in reinforcement learning-based active learning for object detection.
format Preprint
id arxiv_https___arxiv_org_abs_2310_08387
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle Aligning Data Selection with Performance: Performance-driven Reinforcement Learning for Active Learning in Object Detection
Liang, Zhixuan
Zeng, Xingyu
Zhao, Rui
Luo, Ping
Computer Vision and Pattern Recognition
Machine Learning
Active learning strategies aim to train high-performance models with minimal labeled data by selecting the most informative instances for labeling. However, existing methods for assessing data informativeness often fail to align directly with task model performance metrics, such as mean average precision (mAP) in object detection. This paper introduces Mean-AP Guided Reinforced Active Learning for Object Detection (MGRAL), a novel approach that leverages the concept of expected model output changes as informativeness for deep detection networks, directly optimizing the sampling strategy using mAP. MGRAL employs a reinforcement learning agent based on LSTM architecture to efficiently navigate the combinatorial challenge of batch sample selection and the non-differentiable nature between performance and selected batches. The agent optimizes selection using policy gradient with mAP improvement as the reward signal. To address the computational intensity of mAP estimation with unlabeled samples, we implement fast look-up tables, ensuring real-world feasibility. We evaluate MGRAL on PASCAL VOC and MS COCO benchmarks across various backbone architectures. Our approach demonstrates strong performance, establishing a new paradigm in reinforcement learning-based active learning for object detection.
title Aligning Data Selection with Performance: Performance-driven Reinforcement Learning for Active Learning in Object Detection
topic Computer Vision and Pattern Recognition
Machine Learning
url https://arxiv.org/abs/2310.08387