Saved in:
Bibliographic Details
Main Authors: Huang, Xinlei, Tang, Jialiang, Zheng, Xubin, Zhou, Jinjia, Yu, Wenxin, Jiang, Ning
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2409.07694
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909320300462080
author Huang, Xinlei
Tang, Jialiang
Zheng, Xubin
Zhou, Jinjia
Yu, Wenxin
Jiang, Ning
author_facet Huang, Xinlei
Tang, Jialiang
Zheng, Xubin
Zhou, Jinjia
Yu, Wenxin
Jiang, Ning
contents Knowledge Distillation (KD) transfers knowledge from a large pre-trained teacher network to a compact and efficient student network, making it suitable for deployment on resource-limited media terminals. However, traditional KD methods require balanced data to ensure robust training, which is often unavailable in practical applications. In such scenarios, a few head categories occupy a substantial proportion of examples. This imbalance biases the trained teacher network towards the head categories, resulting in severe performance degradation on the less represented tail categories for both the teacher and student networks. In this paper, we propose a novel framework called Knowledge Rectification Distillation (KRDistill) to address the imbalanced knowledge inherited in the teacher network through the incorporation of the balanced category priors. Furthermore, we rectify the biased predictions produced by the teacher network, particularly focusing on the tail categories. Consequently, the teacher network can provide balanced and accurate knowledge to train a reliable student network. Intensive experiments conducted on various long-tailed datasets demonstrate that our KRDistill can effectively train reliable student networks in realistic scenarios of data imbalance.
format Preprint
id arxiv_https___arxiv_org_abs_2409_07694
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Learn from Balance: Rectifying Knowledge Transfer for Long-Tailed Scenarios
Huang, Xinlei
Tang, Jialiang
Zheng, Xubin
Zhou, Jinjia
Yu, Wenxin
Jiang, Ning
Computer Vision and Pattern Recognition
Knowledge Distillation (KD) transfers knowledge from a large pre-trained teacher network to a compact and efficient student network, making it suitable for deployment on resource-limited media terminals. However, traditional KD methods require balanced data to ensure robust training, which is often unavailable in practical applications. In such scenarios, a few head categories occupy a substantial proportion of examples. This imbalance biases the trained teacher network towards the head categories, resulting in severe performance degradation on the less represented tail categories for both the teacher and student networks. In this paper, we propose a novel framework called Knowledge Rectification Distillation (KRDistill) to address the imbalanced knowledge inherited in the teacher network through the incorporation of the balanced category priors. Furthermore, we rectify the biased predictions produced by the teacher network, particularly focusing on the tail categories. Consequently, the teacher network can provide balanced and accurate knowledge to train a reliable student network. Intensive experiments conducted on various long-tailed datasets demonstrate that our KRDistill can effectively train reliable student networks in realistic scenarios of data imbalance.
title Learn from Balance: Rectifying Knowledge Transfer for Long-Tailed Scenarios
topic Computer Vision and Pattern Recognition
url https://arxiv.org/abs/2409.07694