Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Huang, Xinlei, Tang, Jialiang, Zheng, Xubin, Zhou, Jinjia, Yu, Wenxin, Jiang, Ning
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2409.07694
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909320300462080
author	Huang, Xinlei Tang, Jialiang Zheng, Xubin Zhou, Jinjia Yu, Wenxin Jiang, Ning
author_facet	Huang, Xinlei Tang, Jialiang Zheng, Xubin Zhou, Jinjia Yu, Wenxin Jiang, Ning
contents	Knowledge Distillation (KD) transfers knowledge from a large pre-trained teacher network to a compact and efficient student network, making it suitable for deployment on resource-limited media terminals. However, traditional KD methods require balanced data to ensure robust training, which is often unavailable in practical applications. In such scenarios, a few head categories occupy a substantial proportion of examples. This imbalance biases the trained teacher network towards the head categories, resulting in severe performance degradation on the less represented tail categories for both the teacher and student networks. In this paper, we propose a novel framework called Knowledge Rectification Distillation (KRDistill) to address the imbalanced knowledge inherited in the teacher network through the incorporation of the balanced category priors. Furthermore, we rectify the biased predictions produced by the teacher network, particularly focusing on the tail categories. Consequently, the teacher network can provide balanced and accurate knowledge to train a reliable student network. Intensive experiments conducted on various long-tailed datasets demonstrate that our KRDistill can effectively train reliable student networks in realistic scenarios of data imbalance.
format	Preprint
id	arxiv_https___arxiv_org_abs_2409_07694
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Learn from Balance: Rectifying Knowledge Transfer for Long-Tailed Scenarios Huang, Xinlei Tang, Jialiang Zheng, Xubin Zhou, Jinjia Yu, Wenxin Jiang, Ning Computer Vision and Pattern Recognition Knowledge Distillation (KD) transfers knowledge from a large pre-trained teacher network to a compact and efficient student network, making it suitable for deployment on resource-limited media terminals. However, traditional KD methods require balanced data to ensure robust training, which is often unavailable in practical applications. In such scenarios, a few head categories occupy a substantial proportion of examples. This imbalance biases the trained teacher network towards the head categories, resulting in severe performance degradation on the less represented tail categories for both the teacher and student networks. In this paper, we propose a novel framework called Knowledge Rectification Distillation (KRDistill) to address the imbalanced knowledge inherited in the teacher network through the incorporation of the balanced category priors. Furthermore, we rectify the biased predictions produced by the teacher network, particularly focusing on the tail categories. Consequently, the teacher network can provide balanced and accurate knowledge to train a reliable student network. Intensive experiments conducted on various long-tailed datasets demonstrate that our KRDistill can effectively train reliable student networks in realistic scenarios of data imbalance.
title	Learn from Balance: Rectifying Knowledge Transfer for Long-Tailed Scenarios
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2409.07694

Similar Items