Staff View: AuG-KD: Anchor-Based Mixup Generation for Out-of-Domain Knowledge Distillation :: Library Catalog

Bibliographic Details
Main Authors:	Tang, Zihao; Lv, Zheqi; Zhang, Shengyu; Zhou, Yifan; Duan, Xinyu; Wu, Fei; Kuang, Kun
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2403.07030

_version_	1866909139076120576
author	Tang, Zihao; Lv, Zheqi; Zhang, Shengyu; Zhou, Yifan; Duan, Xinyu; Wu, Fei; Kuang, Kun
author_facet	Tang, Zihao; Lv, Zheqi; Zhang, Shengyu; Zhou, Yifan; Duan, Xinyu; Wu, Fei; Kuang, Kun
contents	Due to privacy or patent concerns, a growing number of large models are released without granting access to their training data, which makes transferring their knowledge inefficient and problematic. In response, Data-Free Knowledge Distillation (DFKD) methods have emerged as direct solutions. However, models derived from DFKD suffer significant performance degradation in real-world applications, due to the discrepancy between teachers' training data and real-world scenarios (the student domain). The degradation stems from the portions of teachers' knowledge that are not applicable to the student domain. These portions are specific to the teacher domain and would undermine students' performance. Hence, selectively transferring teachers' appropriate knowledge becomes the primary challenge in DFKD. In this work, we propose a simple but effective method, AuG-KD. It utilizes an uncertainty-guided and sample-specific anchor to align student-domain data with the teacher domain, and leverages a generative method to progressively trade off the learning process between OOD knowledge distillation and domain-specific information learning via mixup learning. Extensive experiments on 3 datasets and 8 settings demonstrate the stability and superiority of our approach. Code available at https://github.com/IshiKura-a/AuG-KD.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_07030
institution	arXiv
publishDate	2024
record_format	arxiv
title	AuG-KD: Anchor-Based Mixup Generation for Out-of-Domain Knowledge Distillation
topic	Machine Learning; Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2403.07030
