Vista Equipo: :: Library Catalog

Guardado en:

Detalles Bibliográficos
Autores principales:	Xue, Xiang, Ji, Yatu, Ren, Qing-dao-er-ji, Shi, Bao, Lu, Min, Wu, Nier, Zhuang, Xufei, Xu, Haiteng, Cha, Gan-qi-qi-ge
Formato:	Preprint
Publicado:	2025
Materias:	Machine Learning Computer Vision and Pattern Recognition
Acceso en línea:	https://arxiv.org/abs/2509.12553
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

_version_	1866908540892872704
author	Xue, Xiang Ji, Yatu Ren, Qing-dao-er-ji Shi, Bao Lu, Min Wu, Nier Zhuang, Xufei Xu, Haiteng Cha, Gan-qi-qi-ge
author_facet	Xue, Xiang Ji, Yatu Ren, Qing-dao-er-ji Shi, Bao Lu, Min Wu, Nier Zhuang, Xufei Xu, Haiteng Cha, Gan-qi-qi-ge
contents	Logit Knowledge Distillation has gained substantial research interest in recent years due to its simplicity and lack of requirement for intermediate feature alignment; however, it suffers from limited interpretability in its decision-making process. To address this, we propose implicit Clustering Distillation (iCD): a simple and effective method that mines and transfers interpretable structural knowledge from logits, without requiring ground-truth labels or feature-space alignment. iCD leverages Gram matrices over decoupled local logit representations to enable student models to learn latent semantic structural patterns. Extensive experiments on benchmark datasets demonstrate the effectiveness of iCD across diverse teacher-student architectures, with particularly strong performance in fine-grained classification tasks -- achieving a peak improvement of +5.08% over the baseline. The code is available at: https://github.com/maomaochongaa/iCD.
format	Preprint
id	arxiv_https___arxiv_org_abs_2509_12553
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	iCD: A Implicit Clustering Distillation Mathod for Structural Information Mining Xue, Xiang Ji, Yatu Ren, Qing-dao-er-ji Shi, Bao Lu, Min Wu, Nier Zhuang, Xufei Xu, Haiteng Cha, Gan-qi-qi-ge Machine Learning Computer Vision and Pattern Recognition Logit Knowledge Distillation has gained substantial research interest in recent years due to its simplicity and lack of requirement for intermediate feature alignment; however, it suffers from limited interpretability in its decision-making process. To address this, we propose implicit Clustering Distillation (iCD): a simple and effective method that mines and transfers interpretable structural knowledge from logits, without requiring ground-truth labels or feature-space alignment. iCD leverages Gram matrices over decoupled local logit representations to enable student models to learn latent semantic structural patterns. Extensive experiments on benchmark datasets demonstrate the effectiveness of iCD across diverse teacher-student architectures, with particularly strong performance in fine-grained classification tasks -- achieving a peak improvement of +5.08% over the baseline. The code is available at: https://github.com/maomaochongaa/iCD.
title	iCD: A Implicit Clustering Distillation Mathod for Structural Information Mining
topic	Machine Learning Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2509.12553

Ejemplares similares