Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Dahri, Tarique, Memon, Zulfiqar Ali, Yu, Zhenyu, Idris, Mohd. Yamani Idna, Khan, Sheheryar, Ahmad, Sadiq, Shoman, Maged, Aziz, Saddam, Qureshi, Rizwan
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2506.07055
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916784504832000
author	Dahri, Tarique Memon, Zulfiqar Ali Yu, Zhenyu Idris, Mohd. Yamani Idna Khan, Sheheryar Ahmad, Sadiq Shoman, Maged Aziz, Saddam Qureshi, Rizwan
author_facet	Dahri, Tarique Memon, Zulfiqar Ali Yu, Zhenyu Idris, Mohd. Yamani Idna Khan, Sheheryar Ahmad, Sadiq Shoman, Maged Aziz, Saddam Qureshi, Rizwan
contents	We introduce Layered Self-Supervised Knowledge Distillation (LSSKD) framework for training compact deep learning models. Unlike traditional methods that rely on pre-trained teacher networks, our approach appends auxiliary classifiers to intermediate feature maps, generating diverse self-supervised knowledge and enabling one-to-one transfer across different network stages. Our method achieves an average improvement of 4.54\% over the state-of-the-art PS-KD method and a 1.14% gain over SSKD on CIFAR-100, with a 0.32% improvement on ImageNet compared to HASSKD. Experiments on Tiny ImageNet and CIFAR-100 under few-shot learning scenarios also achieve state-of-the-art results. These findings demonstrate the effectiveness of our approach in enhancing model generalization and performance without the need for large over-parameterized teacher networks. Importantly, at the inference stage, all auxiliary classifiers can be removed, yielding no extra computational cost. This makes our model suitable for deploying small language models on affordable low-computing devices. Owing to its lightweight design and adaptability, our framework is particularly suitable for multimodal sensing and cyber-physical environments that require efficient and responsive inference. LSSKD facilitates the development of intelligent agents capable of learning from limited sensory data under weak supervision.
format	Preprint
id	arxiv_https___arxiv_org_abs_2506_07055
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	A Layered Self-Supervised Knowledge Distillation Framework for Efficient Multimodal Learning on the Edge Dahri, Tarique Memon, Zulfiqar Ali Yu, Zhenyu Idris, Mohd. Yamani Idna Khan, Sheheryar Ahmad, Sadiq Shoman, Maged Aziz, Saddam Qureshi, Rizwan Computer Vision and Pattern Recognition We introduce Layered Self-Supervised Knowledge Distillation (LSSKD) framework for training compact deep learning models. Unlike traditional methods that rely on pre-trained teacher networks, our approach appends auxiliary classifiers to intermediate feature maps, generating diverse self-supervised knowledge and enabling one-to-one transfer across different network stages. Our method achieves an average improvement of 4.54\% over the state-of-the-art PS-KD method and a 1.14% gain over SSKD on CIFAR-100, with a 0.32% improvement on ImageNet compared to HASSKD. Experiments on Tiny ImageNet and CIFAR-100 under few-shot learning scenarios also achieve state-of-the-art results. These findings demonstrate the effectiveness of our approach in enhancing model generalization and performance without the need for large over-parameterized teacher networks. Importantly, at the inference stage, all auxiliary classifiers can be removed, yielding no extra computational cost. This makes our model suitable for deploying small language models on affordable low-computing devices. Owing to its lightweight design and adaptability, our framework is particularly suitable for multimodal sensing and cyber-physical environments that require efficient and responsive inference. LSSKD facilitates the development of intelligent agents capable of learning from limited sensory data under weak supervision.
title	A Layered Self-Supervised Knowledge Distillation Framework for Efficient Multimodal Learning on the Edge
topic	Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2506.07055

Similar Items