Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Zhi, Xiaoying, Babbar, Varun, Liu, Rundong, Sun, Pheobe, Silavong, Fran, Shi, Ruibo, Moran, Sean
Format:	Preprint
Published:	2023
Subjects:	Machine Learning Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2302.10798
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913643172462592
author	Zhi, Xiaoying Babbar, Varun Liu, Rundong Sun, Pheobe Silavong, Fran Shi, Ruibo Moran, Sean
author_facet	Zhi, Xiaoying Babbar, Varun Liu, Rundong Sun, Pheobe Silavong, Fran Shi, Ruibo Moran, Sean
contents	The subject of green AI has been gaining attention within the deep learning community given the recent trend of ever larger and more complex neural network models. Existing solutions for reducing the computational load of training at inference time usually involve pruning the network parameters. Pruning schemes often create extra overhead either by iterative training and fine-tuning for static pruning or repeated computation of a dynamic pruning graph. We propose a new parameter pruning strategy for learning a lighter-weight sub-network that minimizes the energy cost while maintaining comparable performance to the fully parameterised network on given downstream tasks. Our proposed pruning scheme is green-oriented, as it only requires a one-off training to discover the optimal static sub-networks by dynamic pruning methods. The pruning scheme consists of a binary gating module and a polarizing loss function to uncover sub-networks with user-defined sparsity. Our method enables pruning and training simultaneously, which saves energy in both the training and inference phases and avoids extra computational overhead from gating modules at inference time. Our results on CIFAR-10, CIFAR-100, and Tiny Imagenet suggest that our scheme can remove 50% of connections in deep networks with <1% reduction in classification accuracy. Compared to other related pruning methods, our method demonstrates a lower drop in accuracy for equivalent reductions in computational cost.
format	Preprint
id	arxiv_https___arxiv_org_abs_2302_10798
institution	arXiv
publishDate	2023
record_format	arxiv
spellingShingle	Learning a Consensus Sub-Network with Polarization Regularization and One Pass Training Zhi, Xiaoying Babbar, Varun Liu, Rundong Sun, Pheobe Silavong, Fran Shi, Ruibo Moran, Sean Machine Learning Computer Vision and Pattern Recognition The subject of green AI has been gaining attention within the deep learning community given the recent trend of ever larger and more complex neural network models. Existing solutions for reducing the computational load of training at inference time usually involve pruning the network parameters. Pruning schemes often create extra overhead either by iterative training and fine-tuning for static pruning or repeated computation of a dynamic pruning graph. We propose a new parameter pruning strategy for learning a lighter-weight sub-network that minimizes the energy cost while maintaining comparable performance to the fully parameterised network on given downstream tasks. Our proposed pruning scheme is green-oriented, as it only requires a one-off training to discover the optimal static sub-networks by dynamic pruning methods. The pruning scheme consists of a binary gating module and a polarizing loss function to uncover sub-networks with user-defined sparsity. Our method enables pruning and training simultaneously, which saves energy in both the training and inference phases and avoids extra computational overhead from gating modules at inference time. Our results on CIFAR-10, CIFAR-100, and Tiny Imagenet suggest that our scheme can remove 50% of connections in deep networks with <1% reduction in classification accuracy. Compared to other related pruning methods, our method demonstrates a lower drop in accuracy for equivalent reductions in computational cost.
title	Learning a Consensus Sub-Network with Polarization Regularization and One Pass Training
topic	Machine Learning Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2302.10798

Similar Items