Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Ren, Dongdong, Li, Wenbin, Ding, Tianyu, Wang, Lei, Fan, Qi, Huo, Jing, Pan, Hongbing, Gao, Yang
Format:	Preprint
Published:	2024
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2404.08016
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911837262446592
author	Ren, Dongdong Li, Wenbin Ding, Tianyu Wang, Lei Fan, Qi Huo, Jing Pan, Hongbing Gao, Yang
author_facet	Ren, Dongdong Li, Wenbin Ding, Tianyu Wang, Lei Fan, Qi Huo, Jing Pan, Hongbing Gao, Yang
contents	Recent advancements in model pruning have focused on developing new algorithms and improving upon benchmarks. However, the practical application of these algorithms across various models and platforms remains a significant challenge. To address this challenge, we propose ONNXPruner, a versatile pruning adapter designed for the ONNX format models. ONNXPruner streamlines the adaptation process across diverse deep learning frameworks and hardware platforms. A novel aspect of ONNXPruner is its use of node association trees, which automatically adapt to various model architectures. These trees clarify the structural relationships between nodes, guiding the pruning process, particularly highlighting the impact on interconnected nodes. Furthermore, we introduce a tree-level evaluation method. By leveraging node association trees, this method allows for a comprehensive analysis beyond traditional single-node evaluations, enhancing pruning performance without the need for extra operations. Experiments across multiple models and datasets confirm ONNXPruner's strong adaptability and increased efficacy. Our work aims to advance the practical application of model pruning.
format	Preprint
id	arxiv_https___arxiv_org_abs_2404_08016
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	ONNXPruner: ONNX-Based General Model Pruning Adapter Ren, Dongdong Li, Wenbin Ding, Tianyu Wang, Lei Fan, Qi Huo, Jing Pan, Hongbing Gao, Yang Machine Learning Recent advancements in model pruning have focused on developing new algorithms and improving upon benchmarks. However, the practical application of these algorithms across various models and platforms remains a significant challenge. To address this challenge, we propose ONNXPruner, a versatile pruning adapter designed for the ONNX format models. ONNXPruner streamlines the adaptation process across diverse deep learning frameworks and hardware platforms. A novel aspect of ONNXPruner is its use of node association trees, which automatically adapt to various model architectures. These trees clarify the structural relationships between nodes, guiding the pruning process, particularly highlighting the impact on interconnected nodes. Furthermore, we introduce a tree-level evaluation method. By leveraging node association trees, this method allows for a comprehensive analysis beyond traditional single-node evaluations, enhancing pruning performance without the need for extra operations. Experiments across multiple models and datasets confirm ONNXPruner's strong adaptability and increased efficacy. Our work aims to advance the practical application of model pruning.
title	ONNXPruner: ONNX-Based General Model Pruning Adapter
topic	Machine Learning
url	https://arxiv.org/abs/2404.08016

Similar Items