Staff View: NASH: Neural Architecture Search for Hardware-Optimized Machine Learning Models :: Library Catalog

Bibliographic Details
Main Authors:	Ji, Mengfei; Chang, Yuchun; Zhang, Baolin; Al-Ars, Zaid
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Artificial Intelligence; Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2403.01845

_version_	1866909133360332800
author	Ji, Mengfei; Chang, Yuchun; Zhang, Baolin; Al-Ars, Zaid
author_facet	Ji, Mengfei; Chang, Yuchun; Zhang, Baolin; Al-Ars, Zaid
contents	As machine learning (ML) algorithms are deployed in an ever-increasing number of applications, they need to achieve better trade-offs between high accuracy, high throughput, and low latency. This paper introduces NASH, a novel approach that applies neural architecture search to machine learning hardware. Using NASH, hardware designs can achieve not only high throughput and low latency but also superior accuracy. We present four versions of the NASH strategy, all of which show higher accuracy than the original models. The strategy can be applied to various convolutional neural networks, selecting specific model operations among many to guide the training process toward higher accuracy. Experimental results show that applying NASH to ResNet18 or ResNet34 increases top-1 accuracy by up to 3.1% and top-5 accuracy by up to 2.2% compared to the non-NASH version when tested on the ImageNet data set. We also integrated this approach into the FINN hardware model synthesis tool to automate the application of our approach and the generation of the hardware model. Results show that using FINN can achieve a maximum throughput of 324.5 fps. In addition, NASH models can provide a better trade-off between accuracy and hardware resource utilization. The accuracy-hardware (HW) Pareto curve shows that the models with the four NASH versions represent the best trade-offs, achieving the highest accuracy for a given HW utilization. The code for our implementation is open source and publicly available on GitHub at https://github.com/MFJI/NASH.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_01845
institution	arXiv
publishDate	2024
record_format	arxiv
title	NASH: Neural Architecture Search for Hardware-Optimized Machine Learning Models
topic	Machine Learning; Artificial Intelligence; Computer Vision and Pattern Recognition
url	https://arxiv.org/abs/2403.01845

Similar Items