Saved in:
Bibliographic Details
Main Authors: Xie, Tong, Hu, Yixuan, Wei, Renjie, Li, Meng, Wang, Yuan, Wang, Runsheng, Huang, Ru
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2402.12820
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914686141726720
author Xie, Tong
Hu, Yixuan
Wei, Renjie
Li, Meng
Wang, Yuan
Wang, Runsheng
Huang, Ru
author_facet Xie, Tong
Hu, Yixuan
Wei, Renjie
Li, Meng
Wang, Yuan
Wang, Runsheng
Huang, Ru
contents Stochastic computing (SC) has emerged as a promising computing paradigm for neural acceleration. However, how to accelerate the state-of-the-art Vision Transformer (ViT) with SC remains unclear. Unlike convolutional neural networks, ViTs introduce notable compatibility and efficiency challenges because of their nonlinear functions, e.g., softmax and Gaussian Error Linear Units (GELU). In this paper, for the first time, a ViT accelerator based on end-to-end SC, dubbed ASCEND, is proposed. ASCEND co-designs the SC circuits and ViT networks to enable accurate yet efficient acceleration. To overcome the compatibility challenges, ASCEND proposes a novel deterministic SC block for GELU and leverages an SC-friendly iterative approximate algorithm to design an accurate and efficient softmax circuit. To improve inference efficiency, ASCEND develops a two-stage training pipeline to produce accurate low-precision ViTs. With extensive experiments, we show the proposed GELU and softmax blocks achieve 56.3% and 22.6% error reduction compared to existing SC designs, respectively and reduce the area-delay product (ADP) by 5.29x and 12.6x, respectively. Moreover, compared to the baseline low-precision ViTs, ASCEND also achieves significant accuracy improvements on CIFAR10 and CIFAR100.
format Preprint
id arxiv_https___arxiv_org_abs_2402_12820
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle ASCEND: Accurate yet Efficient End-to-End Stochastic Computing Acceleration of Vision Transformer
Xie, Tong
Hu, Yixuan
Wei, Renjie
Li, Meng
Wang, Yuan
Wang, Runsheng
Huang, Ru
Systems and Control
Stochastic computing (SC) has emerged as a promising computing paradigm for neural acceleration. However, how to accelerate the state-of-the-art Vision Transformer (ViT) with SC remains unclear. Unlike convolutional neural networks, ViTs introduce notable compatibility and efficiency challenges because of their nonlinear functions, e.g., softmax and Gaussian Error Linear Units (GELU). In this paper, for the first time, a ViT accelerator based on end-to-end SC, dubbed ASCEND, is proposed. ASCEND co-designs the SC circuits and ViT networks to enable accurate yet efficient acceleration. To overcome the compatibility challenges, ASCEND proposes a novel deterministic SC block for GELU and leverages an SC-friendly iterative approximate algorithm to design an accurate and efficient softmax circuit. To improve inference efficiency, ASCEND develops a two-stage training pipeline to produce accurate low-precision ViTs. With extensive experiments, we show the proposed GELU and softmax blocks achieve 56.3% and 22.6% error reduction compared to existing SC designs, respectively and reduce the area-delay product (ADP) by 5.29x and 12.6x, respectively. Moreover, compared to the baseline low-precision ViTs, ASCEND also achieves significant accuracy improvements on CIFAR10 and CIFAR100.
title ASCEND: Accurate yet Efficient End-to-End Stochastic Computing Acceleration of Vision Transformer
topic Systems and Control
url https://arxiv.org/abs/2402.12820