Gardado en:
Detalles Bibliográficos
Main Authors: Xuan, Wei, Xuan, Zihao, Fu, Rongliang, Lin, Ning, Wong, Kwunhang, Yuan, Zikang, Feng, Lang, Wang, Zhongrui, Ho, Tsung-Yi, Jiao, Yuzhong, Liang, Luhong
Formato: Preprint
Publicado: 2026
Subjects:
Acceso en liña:https://arxiv.org/abs/2602.20521
Tags: Engadir etiqueta
Sen Etiquetas, Sexa o primeiro en etiquetar este rexistro!
_version_ 1866911464330100736
author Xuan, Wei
Xuan, Zihao
Fu, Rongliang
Lin, Ning
Wong, Kwunhang
Yuan, Zikang
Feng, Lang
Wang, Zhongrui
Ho, Tsung-Yi
Jiao, Yuzhong
Liang, Luhong
author_facet Xuan, Wei
Xuan, Zihao
Fu, Rongliang
Lin, Ning
Wong, Kwunhang
Yuan, Zikang
Feng, Lang
Wang, Zhongrui
Ho, Tsung-Yi
Jiao, Yuzhong
Liang, Luhong
contents The rapid deployment of deep neural network (DNN) accelerators in safety-critical domains such as autonomous vehicles, healthcare systems, and financial infrastructure necessitates robust mechanisms to safeguard data confidentiality and computational integrity. Existing security solutions for DNN accelerators, however, suffer from excessive hardware resource demands and frequent off-chip memory access overheads, which degrade performance and scalability. To address these challenges, this paper presents a secure and efficient memory protection framework for DNN accelerators with minimal overhead. First, we propose a bandwidth-aware cryptographic scheme that adapts encryption granularity based on memory traffic patterns, striking a balance between security and resource efficiency. Second, we observe that both the overlapping regions in the intra-layer tiling's sliding window pattern and those resulting from inter-layer tiling strategy discrepancies introduce substantial redundant memory accesses and repeated computational overhead in cryptography. Third, we introduce a multi-level authentication mechanism that effectively eliminates unnecessary off-chip memory accesses, enhancing performance and energy efficiency. Experimental results show that this work decreases performance overhead by over 12% and achieves 87% energy efficiency improvement for both server and edge neural processing units (NPUs), while ensuring robust scalability.
format Preprint
id arxiv_https___arxiv_org_abs_2602_20521
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Towards Secure and Efficient DNN Accelerators via Hardware-Software Co-Design
Xuan, Wei
Xuan, Zihao
Fu, Rongliang
Lin, Ning
Wong, Kwunhang
Yuan, Zikang
Feng, Lang
Wang, Zhongrui
Ho, Tsung-Yi
Jiao, Yuzhong
Liang, Luhong
Cryptography and Security
The rapid deployment of deep neural network (DNN) accelerators in safety-critical domains such as autonomous vehicles, healthcare systems, and financial infrastructure necessitates robust mechanisms to safeguard data confidentiality and computational integrity. Existing security solutions for DNN accelerators, however, suffer from excessive hardware resource demands and frequent off-chip memory access overheads, which degrade performance and scalability. To address these challenges, this paper presents a secure and efficient memory protection framework for DNN accelerators with minimal overhead. First, we propose a bandwidth-aware cryptographic scheme that adapts encryption granularity based on memory traffic patterns, striking a balance between security and resource efficiency. Second, we observe that both the overlapping regions in the intra-layer tiling's sliding window pattern and those resulting from inter-layer tiling strategy discrepancies introduce substantial redundant memory accesses and repeated computational overhead in cryptography. Third, we introduce a multi-level authentication mechanism that effectively eliminates unnecessary off-chip memory accesses, enhancing performance and energy efficiency. Experimental results show that this work decreases performance overhead by over 12% and achieves 87% energy efficiency improvement for both server and edge neural processing units (NPUs), while ensuring robust scalability.
title Towards Secure and Efficient DNN Accelerators via Hardware-Software Co-Design
topic Cryptography and Security
url https://arxiv.org/abs/2602.20521