Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Ou, Wenhui, Wu, Zhuoyu, Zhang, Yipu, Wang, Zheng, Yue, C. Patrick
Format:	Preprint
Published:	2026
Subjects:	Hardware Architecture
Online Access:	https://arxiv.org/abs/2603.01165
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915825898749952
author	Ou, Wenhui Wu, Zhuoyu Zhang, Yipu Wang, Zheng Yue, C. Patrick
author_facet	Ou, Wenhui Wu, Zhuoyu Zhang, Yipu Wang, Zheng Yue, C. Patrick
contents	Recently, multi-layer perceptrons (MLPs) widely used in modern AI applications suffer from limited real-time performance due to intensive memory access overhead. Kolmogorov--Arnold Networks (KANs) have attracted increasing attention as an alternative architecture with similar structures to MLPs but improved parameter efficiency. However, the lack of dedicated hardware support limits the practical performance benefits of KANs. Moreover, since many edge workloads still rely heavily on MLPs, accelerators designed exclusively for KANs become inefficient and impractical. In this work, we present VIKIN, a reconfigurable accelerator that efficiently supports both KAN and MLP inference using unified hardware. VIKIN introduces a pipeline execution mode and two-stage sparsity support for efficient KAN processing, while enabling parallel-mode acceleration to improve MLP throughput under the same sparsity framework. Experiments on real-world datasets demonstrate that replacing MLPs with KANs on VIKIN achieves $1.28\times$ acceleration with $19.58\%$ reduced accuracy loss. For a higher-accuracy KAN model requiring $3.29\times$ more operations, VIKIN incurs only $1.24\times$ latency overhead compared with the baseline KAN model. In addition, VIKIN achieves $1.25\times$ speedup and $4.87\times$ higher energy efficiency than a representative edge GPU when executing KAN workloads.
format	Preprint
id	arxiv_https___arxiv_org_abs_2603_01165
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	VIKIN: A Reconfigurable Accelerator for KANs and MLPs with Two-Stage Sparsity Support Ou, Wenhui Wu, Zhuoyu Zhang, Yipu Wang, Zheng Yue, C. Patrick Hardware Architecture Recently, multi-layer perceptrons (MLPs) widely used in modern AI applications suffer from limited real-time performance due to intensive memory access overhead. Kolmogorov--Arnold Networks (KANs) have attracted increasing attention as an alternative architecture with similar structures to MLPs but improved parameter efficiency. However, the lack of dedicated hardware support limits the practical performance benefits of KANs. Moreover, since many edge workloads still rely heavily on MLPs, accelerators designed exclusively for KANs become inefficient and impractical. In this work, we present VIKIN, a reconfigurable accelerator that efficiently supports both KAN and MLP inference using unified hardware. VIKIN introduces a pipeline execution mode and two-stage sparsity support for efficient KAN processing, while enabling parallel-mode acceleration to improve MLP throughput under the same sparsity framework. Experiments on real-world datasets demonstrate that replacing MLPs with KANs on VIKIN achieves $1.28\times$ acceleration with $19.58\%$ reduced accuracy loss. For a higher-accuracy KAN model requiring $3.29\times$ more operations, VIKIN incurs only $1.24\times$ latency overhead compared with the baseline KAN model. In addition, VIKIN achieves $1.25\times$ speedup and $4.87\times$ higher energy efficiency than a representative edge GPU when executing KAN workloads.
title	VIKIN: A Reconfigurable Accelerator for KANs and MLPs with Two-Stage Sparsity Support
topic	Hardware Architecture
url	https://arxiv.org/abs/2603.01165

Similar Items