Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Hou, Yuntian, Ji, Tianrui, Zhang, Di, Stefanidis, Angelos
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2407.11075
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911151668854784
author	Hou, Yuntian Ji, Tianrui Zhang, Di Stefanidis, Angelos
author_facet	Hou, Yuntian Ji, Tianrui Zhang, Di Stefanidis, Angelos
contents	Kolmogorov-Arnold Networks (KANs) have gained significant attention as an alternative to traditional multilayer perceptrons, with proponents claiming superior interpretability and performance through learnable univariate activation functions. However, recent systematic evaluations reveal substantial discrepancies between theoretical claims and empirical evidence. This critical assessment examines KANs' actual performance across diverse domains using fair comparison methodologies that control for parameters and computational costs. Our analysis demonstrates that KANs outperform MLPs only in symbolic regression tasks, while consistently underperforming in machine learning, computer vision, and natural language processing benchmarks. The claimed advantages largely stem from B-spline activation functions rather than architectural innovations, and computational overhead (1.36-100x slower) severely limits practical deployment. Furthermore, theoretical claims about breaking the "curse of dimensionality" lack rigorous mathematical foundation. We systematically identify the conditions under which KANs provide value versus traditional approaches, establish evaluation standards for future research, and propose a priority-based roadmap for addressing fundamental limitations. This work provides researchers and practitioners with evidence-based guidance for the rational adoption of KANs while highlighting critical research gaps that must be addressed for broader applicability.
format	Preprint
id	arxiv_https___arxiv_org_abs_2407_11075
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Kolmogorov-Arnold Networks: A Critical Assessment of Claims, Performance, and Practical Viability Hou, Yuntian Ji, Tianrui Zhang, Di Stefanidis, Angelos Machine Learning Artificial Intelligence Kolmogorov-Arnold Networks (KANs) have gained significant attention as an alternative to traditional multilayer perceptrons, with proponents claiming superior interpretability and performance through learnable univariate activation functions. However, recent systematic evaluations reveal substantial discrepancies between theoretical claims and empirical evidence. This critical assessment examines KANs' actual performance across diverse domains using fair comparison methodologies that control for parameters and computational costs. Our analysis demonstrates that KANs outperform MLPs only in symbolic regression tasks, while consistently underperforming in machine learning, computer vision, and natural language processing benchmarks. The claimed advantages largely stem from B-spline activation functions rather than architectural innovations, and computational overhead (1.36-100x slower) severely limits practical deployment. Furthermore, theoretical claims about breaking the "curse of dimensionality" lack rigorous mathematical foundation. We systematically identify the conditions under which KANs provide value versus traditional approaches, establish evaluation standards for future research, and propose a priority-based roadmap for addressing fundamental limitations. This work provides researchers and practitioners with evidence-based guidance for the rational adoption of KANs while highlighting critical research gaps that must be addressed for broader applicability.
title	Kolmogorov-Arnold Networks: A Critical Assessment of Claims, Performance, and Practical Viability
topic	Machine Learning Artificial Intelligence
url	https://arxiv.org/abs/2407.11075

Similar Items