Saved in:
Bibliographic Details
Main Authors: Hou, Yuntian, Ji, Tianrui, Zhang, Di, Stefanidis, Angelos
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.11075
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911151668854784
author Hou, Yuntian
Ji, Tianrui
Zhang, Di
Stefanidis, Angelos
author_facet Hou, Yuntian
Ji, Tianrui
Zhang, Di
Stefanidis, Angelos
contents Kolmogorov-Arnold Networks (KANs) have gained significant attention as an alternative to traditional multilayer perceptrons, with proponents claiming superior interpretability and performance through learnable univariate activation functions. However, recent systematic evaluations reveal substantial discrepancies between theoretical claims and empirical evidence. This critical assessment examines KANs' actual performance across diverse domains using fair comparison methodologies that control for parameters and computational costs. Our analysis demonstrates that KANs outperform MLPs only in symbolic regression tasks, while consistently underperforming in machine learning, computer vision, and natural language processing benchmarks. The claimed advantages largely stem from B-spline activation functions rather than architectural innovations, and computational overhead (1.36-100x slower) severely limits practical deployment. Furthermore, theoretical claims about breaking the "curse of dimensionality" lack rigorous mathematical foundation. We systematically identify the conditions under which KANs provide value versus traditional approaches, establish evaluation standards for future research, and propose a priority-based roadmap for addressing fundamental limitations. This work provides researchers and practitioners with evidence-based guidance for the rational adoption of KANs while highlighting critical research gaps that must be addressed for broader applicability.
format Preprint
id arxiv_https___arxiv_org_abs_2407_11075
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Kolmogorov-Arnold Networks: A Critical Assessment of Claims, Performance, and Practical Viability
Hou, Yuntian
Ji, Tianrui
Zhang, Di
Stefanidis, Angelos
Machine Learning
Artificial Intelligence
Kolmogorov-Arnold Networks (KANs) have gained significant attention as an alternative to traditional multilayer perceptrons, with proponents claiming superior interpretability and performance through learnable univariate activation functions. However, recent systematic evaluations reveal substantial discrepancies between theoretical claims and empirical evidence. This critical assessment examines KANs' actual performance across diverse domains using fair comparison methodologies that control for parameters and computational costs. Our analysis demonstrates that KANs outperform MLPs only in symbolic regression tasks, while consistently underperforming in machine learning, computer vision, and natural language processing benchmarks. The claimed advantages largely stem from B-spline activation functions rather than architectural innovations, and computational overhead (1.36-100x slower) severely limits practical deployment. Furthermore, theoretical claims about breaking the "curse of dimensionality" lack rigorous mathematical foundation. We systematically identify the conditions under which KANs provide value versus traditional approaches, establish evaluation standards for future research, and propose a priority-based roadmap for addressing fundamental limitations. This work provides researchers and practitioners with evidence-based guidance for the rational adoption of KANs while highlighting critical research gaps that must be addressed for broader applicability.
title Kolmogorov-Arnold Networks: A Critical Assessment of Claims, Performance, and Practical Viability
topic Machine Learning
Artificial Intelligence
url https://arxiv.org/abs/2407.11075