Saved in:
Bibliographic Details
Main Author: Bagrow, James
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.02190
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917457281679360
author Bagrow, James
author_facet Bagrow, James
contents Kolmogorov-Arnold networks (KANs) offer a potent combination of accuracy and interpretability, thanks to their compositions of learnable univariate activation functions. However, the activations of well-fitting KANs tend to exhibit pathologically high-curvature oscillations, making them difficult to interpret, and standard regularization penalties do not prevent this. Here we derive a basis-agnostic curvature penalty and show that penalized models can maintain accuracy while achieving substantially smoother activations. Accounting for how function composition shapes curvature, we prove an upper bound on the full model's curvature relative to the curvature penalty, and use this to motivate richer forms of penalties. Scientific machine learning is increasingly bottlenecked by the trade-off between accuracy and interpretability. Results such as ours that improve interpretability without sacrificing accuracy will further strengthen KANs as a practical tool for both prediction and insight.
format Preprint
id arxiv_https___arxiv_org_abs_2605_02190
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle KANs need curvature: penalties for compositional smoothness
Bagrow, James
Machine Learning
Data Analysis, Statistics and Probability
Kolmogorov-Arnold networks (KANs) offer a potent combination of accuracy and interpretability, thanks to their compositions of learnable univariate activation functions. However, the activations of well-fitting KANs tend to exhibit pathologically high-curvature oscillations, making them difficult to interpret, and standard regularization penalties do not prevent this. Here we derive a basis-agnostic curvature penalty and show that penalized models can maintain accuracy while achieving substantially smoother activations. Accounting for how function composition shapes curvature, we prove an upper bound on the full model's curvature relative to the curvature penalty, and use this to motivate richer forms of penalties. Scientific machine learning is increasingly bottlenecked by the trade-off between accuracy and interpretability. Results such as ours that improve interpretability without sacrificing accuracy will further strengthen KANs as a practical tool for both prediction and insight.
title KANs need curvature: penalties for compositional smoothness
topic Machine Learning
Data Analysis, Statistics and Probability
url https://arxiv.org/abs/2605.02190