June 4, 2024Open Access

ReLU-KAN: New Kolmogorov-Arnold Networks that Only Need Matrix Addition, Dot Multiplication, and ReLU

Key Points

Key points are not available for this paper at this time.

Abstract

Limited by the complexity of basis function (B-spline) calculations, Kolmogorov-Arnold Networks (KAN) suffer from restricted parallel computing capability on GPUs. This paper proposes a novel ReLU-KAN implementation that inherits the core idea of KAN. By adopting ReLU (Rectified Linear Unit) and point-wise multiplication, we simplify the design of KAN's basis function and optimize the computation process for efficient CUDA computing. The proposed ReLU-KAN architecture can be readily implemented on existing deep learning frameworks (e. g. , PyTorch) for both inference and training. Experimental results demonstrate that ReLU-KAN achieves a 20x speedup compared to traditional KAN with 4-layer networks. Furthermore, ReLU-KAN exhibits a more stable training process with superior fitting ability while preserving the "catastrophic forgetting avoidance" property of KAN. You can get the code in https: //github. com/quiqi/reluₖan

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Qiu et al. (Tue,) studied this question.

www.synapsesocial.com/papers/68e6634ab6db6435875efc17 — DOI: https://doi.org/10.48550/arxiv.2406.02075

Authors

Qi Qiu

Tao Zhu

Helin Gong

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

ReLU-KAN: New Kolmogorov-Arnold Networks that Only Need Matrix Addition, Dot Multiplication, and ReLU

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion