Key points are not available for this paper at this time.
Limited by the complexity of basis function (B-spline) calculations, Kolmogorov-Arnold Networks (KAN) suffer from restricted parallel computing capability on GPUs. This paper proposes a novel ReLU-KAN implementation that inherits the core idea of KAN. By adopting ReLU (Rectified Linear Unit) and point-wise multiplication, we simplify the design of KAN's basis function and optimize the computation process for efficient CUDA computing. The proposed ReLU-KAN architecture can be readily implemented on existing deep learning frameworks (e. g. , PyTorch) for both inference and training. Experimental results demonstrate that ReLU-KAN achieves a 20x speedup compared to traditional KAN with 4-layer networks. Furthermore, ReLU-KAN exhibits a more stable training process with superior fitting ability while preserving the "catastrophic forgetting avoidance" property of KAN. You can get the code in https: //github. com/quiqi/reluₖan
Building similarity graph...
Analyzing shared references across papers
Loading...
Qiu et al. (Tue,) studied this question.
www.synapsesocial.com/papers/68e6634ab6db6435875efc17 — DOI: https://doi.org/10.48550/arxiv.2406.02075
Qi Qiu
Tao Zhu
Helin Gong
Building similarity graph...
Analyzing shared references across papers
Loading...