Transformers models have significantly changed many areas of Machine Learning, due to their structure and extensive number of parameters, which enables them to capture complex patterns in data. Building on this foundation, an energy-based version, the Energy Transformer (ET), has emerged as a powerful variation, achieving parameter efficiency without sacrificing performances. In this work, we introduce an innovative evolution of the ET model: the ET-KAN architecture. By integrating a Kolmogorov–Arnold Network (KAN) in place of the energy function, our model generalizes the ET as structural design, unlocking enhanced learning capabilities. We demonstrate the potential of this new architecture through an image reconstruction task, where it achieves comparable or higher results respect to the ET (Loss 0. 08 when covering more than half of the image), outperforming the standard ET architecture while using fewer parameters. The model hereby constructed paves the way to a deeper investigation of the interplay between KAN and energy-based models, for addressing some of the key limitations of traditional transformers.
Building similarity graph...
Analyzing shared references across papers
Loading...
Marullo et al. (Wed,) studied this question.
synapsesocial.com/papers/69abc1235af8044f7a4e9bfe — DOI: https://doi.org/10.1007/s40747-026-02233-3
Chiara Marullo
Istituto Nazionale di Alta Matematica Francesco Severi
Giuseppe Buonaiuto
Universidad Francisco de Vitoria
Francesco Gargiulo
Institute for High Performance Computing and Networking
Complex & Intelligent Systems
Institute for High Performance Computing and Networking
Istituto Nazionale di Alta Matematica Francesco Severi
Building similarity graph...
Analyzing shared references across papers
Loading...
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: