Key points are not available for this paper at this time.
This paper introduces Xpikeformer, a hybrid analog-digital hardware architecture designed to accelerate spiking neural network (SNN) -based transformer models. By combining the energy efficiency and temporal dynamics of SNNs with the powerful sequence modeling capabilities of transformers, Xpikeformer leverages mixed analog-digital computing techniques to enhance performance and energy efficiency. The architecture integrates analog in-memory computing (AIMC) for feedforward and fully connected layers, and a stochastic spiking attention (SSA) engine for efficient attention mechanisms. We detail the design, implementation, and evaluation of Xpikeformer, demonstrating significant improvements in energy consumption and computational efficiency. Through an image classification task and a wireless communication symbol detection task, we show that Xpikeformer can achieve software-comparable inference accuracy. Energy evaluations reveal that Xpikeformer achieves up to a 17. 8--19. 2 reduction in energy consumption compared to state-of-the-art digital ANN transformers and up to a 5. 9--6. 8 reduction compared to fully digital SNN transformers. Xpikeformer also achieves a 12. 0 speedup compared to the GPU implementation of spiking transformers.
Building similarity graph...
Analyzing shared references across papers
Loading...
Song et al. (Fri,) studied this question.
www.synapsesocial.com/papers/68e5c0e5b6db643587558602 — DOI: https://doi.org/10.48550/arxiv.2408.08794
Zihang Song
Prabodh Katti
Osvaldo Simeone
Building similarity graph...
Analyzing shared references across papers
Loading...
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: