Key points are not available for this paper at this time.
Recent generalizations of the Hopfield model of associative memories are able to store a number P of random patterns that grows exponentially with the number N of neurons, P=exp(αN). Besides the huge storage capacity, another interesting feature of these networks is their connection to the attention mechanism which is part of the Transformer architecture widely applied in deep learning. In this work, we study a generic family of pattern ensembles using a statistical mechanics analysis which gives exact asymptotic thresholds for the retrieval of a typical pattern, α1, and lower bounds for the maximum of the load α for which all patterns can be retrieved, αc, as well as sizes of attraction basins. We discuss in detail the cases of Gaussian and spherical patterns, and show that they display rich and qualitatively different phase diagrams.Received 26 July 2023Revised 29 November 2023Accepted 11 January 2024DOI:https://doi.org/10.1103/PhysRevLett.132.077301© 2024 American Physical SocietyPhysics Subject Headings (PhySH)Physical SystemsDisordered systemsSpin glassesTechniquesDeep learningMachine learningReplica theoryStatistical Physics & ThermodynamicsInterdisciplinary Physics
Building similarity graph...
Analyzing shared references across papers
Loading...
Carlo Lucibello
IMT School for Advanced Studies Lucca
Marc Mézard
Bocconi University
Physical Review Letters
Bocconi University
Building similarity graph...
Analyzing shared references across papers
Loading...
Lucibello et al. (Tue,) studied this question.
synapsesocial.com/papers/68e7941db6db643587705a56 — DOI: https://doi.org/10.1103/physrevlett.132.077301