How social reinforcement learning can lead to metastable polarisation and the voter model | Synapse