Maynard Smith revisited: A multi-agent reinforcement learning approach to the coevolution of signalling behaviour | Synapse