Performance of large language models on neuroanatomy-based medical riddles: a comparative study | Synapse