Let large language models judge each other: multi-agent peer-reviewed reasoning for medical question answering | Synapse