Evaluating large language models for evidence-based clinical question answering | Synapse