Evaluating large language models as agents in the clinic | Synapse