Context Matching is not Reasoning: Assessing Generalized Evaluation of Generative Language Models in Clinical Settings | Synapse