A Dataset for Evaluating Large Language Models on Chinese National Medical Licensing Examinations | Synapse