Machines flunking an exam: Evaluating large language models on course-related open questions | Synapse