Key points are not available for this paper at this time.
Multiple-choice questions (MCQs) are commonly used across all levels of math education since they can be deployed and graded at a large scale. A critical component of MCQs is the distractors, i.e., incorrect answers crafted to reflect student errors or misconceptions. Automatically generating them in math MCQs, e.g., with large language models, has been challenging. In this work, we propose a novel method to enhance the quality of generated distractors through overgenerate-and-rank, training a ranking model to predict how likely distractors are to be selected by real students. Experimental results on a real-world dataset and human evaluation with math teachers show that our ranking model increases alignment with human-authored distractors, although human-authored ones are still preferred over generated ones.
Building similarity graph...
Analyzing shared references across papers
Loading...
Scarlatos et al. (Thu,) studied this question.
www.synapsesocial.com/papers/68e6eab8b6db64358766585d — DOI: https://doi.org/10.48550/arxiv.2405.05144
Alexander Scarlatos
Wanyong Feng
Digory Smith
Building similarity graph...
Analyzing shared references across papers
Loading...
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: