Key points are not available for this paper at this time.
Large Language Models (LLMs) face documented challenges in solving mathematical problems. While substantial work has been done to quantify and improve LLMs’ abilities to solve static math problems, evaluating their performance in real-time math tutoring scenarios presents distinct challenges that remain underexplored. This paper specifically addresses the accuracy of LLMs in performing math correctly while tutoring students. It highlights the unique difficulties of this context, classifies types of interactions students may have with an LLM, presents a dataset, Conversation-Based Math Tutoring Accuracy Dataset (CoMTA Dataset), for evaluating the mathematical accuracy in tutoring scenarios, and discusses techniques to address these issues. Additionally, it evaluates the mathematical accuracy of a range of models in LLM-based tutoring.
Building similarity graph...
Analyzing shared references across papers
Loading...
Miller et al. (Wed,) studied this question.
www.synapsesocial.com/papers/68e617f5b6db6435875aa38f — DOI: https://doi.org/10.35542/osf.io/5zwv3
Pepper Miller
Kristen E. DiCerbo
Building similarity graph...
Analyzing shared references across papers
Loading...