Comparing ChatGPT and DeepSeek for Assessment of Multiple-Choice Questions in Orthopedic Medical Education: Cross-Sectional Study

Key Points

Key points are not available for this paper at this time.

Abstract

ChatGPT outperformed DeepSeek in correctness and response time, demonstrating its efficiency in evaluating orthopedic MCQs. This high reliability suggests its potential for integration into medical assessments. However, our results indicate that some MCQs will require revisions by instructors to improve their clarity. Further studies are needed to evaluate the role of artificial intelligence in other disciplines and to validate other LLMs.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Anusitviwat et al. (Fri,) studied this question.

synapsesocial.com/papers/69dba1705b363cdf1c835ae5 https://doi.org/https://doi.org/10.2196/75607

Bookmark

View Full Paper