Key points are not available for this paper at this time.
Abstract Recent advancements in large language models (LLMs) have shown potential in enhancing educational practices, particularly in technology-assisted learning environments. This study critically evaluates the reasoning capabilities of LLMs, such as ChatGPT, within the context of chemistry education. We designed targeted adversarial prompts that challenge the models to solve complex chemistry problems and assessed their performance. By pushing the boundaries of LLM reasoning, we aim to identify their limitations and strengths in handling queries within the chemistry domain. Our findings expose inherent weaknesses in current AI systems, emphasizing the necessity of cautious AI deployment in teaching methodologies. We argue for a balanced approach, leveraging the benefits of LLMs while mitigating their limitations, to facilitate their seamless adoption in education.
Building similarity graph...
Analyzing shared references across papers
Loading...
Suna-Şeyma Uçar
University of the Basque Country
Iñigo López-Gazpio
University of the Basque Country
Josu Lopez‐Gazpio
University of the Basque Country
Education and Information Technologies
University of the Basque Country
Building similarity graph...
Analyzing shared references across papers
Loading...
Uçar et al. (Fri,) studied this question.
synapsesocial.com/papers/6a0eff2653f874f2b2230d2a — DOI: https://doi.org/10.1007/s10639-024-13295-6
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: