January 3, 2025Open Access

Evaluating and challenging the reasoning capabilities of generative artificial intelligence for technology-assisted chemistry education

Key Points

Key points are not available for this paper at this time.

Abstract

Abstract Recent advancements in large language models (LLMs) have shown potential in enhancing educational practices, particularly in technology-assisted learning environments. This study critically evaluates the reasoning capabilities of LLMs, such as ChatGPT, within the context of chemistry education. We designed targeted adversarial prompts that challenge the models to solve complex chemistry problems and assessed their performance. By pushing the boundaries of LLM reasoning, we aim to identify their limitations and strengths in handling queries within the chemistry domain. Our findings expose inherent weaknesses in current AI systems, emphasizing the necessity of cautious AI deployment in teaching methodologies. We argue for a balanced approach, leveraging the benefits of LLMs while mitigating their limitations, to facilitate their seamless adoption in education.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Suna-Şeyma Uçar

University of the Basque Country

Iñigo López-Gazpio

University of the Basque Country

Josu Lopez‐Gazpio

University of the Basque Country

Journals

Education and Information Technologies

Actions

Institutions

University of the Basque Country

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Uçar et al. (Fri,) studied this question.

synapsesocial.com/papers/6a0eff2653f874f2b2230d2a — DOI: https://doi.org/10.1007/s10639-024-13295-6

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Evaluating Academic Answers Generated Using ChatGPT· 2023 · 283 citations
Critical Thinking and Education· 1982 · 426 citations
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering· 2022 · 214 citations
Was This Title Generated by ChatGPT? Considerations for Artificial Intelligence Text-Generation Software Programs for Chemists and Chemistry Educators· 2023 · 149 citations
MoleculeNet: a benchmark for molecular machine learning· 2017 · 2,924 citations

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Evaluating Academic Answers Generated Using ChatGPT· 2023 · 283 citations
Critical Thinking and Education· 1982 · 426 citations
Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering· 2022 · 214 citations
Was This Title Generated by ChatGPT? Considerations for Artificial Intelligence Text-Generation Software Programs for Chemists and Chemistry Educators· 2023 · 149 citations
MoleculeNet: a benchmark for molecular machine learning· 2017 · 2,924 citations

Evaluating and challenging the reasoning capabilities of generative artificial intelligence for technology-assisted chemistry education

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Also consider