April 20, 2026 | Open Access

AI Chatbots in Higher Education: Comparing Expectations to Evidence

Key Points

This research evaluates the effect of genAI chatbots on learning success and academic achievement in higher education.
The authors conducted a randomized controlled field experiment involving around 500 undergraduate students.
They assessed the impact of a genAI chatbot on interest, self-efficacy, engagement, and academic achievement.
They used pre- and post-treatment surveys and tests over a semester, across in-person and asynchronous online modalities.
No statistically significant impact of the genAI chatbot was found on any measured outcome related to learning success.
Findings challenge existing assumptions about the instructional effectiveness of AI in educational settings.

Abstract

Given the rapid advancements, and notable failures, in large language model generative AI (genAI), there are elevated expectations that retrieval-augmented generation (RAG) AI chatbots will revolutionize higher education by offering individualized, always-available tutoring based on validated content. However, experimental evidence on their effectiveness remains scarce. Using a randomized controlled field experiment, this study examines the effects of a genAI chatbot on key precursors to learning success (i.e., interest, self-efficacy, and engagement) and academic achievement for ≈500 undergraduate students across two modalities (in-person and asynchronous online). We completed a semester-long controlled experiment with pre- and post-treatment surveys and tests. Despite expectations, we found the genAI chatbot had no statistically significant impact on any measured outcome. These early results challenge assumptions about AI’s instructional effectiveness and suggest universities should further investigate the pedagogical value of AI chatbots before making substantial investments or committing to long-term contracts. We recommend future research to increase the generalizability of the findings and to discover methods to improve the efficacy of AI chatbots in higher education.
