What question did this study set out to answer?

The aim is to review empirical studies on the integration of open-source large language models in education and their impact on learning.

February 28, 2026Open Access

Open-Source Large Language Models in Education: A Narrative Review of Evidence, Pedagogical Roles, and Learning Outcomes

Key Points

The aim is to review empirical studies on the integration of open-source large language models in education and their impact on learning.
Conducted a narrative review of peer-reviewed studies on open-source LLMs in education.
Focused on their deployment in instructional contexts, learner evidence, and teacher engagement.
Developed a four-role model describing how teachers interact with AI in educational settings.
Positive learner perceptions of open-source LLMs were reported, though evidence of measurable learning outcomes was inconsistent.
The four-role model includes roles of Designer, Facilitator, Monitor, and Evaluator for teachers.
Identified orchestration dimensions and tensions in implementing LLMs in higher education contexts.

Abstract

Open-source large language models (LLMs) are increasingly explored in educational contexts due to their transparency, adaptability, and alignment with institutional governance and equity considerations. Despite growing interest, empirical research on how open-source LLMs are deployed in education and what evidence currently supports their integration remains limited and fragmented. This paper presents a state-of-the-art narrative review of peer-reviewed, human empirical studies examining the use of open-source LLMs in education. Guided by three questions, the review synthesizes how open-source LLMs are deployed across instructional contexts, what learner-related evidence is reported, and how teachers engage in human–AI collaboration. The reviewed literature is concentrated in higher education, particularly within computer science and programming domains, with applications focused on post-class tutoring, guidance, and formative feedback. Learner perceptions are generally positive, but evidence linking open-source LLM use to measurable learning outcomes remains emerging and inconsistent. Through interpretive synthesis, the review articulates a four-role model—Designer, Facilitator, Monitor, and Evaluator—that captures how teacher agency is enacted across AI-supported instructional workflows. This review maps recurring orchestration dimensions, decision points, and tensions that characterize early implementations, and it proposes a minimal orchestration reporting scaffold (configuration, boundaries, logging, adjudication) intended to support auditability and cross-study comparison as the empirical base develops.

Read Full Paperexternally

Bookmark

View Full Paper

Cite This Study

Lin et al. (Fri,) studied this question.

synapsesocial.com/papers/69a287f20a974eb0d3c03c68 https://doi.org/https://doi.org/10.3390/aieduc2010004

Bookmark

View Full Paper