May 14, 2026 | Open Access

An Integrated RAG and Agent-Based Architecture for Automated Assessment in Moodle

Key Points

The study develops and evaluates an architecture that combines RAG and AI agents for automated assessment in Moodle.
A Moodle instance was deployed for the experiment, with 32 students in Bulgarian- and English-language sections.
Data were collected at the student (N = 32) and task (N = 160) levels, including AI-generated scores, instructor-assigned scores, and system processing logs.
A bias analysis assessed whether grading stayed objective across the two language groups.
The proposed system substantially reduced grading time while maintaining high agreement with expert assessments.
The analysis showed minimal systematic deviation across language groups, indicating no consistent over- or underestimation based on language.
The findings offer design guidelines that may carry over to other educational platforms beyond Moodle.

Abstract

The growing adoption of Generative AI in education has created opportunities to automate complex pedagogical tasks, yet reliably and scalably assessing open-ended responses remains a challenge. This study proposes and evaluates an architectural solution for integrating a Large Language Model (LLM) into Moodle, combining Retrieval-Augmented Generation (RAG) and AI agent mechanisms to enable automated grading of open-ended student responses. A Moodle instance was deployed for experimental purposes, with 32 students across Bulgarian- and English-language sections, yielding data at the student (N = 32) and task (N = 160) levels, including AI-generated and instructor-assigned scores and system processing logs. The results demonstrate that the proposed system achieves substantial reductions in grading time while maintaining high agreement with expert assessments. Bias analysis revealed minimal systematic deviation across both language groups, indicating that the system preserves assessment objectivity without consistent over- or underestimation based on language. These findings suggest that a combined RAG and agentic LLM architecture can deliver efficient, accurate, and linguistically robust automated assessment within an LMS environment, offering practical design guidelines applicable to other educational platforms and similar systems.
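
The abstract does not say which statistics underlie the agreement and bias analyses. As a minimal sketch of one common approach, the Python below computes the mean signed difference between AI and instructor scores per language group (a value near zero suggests no consistent over- or underestimation) and the mean absolute error as a rough agreement measure. The function names, the record layout, and the scores are hypothetical illustrations, not the study's data or method.

```python
from collections import defaultdict


def bias_by_group(records):
    """Mean signed difference (AI score minus instructor score) per group.

    `records` is an iterable of (group, ai_score, instructor_score) tuples.
    A positive value means the AI tends to score higher than the instructor.
    """
    diffs = defaultdict(list)
    for group, ai_score, instructor_score in records:
        diffs[group].append(ai_score - instructor_score)
    return {group: sum(d) / len(d) for group, d in diffs.items()}


def mean_absolute_error(records):
    """Average absolute gap between AI and instructor scores over all tasks."""
    gaps = [abs(ai - instructor) for _, ai, instructor in records]
    return sum(gaps) / len(gaps)


# Hypothetical task-level scores, for illustration only (not study data).
records = [
    ("bg", 8, 8), ("bg", 6, 7), ("bg", 9, 8), ("bg", 7, 7),
    ("en", 7, 7), ("en", 5, 4), ("en", 9, 9), ("en", 6, 6),
]

print(bias_by_group(records))        # {'bg': 0.0, 'en': 0.25}
print(mean_absolute_error(records))  # 0.375
```

The two measures answer different questions: errors of opposite sign cancel in the signed difference, so a group can show zero bias while still disagreeing with the instructor on individual tasks, which is why an absolute-error or correlation measure is usually reported alongside it.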
