What question did this study set out to answer?

The aim is to develop a transparent framework for distinguishing between AI-assisted and self-written essays in educational settings.

April 16, 2026Open Access

Explainable Machine Learning Framework for Evaluating Educational Validity and Academic Integrity in AI-Assisted Student Writing

Key Points

The aim is to develop a transparent framework for distinguishing between AI-assisted and self-written essays in educational settings.
Developed an explainable machine learning framework using Random Forest and fine-tuned BERT.
Utilized SHAP to interpret model predictions and identify key features.
Analyzed a dataset of 960 essays from varying education levels.
Achieved 91.8% accuracy (F1 = 0.895) with Random Forest and 90.2% accuracy (F1 = 0.885) with BERT.
Critical thinking depth and semantic originality emerged as key features for differentiation.
AI tools were found to improve linguistic fluency but reduce reasoning depth and creative variability.

Abstract

The widespread use of generative artificial intelligence (AI) tools such as ChatGPT, GrammarlyGO, and Quillbot across education sectors has challenged the authenticity of student writing and the validity of academic assessment. Current evaluation systems largely depend on opaque plagiarism detectors or automated scoring models that lack interpretability. This study proposes an explainable machine learning (ML) framework integrating Random Forest, fine-tuned BERT, and SHapley Additive Explanations (SHAP) to distinguish AI-assisted from self-written essays transparently. A dataset of 960 essays from students across different education levels was analyzed. The proposed framework achieved 91.8% accuracy (F1 = 0.895) with Random Forest and 90.2% accuracy (F1 = 0.885) with BERT. SHAP analysis revealed that critical thinking depth and semantic originality were the most influential dimensions differentiating AI-generated from human writing. The results indicate that AI tools enhance linguistic fluency yet reduce reasoning depth and creative variation. This study contributes a transparent, interpretable, and pedagogically aligned approach for detecting AI involvement in education. The framework provides practical guidance for educators and policymakers to ensure responsible AI integration while safeguarding educational validity and academic integrity

Explainable Machine Learning Framework for Evaluating Educational Validity and Academic Integrity in AI-Assisted Student Writing

Key Points

Abstract

Cite This Study