The increasing demand for scalable and fair assessment of open-form responses in digital education shows the need for intelligent grading systems capable of balancing syntactic precision with semantic understanding. This study proposes a hybrid semantic–syntactic NLP framework for automated grading of short-answer and cloze-type questions. The framework integrates a rule-based matcher for syntactic accuracy, MPNet (Masked and Permuted Pre-trained Network) embeddings for semantic similarity, and a fine-tuned DeBERTa (Decoding-enhanced Bidirectional Encoder Representations from Transformer with Disentangled Attention) regressor for continuous score prediction, while a T5-small model provides pedagogically aligned feedback generation. Evaluations were conducted using benchmark corpora, synthetic cloze datasets, and a domain-specific short-answer corpus. Results demonstrate that the hybrid system outperforms traditional baselines, achieving 91% accuracy, a 0.89 F1 score, a mean absolute error of 0.36, and strong inter-rater agreement (κ = 0.87), aligning closely with human graders. Qualitative analyses show that the framework successfully recognizes paraphrased responses, assigns partial credit, and generates meaningful feedback. Ablation studies further validate the necessity of each subsystem, with performance significantly declining when components were removed. The findings confirm that the proposed framework is both computationally robust and pedagogically valuable, establishing a foundation for scalable, interpretable, and fair automated grading in contemporary educational environments.
Julius et al. (Thu,) studied this question.