What question did this study set out to answer?

This research aims to create a system that automatically generates and evaluates educational quizzes using advanced language models.

April 3, 2026Open Access

Automatic Quiz Generation and Evaluation System using Large Language Models with Distractor Optimization

Key Points

This research aims to create a system that automatically generates and evaluates educational quizzes using advanced language models.
Developed a multi-layered architecture for quiz generation and evaluation.
Implemented a novel distractor optimization module using semantic similarity techniques.
Utilized Bloom's Taxonomy for question generation across different cognitive levels.
Conducted experimental analysis of diversity and quality metrics.
Achieved a diversity score of 0.97 with a zero duplicate rate in generated questions.
Found a strong positive correlation (r = 0.96) between diversity and overall question quality.
Identified a plausibility-relevance trade-off (r = −0.73) for future enhancement.

Abstract

This paper presents an intelligent system for the automatic generation and evaluation of educational quizzes using Large Language Models (LLMs) with a novel distractor optimization module. The system employs a multi-layered architecture covering resource processing, topic extraction, question generation, and quality evaluation. A key contribution is the multi-stage distractor optimization pipeline, which uses semantic similarity techniques (TF-IDF, Sentence-BERT, cosine similarity) to ensure distractors are plausible, diverse, and non-redundant. Questions are generated across cognitive levels using Bloom's Taxonomy classification. Experimental results demonstrate a diversity score of 0.97 with zero duplicate rate. Correlation analysis reveals that diversity strongly predicts overall question quality (r = 0.96), and a plausibility-relevance trade-off (r = −0.73) is identified as a key direction for future improvement. The system is built using Flask, Celery, Redis, SQLAlchemy, and integrates LLM APIs with transformer-based semantic models for end-to-end quiz generation.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Ch.Sravanthi Sowdanya Ch.Sravanthi Sowdanya

T.Venkatesh T.Venkatesh

K.Yajath K.Yajath

Actions

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Automatic Quiz Generation and Evaluation System using Large Language Models with Distractor Optimization

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Actions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider