What question did this study set out to answer?

This research aims to evaluate the effectiveness of AI-generated feedback against teacher feedback in Medical English essays.

April 11, 2026Open Access

Comparing LLM-Generated and Teacher Feedback on Medical Esp Writing

Key Points

This research aims to evaluate the effectiveness of AI-generated feedback against teacher feedback in Medical English essays.
Compared feedback from teachers and two types of AI prompting (minimalist and structured) on 50 Medical English essays.
Analyzed the detail and coverage of feedback using error taxonomy and comment metrics.
Explored the qualitative differences in focus areas between human and AI feedback.
Structured prompting led to more detailed and extensive feedback with higher comment counts.
Minimalist prompting was limited to surface-level issues.
Teacher feedback emphasized coherence and higher-order writing concerns, indicating a low overlap with AI feedback.

Abstract

This conference paper explores the use of large language models (LLMs) in writing assessment within Medical English and English for Specific Purposes (ESP) contexts. The study compares teacher feedback with AI-generated feedback on 50 Medical English essays, focusing on the role of prompt engineering in shaping feedback quality. Three conditions were analysed: teacher feedback, AI-minimalist prompting, and AI-structured prompting. Results show that structured prompt engineering produces more detailed, extensive, and systematic feedback, with higher comment counts, longer responses, and near-complete coverage of an error taxonomy. In contrast, minimalist prompting focuses on surface-level issues, while teacher feedback prioritises coherence and higher-order writing concerns. Low overlap between human and AI feedback highlights their complementary roles in writing assessment. The findings contribute to ongoing research on AI in education, demonstrating how LLM-generated feedback can support scalable, efficient, and systematic evaluation in Medical English and ESP, while reinforcing the importance of human expertise in pedagogically sensitive contexts.

Comparing LLM-Generated and Teacher Feedback on Medical Esp Writing

Key Points

Abstract

Cite This Study