Using Automated Procedures to Score Educational Essays Written in Three Languages

Key Points

Key points are not available for this paper at this time.

Abstract

Abstract The purpose of this study is to describe and evaluate a multilingual automated essay scoring (AES) system for grading essays in three languages. Two different sentence embedding models were evaluated within the AES system, multilingual BERT (mBERT) and language‐agnostic BERT sentence embedding (LaBSE). German, Italian, and Czech essays were holistically scored using the Common European Framework of Reference of Languages. The AES system with mBERT produced results that were consistent with human raters overall across all three language groups. The system also produced accurate predictions for some but not all of the score levels within each language. The AES system with LaBSE produced results that were even more consistent with the human raters overall across all three language groups compared to mBERT. In addition, the system produced accurate predictions for the majority of the score levels within each language. The performance differences between mBERT and LaBSE can be explained by considering how each language embedding model is implemented. Implications of this study for educational testing are also discussed.

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Authors

Tahereh Firoozi

University of Alberta

Hamid Mohammadi

University of Alberta

Mark J. Gierl

University of Alberta

Journals

Journal of Educational Measurement

Actions

Institutions

University of Alberta

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Firoozi et al. (Mon,) studied this question.

synapsesocial.com/papers/68e5f73ab6db64358758bd5b — DOI: https://doi.org/10.1111/jedm.12406

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

The Routledge International Handbook of Automated Essay Evaluation· 2024 · 19 citations
Recurrent Layer Aggregation using LSTM· 2019 · 2 citations
Using Learning Analytics within an e-Assessment Platform for a TransFormative Evaluation in Bilingual Contexts· 2019 · 3 citations
MizAR 60 for Mizar 50· 2023 · 76,144 citations
The Routledge Handbook of Second Language Acquisition and Corpora· 2020 · 122 citations

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

The Routledge International Handbook of Automated Essay Evaluation· 2024 · 19 citations
Recurrent Layer Aggregation using LSTM· 2019 · 2 citations
Using Learning Analytics within an e-Assessment Platform for a TransFormative Evaluation in Bilingual Contexts· 2019 · 3 citations
MizAR 60 for Mizar 50· 2023 · 76,144 citations
The Routledge Handbook of Second Language Acquisition and Corpora· 2020 · 122 citations

Using Automated Procedures to Score Educational Essays Written in Three Languages

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Authors

Journals

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Also consider