What question did this study set out to answer?

The aim is to compare the quality of translations produced by various large language models against established benchmarks.

May 8, 2026

Assessing the Quality of Large Language Models in Machine Translation Tasks

Key Points

Comparative evaluation of translations from six large language models: DeepSeek, Grok, Mistral, Qwen, GigaChat, and Yandex.
Assessment of translation quality using both quantitative metrics (BLEU, METEOR, chrF) and qualitative expert analysis.
Expert analysis applies the criteria of adequacy, equivalence, and harmony, with Google Translate as an additional point of comparison.
Modern large language models outperform classical machine translation, addressing traditional challenges effectively.
LLMs demonstrate significant capability in translating expressive linguistic elements such as phraseologisms and puns.
Expert evaluation highlights improved adequacy and equivalence in translations by LLMs compared to reference translations.

Abstract

This article presents a comparative evaluation of machine translation quality across several large language models (LLMs), i.e., DeepSeek, Grok, Mistral, Qwen, GigaChat, and Yandex, based on translations of expressive linguistic means (phraseologisms, homonyms, puns, etc.) and texts of various functional styles. Translation quality is assessed quantitatively using coherence metrics (BLEU, METEOR, and chrF) and qualitatively through expert analysis based on adequacy, equivalence, and harmony criteria against reference translations, with additional comparison to Google Translate. The findings demonstrate that modern LLMs can overcome classical machine translation challenges and represent a new paradigm for developing human–AI hybrid systems.
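For readers unfamiliar with the character-level metrics named in the abstract, the following is a minimal sketch of a chrF-style score: character n-gram precision and recall averaged over n-gram orders and combined into an F-beta value. It is a simplified illustration, not the official chrF implementation and not the evaluation code used in the study. The function names `char_ngrams` and `simple_chrf`, the defaults `max_n=6` and `beta=2.0`, and the example sentences are assumptions made for this sketch.

```python
from collections import Counter


def char_ngrams(text: str, n: int) -> Counter:
    """Count character n-grams, ignoring whitespace (a simplification)."""
    chars = "".join(text.split())
    return Counter(chars[i:i + n] for i in range(len(chars) - n + 1))


def simple_chrf(hypothesis: str, reference: str,
                max_n: int = 6, beta: float = 2.0) -> float:
    """Simplified chrF-style score in [0, 1].

    Averages character n-gram precision and recall over orders 1..max_n,
    then combines them as an F-beta score (beta > 1 favors recall).
    """
    precisions, recalls = [], []
    for n in range(1, max_n + 1):
        hyp = char_ngrams(hypothesis, n)
        ref = char_ngrams(reference, n)
        if not hyp or not ref:
            continue  # one of the strings is shorter than n characters
        # Clipped overlap: each n-gram counts at most as often as in both texts.
        overlap = sum((hyp & ref).values())
        precisions.append(overlap / sum(hyp.values()))
        recalls.append(overlap / sum(ref.values()))
    if not precisions:
        return 0.0
    p = sum(precisions) / len(precisions)
    r = sum(recalls) / len(recalls)
    if p + r == 0:
        return 0.0
    return (1 + beta ** 2) * p * r / (beta ** 2 * p + r)


if __name__ == "__main__":
    # Hypothetical example: an idiom rendered idiomatically vs. literally.
    reference = "he passed away last year"
    idiomatic = "he passed away a year ago"
    literal = "he kicked the bucket last year"
    print(simple_chrf(idiomatic, reference))
    print(simple_chrf(literal, reference))
```

A score of this kind rewards surface overlap with the reference only, which is one reason an evaluation of idioms and puns would pair it with expert judgment, as the abstract describes.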

Cite This Study

Мыльникова et al. studied this question.

synapsesocial.com/papers/69fd7d94bfa21ec5bbf05f70 https://doi.org/10.3103/s0005105526700020
