What question did this study set out to answer?

This research aims to assess how asymmetries in data and model architecture influence multilingual translation performance in LLMs.

April 28, 2026Open Access

LingualX64: a multilingual benchmark for evaluating symmetry and asymmetry in LLM translation

Key Points

This research aims to assess how asymmetries in data and model architecture influence multilingual translation performance in LLMs.
Introduced LingualX64, a dataset covering 64 languages with minimized overlap with LLM training data.
Evaluated LLM translation performance under zero-shot conditions to assess disparities across languages.
Balanced representation of linguistic features to enhance cross-linguistic generalization assessments.
Identified significant performance disparities across languages linked to data scarcity and linguistic complexity.
Observed that existing LLMs demonstrate varied translation quality based on underlying data asymmetries.
Revealed the necessity for interventions to address these asymmetries to enhance multilingual translation effectiveness.

Abstract

Large Language Models (LLMs) have revolutionized Natural Language Processing, including machine translation (MT), achieving unprecedented performance. However, this progress masks underlying asymmetries in training data and model architecture that impact multilingual translation quality. This paper introduces LingualX64, a novel dataset spanning 64 languages, designed to evaluate the extent to which these asymmetries affect LLM translation performance, particularly under zero-shot conditions. LingualX64 is constructed to minimize data overlap with existing LLM training corpora and to provide a balanced representation of diverse linguistic features, enabling a more robust assessment of cross-linguistic generalization. Our evaluation reveals significant performance disparities across languages, highlighting the impact of data scarcity and linguistic complexity on translation quality. These findings underscore the need for strategies to mitigate asymmetries in LLM training and model design to achieve more equitable and robust multilingual translation capabilities. LingualX64 provides a valuable benchmark for researchers and developers seeking to address these challenges and unlock the full potential of LLMs for global communication.

Bookmark

View Full Paper

Bookmark

View Full Paper

LingualX64: a multilingual benchmark for evaluating symmetry and asymmetry in LLM translation

Key Points

Abstract

Cite This Study